Section 01
Introduction: MolmoAct2, a Breakthrough in Real-World Deployment of Open VLA Models
MolmoAct2 is a fully open-source Vision-Language-Action (VLA) model developed by the Allen Institute for AI (Ai2) and designed specifically for real-world deployment. It rests on five core innovations: the MolmoER backbone network, three new datasets, the OpenFAST action tokenizer, a flow-matching continuous action expert architecture, and MolmoThink adaptive reasoning. Together, these let it outperform strong baselines such as Pi-05 across 7 simulation and real-world benchmarks, giving the robotics field an open and scalable research platform.