Meta unveils V-JEPA 2: AI world model achieves state-of-the-art visual understanding, prediction, and zero-shot robot planning

Introducing the V-JEPA 2 world model and new benchmarks for physical reasoning

TakeawaysMeta Video Joint Embedding Predictive Architecture 2 (V-JEPA 2) is a world model that achieves state-of-the-art performance on visual understanding and prediction in the physical world. Our model can also be used for zero-shot robot planning to interact with unfamiliar objects in new environments.V-JEPA 2 represents our next step toward our goal of achieving advanced machine intelligence (AMI) and building useful AI agents that can operate in the physical world.We’re also releasing thre...