V-JEPA 2 - Meta introduces an AI world model
Meta released the world model Video Joint Embedding Predictive Architecture 2 (V-JEPA 2). A world model in machine learning is a way to show how an environment works, helping an AI or a robot to predict results and simulate interactions within it.

The details
- According to Meta, V-JEPA 2 delivers state-of-the-art performance in visual understanding and in the physical world. It is a self-supervised foundation model with 1.2 billion-parameter.
- The model was pretrained on more than 1 million hours of video and 1 million images. So it learned how objects move and interact in the physical world.
- Additionally, Meta has released three new benchmarks to evaluate world models. People score with an accuracy of 85% - 95% on these tests. Current world models struggle with the tasks.
- Meta’s goal is to achieve AMI (Advanced Machine Intelligence). According to Meta AI chief and Turing Award winner Yann LeCun, the new model is another step in this direction. Meta provides the results to the open-source community.
Our thoughts
World models are important in robotics. These models help a robot understand its surroundings and predict events, allowing it to navigate more effectively.
V-JEPA 2 is impressive, but it is still far from human capabilities. It can accurately predict and plan physical scenes, but it lacks the contextual decision-making, emotional intelligence, and general adaptability of a human.
More information: 🔗 Heise Online | Meta AI | Meta GitHub
Magic AI tool of the week
This week, we explain how you can generate professional short videos for free using OpenAI’s video generator, Sora. For this, you need to download Microsoft’s Bing app (available for iOS and Android).

Step-by-step guide:
- Download the Bing app to your smartphone. Then, select the apps tab in the bottom right corner.
- Click on the “Video Creator” icon. Then, write a detailed prompt describing your desired video.
- Generate your 5-second video (9:16 video format) and then download it.
If you are not completely satisfied, try refining your prompt further. That’s it!
Hand-picked articles
- An Introduction to Anthropic’s Model Context Protocol (MCP) with Python
- Understand and Implement an Artificial Neural Network from Scratch
- Mastering the Capital Asset Pricing Model (CAPM) Using Python
😀 Do you enjoy our content? If so, why not support us with a small financial contribution? This helps us fund our work to ensure we can stick around long-term.