Technology NewsTechnology NewsTechnology News
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Reading: Ai2 Launches Molmo 2 to Improve Video Understanding in AI
Share
Font ResizerAa
Technology NewsTechnology News
Font ResizerAa
Search
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Follow US
  • Cookie Policy (EU)
  • Contact
  • About
© 2025 NEWSLINKER - Powered by LK SOFTWARE
AI

Ai2 Launches Molmo 2 to Improve Video Understanding in AI

Highlights

  • Ai2 released Molmo 2, an open multimodal AI for video understanding tasks.

  • Molmo 2 achieves strong results with fewer parameters and less training data.

  • Open datasets and technical resources are publicly available for research use.

Kaan Demirel
Last updated: 21 December, 2025 - 4:49 pm 4:49 pm
Kaan Demirel 3 hours ago
Share
SHARE

Contents
What Are Molmo 2’s Key Capabilities?How Does Molmo 2 Perform Against Open and Closed Models?What Resources and Access Does Ai2 Provide?

Researchers and developers seeking accessible and capable artificial intelligence models for video analysis may find new possibilities with the release of Molmo 2 by the Allen Institute for AI (Ai2). The organization says its latest multimodal suite brings precise spatial and temporal video understanding, supporting everything from robotics to automated surveillance, without relying on proprietary or closed systems. Molmo 2’s design offers public access to advanced tracking, event localization, and nuanced visual reasoning, features often locked behind paywalls. This move signals a shift toward more inclusive innovation and encourages wider collaboration, especially as multimodal AI becomes central to various technological pursuits.

Molmo 2 introduces notable advances compared to earlier models and recent announcements from Ai2. Previous reports highlighted the original Molmo’s capacity to handle video and multi-image tasks, but required significantly more parameters and training data. Now, Molmo 2, with only 8B parameters, reportedly achieves higher accuracy and broader capabilities—even outperforming larger models like Gemini 3 and requiring far less data than competitors such as Meta’s PerceptionLM. This progression underlines a growing emphasis on efficiency and reproducibility, with Ai2 maintaining its open-access ethos.

What Are Molmo 2’s Key Capabilities?

The new model offers skills such as pixel-level grounding, temporal alignment, multi-object tracking, and dense video captioning. According to Ai2, Molmo 2 can pinpoint events and object positions within complex scenes and sustain object identities despite occlusions or changing backgrounds.

“With a fraction of the data, Molmo 2 surpasses many frontier models on key video understanding tasks,”

explained Ali Farhadi, the CEO of Ai2. These functions make the system suitable for a range of applications within robotics, transportation, and industrial automation by increasing the potential for accurate real-world sensor processing.

How Does Molmo 2 Perform Against Open and Closed Models?

In benchmarks for short-video understanding, Molmo 2 competes neck-and-neck with large proprietary systems, delivering robust results on datasets such as MVBench, MotionQA, and NextQA. Ai2 claims its model doubles or even triples grounding accuracy over prior open models and matches or surpasses leading commercial solutions on tracking and pointing challenges. The system’s ability to provide detailed human-understandable video captions and detect anomalies further supports practical deployment across real-world environments.

What Resources and Access Does Ai2 Provide?

To foster transparency, Ai2 made the entire training dataset and technical report available to the community. The new release includes nine open datasets totaling over nine million annotated video and image examples. Users can choose between several Molmo 2 variants, such as the 4B, 7B (Molmo 2-O powered by Olmo), and 8B models, each tailored for specific types of reasoning and tracking. All assets are downloadable via major platforms including GitHub and Hugging Face, and an interactive playground is provided for immediate testing.

“We are excited to see the immense impact this model will have on the AI landscape, adding another piece to our fully open model ecosystem,”

Farhadi added.

Even as open multimodal AI models become more prevalent, Ai2 continues to push efficiency by reducing both size and training data requirements while attempting to match or exceed proprietary alternatives. Earlier versions of Molmo and other public models often struggled to achieve results comparable with closed systems or required extensive computational resources. By offering transparent datasets, benchmarks, and a choice of model variants under open licenses, Ai2 fosters reproducibility and collaborative innovation in a field marked by concerns around opacity and data provenance. For researchers and developers, Molmo 2’s approach provides useful reference points and tools for building explainable, reliable AI systems in vision-centric domains.

You can follow us on Youtube, Telegram, Facebook, Linkedin, Twitter ( X ), Mastodon and Bluesky

You Might Also Like

Health Insurers and Providers Intensify AI Battle Over Medical Coding

Zara Uses AI to Streamline Retail Image Production Processes

RealMan Robotics Releases RealSource Dataset for Robotics Research

Verizon Boosts Kodiak AI Driverless Trucks With 5G and IoT Solutions

Wall Street Banks Accelerate AI Integration as Workforce Planning Shifts

Share This Article
Facebook Twitter Copy Link Print
Kaan Demirel
By Kaan Demirel
Kaan Demirel is a 28-year-old gaming enthusiast residing in Ankara. After graduating from the Statistics department of METU, he completed his master's degree in computer science. Kaan has a particular interest in strategy and simulation games and spends his free time playing competitive games and continuously learning new things about technology and game development. He is also interested in electric vehicles and cyber security. He works as a content editor at NewsLinker, where he leverages his passion for technology and gaming.
Previous Article Tesla Targets Sun Glare in Full Self-Driving with New Patent

Stay Connected

6.2kLike
8kFollow
2.3kSubscribe
1.7kFollow

Latest News

Tesla Targets Sun Glare in Full Self-Driving with New Patent
Electric Vehicle
Delaware Court Reinstates Elon Musk’s $139 Billion Tesla Pay Deal
Electric Vehicle
US Prosecutors Secure Guilty Plea From Nefilim Ransomware Operator
Cybersecurity
Cybersecurity Workers Plead Guilty to Orchestrating Ransomware Attacks
Cybersecurity
Triple-Fan DDR5 Memory Sticks Raise Prices and Performance Hopes
Computing
NEWSLINKER – your premier source for the latest updates in ai, robotics, electric vehicle, gaming, and technology. We are dedicated to bringing you the most accurate, timely, and engaging content from across these dynamic industries. Join us on our journey of discovery and stay informed in this ever-evolving digital age.

ARTIFICAL INTELLIGENCE

  • Can Artificial Intelligence Achieve Consciousness?
  • What is Artificial Intelligence (AI)?
  • How does Artificial Intelligence Work?
  • Will AI Take Over the World?
  • What Is OpenAI?
  • What is Artifical General Intelligence?

ELECTRIC VEHICLE

  • What is Electric Vehicle in Simple Words?
  • How do Electric Cars Work?
  • What is the Advantage and Disadvantage of Electric Cars?
  • Is Electric Car the Future?

RESEARCH

  • Robotics Market Research & Report
  • Everything you need to know about IoT
  • What Is Wearable Technology?
  • What is FANUC Robotics?
  • What is Anthropic AI?
Technology NewsTechnology News
Follow US
About Us   -  Cookie Policy   -   Contact

© 2025 NEWSLINKER. Powered by LK SOFTWARE
Welcome Back!

Sign in to your account

Register Lost your password?