Molmo 2: State-of-the-art video understanding, pointing, and tracking multimodal

https://allenai.org/blog/molmo2
1•maxloh•7h ago

Comments

maxloh•7h ago
From Allen AI's Discord:

*Introducing Molmo 2*: State-of-the-art video understanding, pointing, and tracking

Last year, Molmo helped push image understanding forward with pointing: grounded answers you can verify. Now, *Molmo 2* brings those capabilities to video, so the model doesn’t just answer questions, it can show you where and when something is happening.

On major industry benchmarks, Molmo 2 *surpasses most open multimodal models* and even *rivals closed peers* like Gemini 3 Pro and Claude Sonnet 4.5.

Molmo 2 returns pixel coordinates + timestamps over videos and coordinates over images, enabling:

◘ Video + image QA
◘ Counting-by-pointing
◘ Dense captioning
◘ Artifact detection
◘ Subtitle-aware analysis
…and more!

Three variants depending on your needs:

*Molmo 2 (8B)*: Qwen 3 backbone, best overall performance
*Molmo 2 (4B)*: Qwen 3 backbone, fast + efficient
*Molmo 2-O (7B)*: Olmo backbone, fully open model flow

Demos:

*Counting objects & actions* (“How many times does the ball hit the ground?”): returns the count plus space–time pointers for each event. https://www.youtube.com/watch?v=fvYfPTTTZ_w
*Ask-it-anything long-video QA* (“Why does the player change strategy here?”): points to the moments supporting the answer. https://www.youtube.com/watch?v=Ej3Hb3kRiac
*Object tracking* (“Follow the red race car.”): tracks it across frames with coordinates over time. https://www.youtube.com/watch?v=uot140v_h08
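
(Not from the announcement: a rough sketch of how the counting demo's output, a count plus one space-time pointer per event, might be handled downstream. The `SpaceTimePoint` record, its field names, and the sample values are all hypothetical; the post does not describe Molmo 2's actual output format.)

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpaceTimePoint:
    """Hypothetical pointer: pixel coordinates plus a timestamp in seconds."""
    x: float
    y: float
    t: float


def count_by_pointing(points):
    """Counting-by-pointing: the count is the number of pointers returned."""
    return len(points)


def points_between(points, start, end):
    """Return pointers whose timestamp lies in [start, end], ordered by time."""
    return sorted((p for p in points if start <= p.t <= end), key=lambda p: p.t)


# Made-up values for "How many times does the ball hit the ground?"
bounces = [
    SpaceTimePoint(x=312.0, y=640.5, t=1.2),
    SpaceTimePoint(x=355.0, y=642.0, t=2.7),
    SpaceTimePoint(x=401.5, y=641.0, t=4.1),
]

print(count_by_pointing(bounces))
print(points_between(bounces, 2.0, 5.0))
```

Keeping one pointer per counted event is what makes the count checkable: each bounce can be looked up at its own frame and pixel location.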

We’ve also *significantly upgraded the Ai2 Playground*. You can now upload a video or multiple images to try summarization, tracking, and counting, while seeing exactly where the model is looking.

Try it and learn more:

▶ Playground: https://playground.allenai.org/
⬇ Models: https://huggingface.co/collections/allenai/molmo2
Blog: https://allenai.org/blog/molmo2
Report: https://allenai.org/papers/molmo2

API coming soon

Flow – A Programmer's Text Editor

https://flow-control.dev/
1•css_apologist•56s ago•0 comments

Show HN: GPT Image 1.5 – An AI image editor with conversational editing

https://gptimage15.app
1•jackson_mile•2m ago•0 comments

California threatens to ban Tesla sales for 30 days

https://www.sfchronicle.com/california/article/tesla-autopilot-claims-possible-sales-ban-21246659...
1•dangle1•4m ago•0 comments

Get Food with Your Colleagues

https://lcmchris.github.io/posts/get_food_with_your_colleagues
1•lcmchris•5m ago•0 comments

Canada launches its own quantum research program

https://betakit.com/canada-launches-it-own-quantum-research-program-to-rival-darpa-initiative/
1•gangtao•7m ago•0 comments

Ask HN: What happens when a new user's submission disappears?

1•ursAxZA•8m ago•0 comments

Show HN: Learn Japanese contextually while browsing

https://lingoku.ai/learn-japanese
2•englishcat•11m ago•0 comments

New MI6 chief: Tech bosses are becoming as powerful as nations

https://www.thetimes.com/uk/defence/article/new-mi6-chief-blaise-metrew-russia-speech-bqlvlx5hq
3•voxadam•12m ago•1 comments

Detecting hidden market regimes beyond correlation (empirical results)

https://github.com/johnoliveiradev/Multiscale-structural-regime-benchmark/tree/main/results/BTC%2...
1•johnoliveiradev•14m ago•1 comments

A universal law could explain how large trades change stock prices

https://phys.org/news/2025-12-universal-law-large-stock-prices.html
1•pseudolus•16m ago•0 comments

An Interview with a YouTube Writer Behind 500M+ Views

https://www.humaninvariant.com/blog/youtube-interview
2•gwintrob•20m ago•0 comments

LLM Pricing Calculator

https://app.hatrio.ai/free/llm-pricing-calculator
1•DinakarS•21m ago•0 comments

Time Team Map of Episodes (2021)

https://deparkes.co.uk/2021/04/16/time-team-map-of-episodes/
2•zeristor•23m ago•0 comments

The Jagged AI Frontier Is a Data Frontier

https://huggingface.co/spaces/lvwerra/jagged-data-frontier
1•in-silico•23m ago•0 comments

X updates terms, countersues to lay claim to the 'Twitter' trademark

https://techcrunch.com/2025/12/16/x-updates-its-terms-files-countersuit-to-lay-claim-to-the-twitt...
4•SanjayMehta•24m ago•1 comments

All printable snow-based triboelectric nanogenerator: Snow-TENG

https://www.sciencedirect.com/science/article/abs/pii/S2211285519302204
1•westurner•25m ago•0 comments

Synthetic key enzyme enables the conversion of CO2 into formic acid

https://phys.org/news/2025-12-synthetic-key-enzyme-enables-conversion.html
2•westurner•25m ago•0 comments

Hot for its bot, McKinsey may cut jobs

https://www.theregister.com/2025/12/16/mckinsey_may_cut_staff/
2•OptionOfT•27m ago•2 comments

The Longest Suicide Note in American History

https://www.theatlantic.com/ideas/2025/12/national-security-strategy-democracy/685270/
5•petethomas•33m ago•0 comments

Prototypes Are the New PRDs

https://www.figma.com/blog/prototypes-are-the-new-prds/
2•gmays•33m ago•0 comments

Windows 11 will ask consent before sharing personal files with AI after outrage

https://www.windowslatest.com/2025/12/17/microsoft-confirms-windows-11-will-ask-for-consent-befor...
12•jinxmeta•34m ago•1 comments

Racks of AI chips are too damn heavy

https://www.theverge.com/ai-artificial-intelligence/844966/heavy-ai-data-center-buildout
2•jnord•36m ago•0 comments

Understanding Email Encryption

https://www.fastmail.com/blog/email-encryption/
3•nmjenkins•36m ago•0 comments

Commodore 64 Ultimate Review

https://www.ign.com/articles/commodore-64-ultimate-review
3•amichail•37m ago•1 comments

Shmøergh Moduleur: analog DIY-friendly modular synth

https://www.shmoergh.com/moduleur/
1•Philpax•39m ago•0 comments

Most Parked Domains Now Serving Malicious Content

https://krebsonsecurity.com/2025/12/most-parked-domains-now-serving-malicious-content/
2•jnord•41m ago•1 comments

WikiFlix shows us what Netflix would have been like 100 years ago

https://wikiflix.toolforge.org/#/
2•jnord•42m ago•0 comments

An open letter to Mozilla's new CEO: Firefox doesn't need AI

https://old.reddit.com/r/firefox/comments/1poe7kb/an_open_letter_to_mozillas_new_ceo_firefox_doesnt/
6•bpierre•43m ago•1 comments

The brawl over the Colorado River is about more than water

https://www.politico.com/news/2025/12/16/colorado-river-water-users-association-conference-00676796
1•bikenaga•47m ago•0 comments

HP Wolf Security Threat Insights December 2025

https://threatresearch.ext.hp.com/hp-wolf-security-threat-insights-report-december-2025/
1•dexter_it•54m ago•0 comments