frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: We beat Google DeepMind but got killed by Zhipu AI

https://github.com/minitap-ai/mobile-use
4•orangepomodoro•1h ago
Two months ago, my friends in AI and I asked: What if an AI could actually use a phone like a human?

So we built an agentic framework that taps, swipes, types… and somehow it’s outperforming giant labs like Google DeepMind and Microsoft Research on the AndroidWorld benchmark.

We were thrilled about our results until a massive lab (Zhipu AI) released its results last week to take the top spot.

They’re slightly ahead, but they have an army of 50+ phds and I don't see how a team like us can compete with them, that does not seem realistic... except that they're closed source.

And we decided to open-source everything. That way, even as a small team, we can make our work count.

We’re currently building our own custom mobile RL gyms, training environments made to push this agent further and get closer to 100% on the benchmark.

What do you think can make a small team like us compete against such giants?

Repo’s here if you want to check it out or contribute: https://github.com/minitap-ai/mobile-use

Our discord: https://discord.gg/6nSqmQ9pQs

The Pleasure of Patterns in Art

https://thereader.mitpress.mit.edu/why-repetition-in-art-pleases-the-brain/
3•billybuckwheat•1m ago•0 comments

YouTube Shorts are almost certainly being AI upscaled

https://old.reddit.com/r/youtube/comments/1lllnse/youtube_shorts_are_almost_certainly_being_ai/
1•Erikun•1m ago•0 comments

NASA AI model can predict when a solar storm may strike

https://www.technologyreview.com/2025/08/20/1122163/nasa-ibm-ai-predict-solar-storm/
1•pseudosavant•2m ago•0 comments

AI Tools Now Use Radar to Wiretap Your Phone from 10 Feet Away

https://www.offthegridnews.com/privacy/ai-tools-now-use-radar-to-wiretap-your-phone-from-10-feet-away/
1•warrenm•2m ago•1 comments

Biggest Data Breaches of All Time [Updated 2025]

https://www.upguard.com/blog/biggest-data-breaches
1•warrenm•3m ago•0 comments

Upgrade Context MCP and Agent for K8s and OSS Projects (Istio, Kafka)

https://www.chkk.io/blog/chkk-upgrade-context-server-upgrade-agent-for-coding-assistants
1•akhayam•3m ago•1 comments

A Tale of Two Jurists in the Trump Era

https://www.newyorker.com/news/the-lede/a-tale-of-two-jurists-in-the-trump-era
2•mitchbob•4m ago•1 comments

Update: We're Building an Open-Sourced, Privacy-Focused, Free PDF WebApp:)

2•PseudoComputer•8m ago•0 comments

Study Reveals Vitamin D May Slow Biological Aging

https://scitechdaily.com/groundbreaking-study-reveals-that-vitamin-d-may-slow-biological-aging/
1•geox•8m ago•1 comments

The Show Horse and the Work Horse

https://granolashotgun.wordpress.com/2019/07/22/the-show-horse-and-the-work-horse/
1•trevin•9m ago•0 comments

Show HN: Superhuman for LinkedIn

https://usenarrow.com
1•yashgupta417•12m ago•0 comments

AI search ranks content by neural models, not backlinks or traffic metrics

https://generative-engine.org/blog
1•flixing•14m ago•0 comments

Open Source Shipwreck Osint

https://github.com/Alfredredbird/Open-Wrecks
1•alfredredbird•14m ago•1 comments

Show HN: I Found Publicly Accessible Databases Using the Tool, Peekleaks

https://www.peekleaks.com/
1•hharana7889•15m ago•0 comments

Introduction to Bluesky's AT Protocol

https://mackuba.eu/2025/08/20/introduction-to-atproto/
2•psionides•16m ago•0 comments

Qclojure: Functional quantum computer programming library for Clojure

https://github.com/lsolbach/qclojure
2•simonpure•17m ago•0 comments

OSS under attack: four lessons in how trust gets exploited"

https://www.open-source-ward.com/suppl/
1•avervaet•18m ago•0 comments

Addiction alloys: the cross-promotion of internet compulsions

https://internettalk.xyz/blog/addiction-alloys/
1•pityJuke•18m ago•1 comments

Tech, chip stock sell-off continues as AI bubble fears mount

https://finance.yahoo.com/news/tech-chip-stock-sell-off-continues-as-ai-bubble-fears-mount-184837135.html
11•pera•20m ago•0 comments

Nuclear fusion gets a boost from a controversial debunked experiment

https://www.newscientist.com/article/2493372-nuclear-fusion-gets-a-boost-from-a-controversial-debunked-experiment/
2•voxadam•22m ago•2 comments

Skillshare Names Paul Slavin as Chief Executive Officer

https://www.businesswire.com/news/home/20250811766802/en/Skillshare-Names-Paul-Slavin-as-Chief-Executive-Officer
1•petecooper•23m ago•0 comments

Few Americans Read for Pleasure

https://www.washingtonpost.com/technology/2025/08/20/american-reading-declines-attention-spans/
4•perihelions•25m ago•1 comments

Google's "Linux development environment" for Android: Is this the end of termux?

https://old.reddit.com/r/termux/comments/1mugsih/is_this_the_end_of_termux/
2•sipofwater•26m ago•3 comments

Revisionist Glaciology: Better Iceberg Illustrations Show Undersea Surprises

https://99percentinvisible.org/article/revisionist-glaciology-fixing-iceberg-illustrations-to-better-reflect-reality/
1•huftis•26m ago•0 comments

Communicate Early and Often

https://dontbreakprod.com/posts/communicate-early-and-often
2•dorkrawk•28m ago•0 comments

FBI: Russian spies exploit 7yo Cisco bug to slurp critical infrastructure config

https://www.theregister.com/2025/08/20/russian_fsb_cyberspies_exploiting_cisco_bug/
4•rntn•28m ago•1 comments

Grounding with Google Search

https://ai.google.dev/gemini-api/docs/google-search
2•jonbaer•29m ago•0 comments

Will there be no more non-reasoning models?

https://community.openai.com/t/will-there-be-no-more-non-reasoning-models/1352676
1•softwaredoug•29m ago•0 comments

Don't Worry Village: The young S. Koreans who left Seoul, seeking community

https://www.aljazeera.com/features/2025/8/19/dont-worry-village-the-young-s-koreans-who-left-seoul-seeking-community
2•Qem•31m ago•0 comments

Compute Where It Counts: a trainable LLM sparsity enabling 4x CPU speed

https://crystalai.org/blog/2025-08-18-compute-where-it-counts
2•cyris•32m ago•1 comments