frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Droidrun – LLM Agent for Android

1•nodueck•9h ago
Hi HN,

I'm Nikolai, software engineer and co-founder at DroidRun. We built DroidRun, an LLM-based agent that leverages the Android Accessibility Tree for precise control and understanding of UI elements. It works on real phones and emulators, and it's open source.

How it started:

Our co-founder Niels Schmidt (you’ll see him in the demos) coded a prototype and shared a quick video. It went viral, about 50k views on X in under 2 hours. That moment pushed us to go all-in on DroidRun and soon after, we open-sourced it.

How it works:

Most agents rely on screenshots alone for context. We do that plus feed the Accessibility Tree into the LLM. That gives structural, hierarchical, and spatial metadata about UI elements.

Here’s an example:

Screenshot of a real UI: https://imgur.com/a/ePRLpyv

And a matching accessibility JSON snippet:

  {
    "index": 3,
    "resourceId": "com.android.settings:id\\/search_action_bar",
    "className": "LinearLayout",
    "text": "search_action_bar",
    "bounds": "42, 149, 1038, 338",
    "children": [
      {
        "index": 4,
        "resourceId": "com.android.settings:id\\/search_bar_title",
        "className": "TextView",
        "text": "In Einstellungen suchen",
        "bounds": "189, 205, 768, 282",
        "children": []
      }
    ]
  }
We also annotate UI regions in screenshots with numbers, then match them in the tree. This structure gives the agent a deep understanding of what’s on screen, even across different device types like tablets.

This allows for better generalization across devices and screen sizes. Agents can act with greater confidence and fewer hallucinations.

Current Status:

- Ranked #1 on AndroidWorld until recently (it became highly competitive)

- Supports real devices + Emulators

- Strong performance on simple and complex UI tasks

- Gemini 2.5 Pro works best so far, but we’re iterating fast

What's next:

We’re working on a cloud platform where you can run prompts on Android devices without setup. Think of LLM controlling a phone in the cloud, ready to test your automations.

Looking for:

- Feedback from HN

- Collaborators who love Android, LLMs, agents

- OSS contributors

Channel 4 makes TV history with Britain's first AI presenter

https://www.channel4.com/press/news/channel-4-makes-tv-history-britains-first-ai-presenter
1•ChrisArchitect•21s ago•0 comments

AI News Anchor Debuts on U.K.'S Channel 4 in Stunt Proving Dangers of AI

https://variety.com/2025/tv/news/ai-news-anchor-channel-4-1236557295/
3•bookofjoe•4m ago•0 comments

The Unreasonable Effectiveness of Fiber

https://www.empirical.health/blog/dietary-fiber-reduces-all-cause-morality/
2•brandonb•4m ago•1 comments

Why Millennials and Gen Z Are Going Gray Early, According to Experts

https://www.newsweek.com/millennials-gen-z-gray-hair-experts-young-mineral-deficiency-2011928
5•austinallegro•5m ago•0 comments

No One Knows What a Moon Is

https://www.theatlantic.com/science/2025/10/quasi-moon-definition/684710/
1•fortran77•9m ago•1 comments

How to scale AI without using nuclear reactors (Adaptive attention)

https://medium.com/@hyborian_/sparse-adaptive-attention-moe-how-i-solved-openais-650b-problem-wit...
1•unconsciousllm•9m ago•0 comments

Aspire – Orchestrate front ends, APIs, containers, and databases effortlessly

https://aspire.dev/
1•vyrotek•10m ago•0 comments

Beyond Accuracy: A 5-Step Framework for Meaningful AI Evaluation

https://oblsk.com/blog/framework-for-meaningful-ai-evaluation/
1•munroe•10m ago•0 comments

Neo the Home Robot

https://www.1x.tech/
1•strzalek•11m ago•1 comments

How to turn off Meta AI on Facebook – What you can and can't control

https://proton.me/blog/turn-off-meta-ai-facebook
4•jethronethro•11m ago•0 comments

Show HN: Globe of History – Interactive 3D Map of 6k Years of Human Events

https://www.globeofhistory.com/
2•yamsasson•11m ago•0 comments

Free File Hosting and Sharing

https://iofiles.adverx.site/
1•anonyxbiz•13m ago•0 comments

If things in America weren't stupid enough, Texas is suing Tylenol maker

https://arstechnica.com/health/2025/10/if-things-in-america-werent-stupid-enough-texas-is-suing-t...
2•unsnap_biceps•14m ago•1 comments

Colors and Numbers

https://mail.cyberneticforests.com/untitled-2/
1•smartmic•14m ago•0 comments

Everyone's a Free-Speech Hypocrite

https://www.thefire.org/news/everyones-free-speech-hypocrite
1•everybodyknows•14m ago•0 comments

Google Beam: Future of Communication

https://beam.google/
2•wanderer2323•16m ago•0 comments

U.S. energy supply chains are unlikely to meet anticipated demand

https://hub.jhu.edu/2025/10/09/us-energy-supply-falling-short/
3•geox•16m ago•0 comments

Minecraftonia a voxel engine built with C# 13/.NET 9 and Avalonia

https://github.com/wieslawsoltes/Minecraftonia
1•wiso•17m ago•0 comments

Sicilian Arabic

https://en.wikipedia.org/wiki/Siculo-Arabic
2•nothrowaways•17m ago•0 comments

Ask HN: What's one small habit you started that surprisingly changed your life?

2•jimsojim•17m ago•0 comments

Godo: Fast parallel sandboxes for any Git project

https://github.com/cortesi/godo
2•sea-gold•20m ago•1 comments

Writing an LLM from scratch, part 24 – the transcript hack

https://www.gilesthomas.com/2025/10/llm-from-scratch-24-the-transcript-hack
1•gpjt•20m ago•0 comments

Show HN: I made simple Email Marketing app with base features

https://app.getparlo.io
1•ivona52•20m ago•0 comments

A Strange brew: the case of the man behind a Scottish tea fraud

https://www.theguardian.com/uk-news/2025/oct/28/scottish-grown-tea-tam-o-braan
1•neaden•22m ago•0 comments

Leaked Documents Show OpenAI Has a Clear Definition of 'AGI'

https://gizmodo.com/leaked-documents-show-openai-has-a-very-clear-definition-of-agi-2000543339
1•deegles•25m ago•0 comments

Avoiding the Trailing Slash Tax on GitHub Pages and Astro

https://justoffbyone.com/posts/trailing-slash-tax/
2•cancan•25m ago•2 comments

Inside Amazon's engineering culture: Lessons from their senior principals

https://olshansky.substack.com/p/inside-amazons-engineering-culture
8•Olshansky•25m ago•1 comments

Amazon Nova Multimodal Embeddings

https://aws.amazon.com/blogs/aws/amazon-nova-multimodal-embeddings-now-available-in-amazon-bedrock/
1•gslin•26m ago•0 comments

Data Centers Are Getting Big

https://www.distilled.earth/p/these-data-centers-are-getting-really
2•smartmic•27m ago•0 comments

OpenAI completed its for-profit restructuring and new deal with Microsoft

https://www.theverge.com/news/807875/openai-microsoft-for-profit-agi
1•Palmik•30m ago•1 comments