But it's a vision I can get behind, where basic tasks like transcription, computer use, in-app tool use, image understanding, etc., are local, secure, and private.
Apple GUIs have underlying accessibility annotations that, if surfaced, would make UI manipulation easy for LLMs.
"Back in the day" - 1990's - Apple had Virtual User, basically a lisp derivative that reported UI state as S-expressions (like a web DOM) and allowed scripts to manipulate settings and perform UI actions.
With such a curated DOM/model and selective UI inputs, they could manage privacy and safety, opening up LLM control to users who would otherwise never trust a machine.
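To make the idea concrete, here's a minimal sketch of what "curated" could mean: a UI tree serialized to S-expressions, with privacy-flagged subtrees redacted before anything reaches the model. Everything here (the `Node` type, the `:redacted` marker, the field names) is hypothetical illustration, not a real Apple API.

```python
# Hypothetical sketch: serializing a curated accessibility tree to
# S-expressions, redacting subtrees the user marked private.
# None of these names correspond to an actual Apple framework.

from dataclasses import dataclass, field

@dataclass
class Node:
    role: str                      # e.g. "window", "button", "text-field"
    label: str = ""
    private: bool = False          # curated flag: hide contents from the model
    children: list = field(default_factory=list)

def to_sexpr(node: Node) -> str:
    """Render the tree as an S-expression, dropping private contents."""
    if node.private:
        return f"({node.role} :redacted t)"
    label = f' :label "{node.label}"' if node.label else ""
    kids = "".join(" " + to_sexpr(c) for c in node.children)
    return f"({node.role}{label}{kids})"

ui = Node("window", "Settings", children=[
    Node("button", "Save"),
    Node("text-field", "Password", private=True),
])

print(to_sexpr(ui))
# → (window :label "Settings" (button :label "Save") (text-field :redacted t))
```

The point is that the redaction happens in the serializer, below the model: the LLM never sees the password field's contents, only that a redacted element exists there.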
I hope they're working on that approach and training models for it. It's one way they could distinguish the Apple platform as being more controllable, with safety and permissions built into the subsystems instead of giving the LLM full control over UI input.
brudgers•2d ago