frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents

https://machinelearning.apple.com/research/ferret-ui
18•CharlesW•4d ago

Comments

brudgers•2d ago
direct to paper, https://arxiv.org/pdf/2509.26539
bensyverson•1h ago
I recently experimented with Apple's Foundation Models framework, and I came away impressed at the speed and accuracy of the LLM. You can't ask it to build you a web app, but it can reliably translate a written instruction into tool use within your native app. I think there's a lot of merit to Apple's approach, using specialist tiny models like Ferret-UI Lite, though I don't think we'll see the full fruits of their labor for another year or two.

But it's a vision that I can get behind, where basic tasks like transcription, computer use, in-app tool, image understanding, etc, are local, secure and private.

w10-1•6m ago
I'm disappointed that they are taking the long way around, with screen shots and visual recognition.

Apple GUI's have underlying accessibility annotations that if surfaced would make UI manipulation easy for LLM's.

"Back in the day" - 1990's - Apple had Virtual User, basically a lisp derivative that reported UI state as S-expressions (like a web DOM) and allowed scripts to manipulate settings and perform UI actions.

With such a curated DOM/model and selective UI inputs, they could manage privacy and safety, opening up LLM control to users who would otherwise never trust a machine.

I hope they're working on that approach and training models for it. It's one way they could distinguish the Apple platform as being more controllable, with safety and permissions built into the subsystems instead of giving the LLM full control over UI input.

AirSnitch: Demystifying and breaking client isolation in Wi-Fi networks [pdf]

https://www.ndss-symposium.org/wp-content/uploads/2026-f1282-paper.pdf
161•DamnInteresting•2h ago•70 comments

Open Source Endowment – new funding source for open source maintainers

https://endowment.dev/
67•kvinogradov•1h ago•29 comments

Nano Banana 2: Google's latest AI image generation model

https://blog.google/innovation-and-ai/technology/ai/nano-banana-2/
234•davidbarker•1h ago•228 comments

Palm OS User Interface Guidelines [pdf, 2003]

https://cs.uml.edu/~fredm/courses/91.308-spr05/files/palmdocs/uiguidelines.pdf
21•spiffytech•56m ago•3 comments

Google Street View in 2026

https://tech.marksblogg.com/google-street-view-coverage.html
11•marklit•19m ago•0 comments

Show HN: Terminal Phone – E2EE Walkie Talkie from the Command Line

https://gitlab.com/here_forawhile/terminalphone
229•smalltorch•7h ago•53 comments

Google API keys weren't secrets, but then Gemini changed the rules

https://trufflesecurity.com/blog/google-api-keys-werent-secrets-but-then-gemini-changed-the-rules
1058•hiisthisthingon•22h ago•256 comments

Bild AI (YC W25) Is Hiring Interns to Make Housing Affordable

https://www.workatastartup.com/jobs/80596
1•rooppal•57m ago

Anthropic ditches its core safety promise

https://www.cnn.com/2026/02/25/tech/anthropic-safety-policy-change
488•motbus3•5h ago•278 comments

BuildKit: Docker's Hidden Gem That Can Build Almost Anything

https://tuananh.net/2026/02/25/buildkit-docker-hidden-gem/
52•jasonpeacock•3h ago•21 comments

just-bash: Bash for Agents

https://github.com/vercel-labs/just-bash
58•tosh•4h ago•36 comments

Will vibe coding end like the maker movement?

https://read.technically.dev/p/vibe-coding-and-the-maker-movement
38•itunpredictable•1h ago•33 comments

Better to Skip a Year for Hardware Upgrades?

https://boilingsteam.com/poll-better-to-skip-a-year-for-pc-upgrades/
16•ekianjo•3d ago•18 comments

Show HN: Hacker Smacker – spot great (and terrible) HN commenters at a glance

https://hackersmacker.org
4•conesus•1d ago•0 comments

Tell HN: YC companies scrape GitHub activity, send spam emails to users

430•miki123211•8h ago•143 comments

Jimi Hendrix was a systems engineer

https://spectrum.ieee.org/jimi-hendrix-systems-engineer
610•tintinnabula•21h ago•200 comments

Banned in California

https://www.bannedincalifornia.org/
401•pie_flavor•18h ago•469 comments

Those who can, teach history

https://www.historytoday.com/archive/making-history/those-who-can-teach-history
30•hhs•4d ago•28 comments

How will OpenAI compete?

https://www.ben-evans.com/benedictevans/2026/2/19/how-will-openai-compete-nkg2x
399•iamskeole•19h ago•556 comments

The Pentagon Feuding with an AI Company Is a Bad Sign

https://foreignpolicy.com/2026/02/25/anthropic-pentagon-feud-ai/
32•Jimmc414•1h ago•11 comments

Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents

https://machinelearning.apple.com/research/ferret-ui
18•CharlesW•4d ago•3 comments

First Website (1992)

https://info.cern.ch
288•shrikaranhanda•18h ago•82 comments

A 26-Gram Butterfly-Inspired Robot Achieving Autonomous Tailless Flight

https://arxiv.org/abs/2602.06811
52•Terretta•4d ago•15 comments

Windows 11 Notepad to support Markdown

https://blogs.windows.com/windows-insider/2026/01/21/notepad-and-paint-updates-begin-rolling-out-...
334•andreynering•1d ago•501 comments

Story of XZ Backdoor [video]

https://www.youtube.com/watch?v=aoag03mSuXQ
72•Ulf950•3h ago•28 comments

Making MCP cheaper via CLI

https://kanyilmaz.me/2026/02/23/cli-vs-mcp.html
291•thellimist•21h ago•112 comments

Some silly Z3 scripts I wrote

https://www.hillelwayne.com/post/z3-examples/
28•azhenley•3d ago•6 comments

Artist who “paints” portraits on glass by hitting it with a hammer

https://simonbergerart.com
229•cs702•4d ago•101 comments

Time Is Different

https://shkspr.mobi/blog/2026/02/this-time-is-different/
29•speckx•4h ago•25 comments

Bus stop balancing is fast, cheap, and effective

https://worksinprogress.co/issue/the-united-states-needs-fewer-bus-stops/
406•surprisetalk•1d ago•588 comments