Show HN: Running a vision model on every screenshot on-device

https://github.com/ayushh0110/ScreenMind/blob/main/README.md
16•alexkarpathy•1h ago
Hi, author here. ScreenMind is a privacy-first alternative to Microsoft Recall. It runs locally on Gemma 4, one of the few models that supports vision, audio, and reasoning together, so your data never leaves your machine.

With ScreenMind you can keep track of your timeline and see how much time you spent on what, and you can search for any screenshot by any text that appeared in it. The coolest part: you can chat with your screen history. Ask things like "What did Alex text me on Discord?" or "Did I receive any mail from Microsoft?" If it was on your screen, you can ask about it in the chat. You can also build automations on top of it, like "send me a report of my whole day on Slack" (it has integrations). Automations can be written in plain English if you don't code much, or in Python if you're a dev who wants to dive deeper. You can also save voice memos (with a screenshot) using just a hotkey, and get your meetings transcribed and summarized (meetings are detected automatically).

The hardest part was keeping ScreenMind running as a background service. It would not have been hard if the chat feature didn't exist, but running a local model requires compute, and continuously analyzing screenshots would keep all the resources hogged. To deal with that I came up with a perceptual hash cache. The three-tier cache system reduces inference by up to 40% for an average user (which is me). To cut inference time further, I added three modes: fast, balanced, and accurate, where the tradeoff is between time and accuracy.
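For readers wondering what a perceptual hash cache looks like in practice, here is a minimal single-tier sketch. It is not ScreenMind's actual code: the post does not describe the three tiers, so the `dhash` function, the `ScreenshotCache` class, the `analyze` callback, and the `max_distance` threshold are all hypothetical names and values chosen for illustration. The idea is to hash each screenshot, compare it to hashes already seen, and skip the model call when the screen looks nearly identical.

```python
from typing import Callable, List, Tuple

Gray = List[List[int]]  # rows of 0-255 grayscale pixel values


def dhash(pixels: Gray, hash_size: int = 8) -> int:
    """Difference hash: one bit per pair of horizontally adjacent cells."""
    rows, cols = len(pixels), len(pixels[0])
    # Nearest-neighbour downsample to hash_size rows by hash_size + 1 columns.
    small = [
        [pixels[r * rows // hash_size][c * cols // (hash_size + 1)]
         for c in range(hash_size + 1)]
        for r in range(hash_size)
    ]
    bits = 0
    for row in small:
        for left, right in zip(row, row[1:]):
            bits = (bits << 1) | (1 if left > right else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


class ScreenshotCache:
    """Reuses an earlier analysis when a new screenshot hashes close to an old one."""

    def __init__(self, analyze: Callable[[Gray], str], max_distance: int = 4):
        self.analyze = analyze            # hypothetical stand-in for the model call
        self.max_distance = max_distance  # assumed threshold, would need tuning
        self.entries: List[Tuple[int, str]] = []
        self.hits = 0
        self.misses = 0

    def describe(self, pixels: Gray) -> str:
        h = dhash(pixels)
        for known, result in self.entries:
            if hamming(h, known) <= self.max_distance:
                self.hits += 1
                return result  # near-duplicate screen: skip inference
        self.misses += 1
        result = self.analyze(pixels)
        self.entries.append((h, result))
        return result
```

A linear scan over stored hashes is fine for a sketch; a real background service would likely bound the cache size and expire old entries.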

For now I use it daily on my 4 GB GTX 1650 in fast mode and it works pretty well; it would be much faster on a high-end machine. It also has an MCP server, so you can just ask Claude Desktop or Cursor about the bug you saw this morning.

Supports Windows, macOS, and Linux.

Being upfront about the rough edges: it is not extensively tested on Mac, and installation has some friction, so I'm working on a one-click installer.

(Reposting: I put up an earlier version a few days back, but the comments got flagged because of my new account, so I couldn't reply to any.)

Repo: github.com/ayushh0110/ScreenMind

Curious whether anyone has ideas on how to approach multi-monitor support.

Building a voice dictation pipeline tuned for devs

https://freestylevoice.com/blog/freestyle-transcribe
1•matteo8p•1m ago•0 comments

Europe's resistance to AC is driving it insane

https://www.noahpinion.blog/p/europes-resistance-to-ac-is-driving
1•mooreds•1m ago•0 comments

Sorry, but There's Nothing Stable About Bitcoins or Stablecoins

https://www.forbes.com/sites/johntamny/2026/06/28/sorry-but-theres-nothing-stable-about-bitcoins-...
1•RickJWagner•1m ago•0 comments

The war against 'woke' could end US science as we know it

https://www.theverge.com/science/957630/omb-killing-science-budget-grants-research
1•rufo•1m ago•0 comments

I built an anime-style UI for watching AI coding agents review each other's code

https://old.reddit.com/r/codex/comments/1uit86f/i_built_an_animestyle_ui_for_watching_ai_coding/
2•syumei•2m ago•0 comments

Vibe Coding to Agentic Engineering: A Three-Phase Workflow with Claude Code

https://www.apimatic.io/blog/agentic-engineering-claude-code
1•m3h•2m ago•0 comments

The Storytelling of Fictional User Interfaces (FUI) in Film

https://manonstripes.substack.com/p/the-hidden-storytelling-of-fictional
1•doctorwhat•2m ago•0 comments

Ask HN: How do you guys find your competitors?

2•krishavRajSingh•2m ago•0 comments

Show HN: Llama Legends

https://llamalegends.com/
1•TheLuigiplayer•4m ago•0 comments

"Energy Constraints and Tradeoffs" by Martin Picard [video]

https://www.youtube.com/watch?v=uT3PwI_pX5E
1•surprisetalk•5m ago•0 comments

The Troubled Energy Transition (2025)

https://www.foreignaffairs.com/united-states/troubled-energy-transition-yergin-orszag-arya
1•simonebrunozzi•7m ago•0 comments

Show HN: Free earnings calendar for US stocks

https://zenvesto.com/earnings-calendar
1•zenvesto•7m ago•0 comments

CachyOS June 2026 Release

https://cachyos.org/blog/2606-june-release/
1•simonpure•8m ago•0 comments

The nervous system reset trend: what interoception is

https://medium.com/@6thMind/the-nervous-system-reset-trend-what-interoception-is-and-whether-you-...
1•smanuel•8m ago•0 comments

The No-Human Future

https://aeon.co/essays/what-is-nick-lands-philosophy-of-accelerationism-really
1•speckx•8m ago•0 comments

Show HN: AI Music Generator Free and No Signup

https://music0.org/
1•pekingzcc•9m ago•0 comments

Renameforce

https://renameforce.com
1•cosiiine•10m ago•0 comments

Microsoft's new DocumentDB rethinks NoSQL on PostgreSQL

https://www.infoworld.com/article/3812630/microsofts-new-documentdb-rethinks-nosql-on-postgresql....
2•amai•11m ago•1 comments

CoreWeave ARIA: AI Research and Iteration Agent

https://wandb.ai/wandb/aria/reports/Introducing-CoreWeave-ARIA-AI-Research-and-Iteration-Agent--V...
1•OnlineInference•11m ago•0 comments

Show HN: Wealtii – Digital Asset index funds with on-chain 1:1 backed vaults

https://wealtii.com/funds
3•zayd7861•13m ago•0 comments

macOS California Camino

https://basicappleguy.com/basicappleblog/macos-california-adventure
1•herbertl•13m ago•0 comments

PostgreSQL Jsonb vs. MongoDB BSON: The Real Architectural Tradeoffs

https://visualeaf.com/blog/postgresql-jsonb-vs-mongodb-bson-architectural-tradeoffs/
1•amai•15m ago•0 comments

Show HN: I built Exfault, agentic mobile app pentesting tool

https://www.exfault.com/
4•shubh_sidhu•18m ago•0 comments

Show HN: Swarm intelligence without degradation using two Qwen models

1•kofdai•18m ago•0 comments

Creating a Personalised Bin Calendar

https://alexwlchan.net/2026/bin-calendar/
1•surprisetalk•20m ago•0 comments

Toto: From Toilets to E-Chucks [video]

https://www.youtube.com/watch?v=CIB49e_r1OI
1•lyall•20m ago•0 comments

Show HN: Interactive Calculators Hub

3•StizzurpXDD•22m ago•0 comments

SteamOS now offered with new gaming prebuilt PCs

https://videocardz.com/newz/steamos-now-offered-with-new-gaming-prebuilt-pcs
3•LorenDB•24m ago•0 comments

Browser CLI for Agents

https://github.com/detrin/brow
3•kekqqq•25m ago•0 comments

A playbook to rank #1 of the day on ProductHunt

https://m-ric.com/blog/how-to-get-number-1-on-producthunt/
1•aubanel•26m ago•0 comments