Show HN: Verdic Guard – Deterministic guardrails to prevent LLM hallucinations

https://www.verdic.dev
1•kundan_s__r•1m ago•0 comments

Building iOS UI with Coding Agents Is Slow

https://qckfx.com/blog/building-ios-ui-with-coding-agents-is-slow-heres-how-to-fix-it
1•chw9e•1m ago•0 comments

$4,500 Conductive Suit Could Make Power-Line Work Safer

https://spectrum.ieee.org/transmission-line-safety-suit
1•nradov•1m ago•0 comments

Scott Adams Has Died

https://www.washingtonpost.com/obituaries/2026/01/13/scott-adams-dead-dilbert/
1•jvanderbot•1m ago•1 comments

"Leave Yourself an Out"

https://seths.blog/2026/01/leave-yourself-an-out/
1•7777777phil•3m ago•0 comments

Cold weather and data centres drive up US greenhouse gas emissions

https://www.bbc.com/news/articles/cj9r3832j47o
1•paran0rmal•4m ago•0 comments

Impact of Bluetooth headset usage on thyroid nodules

https://pmc.ncbi.nlm.nih.gov/articles/PMC11192738/
1•typeofhuman•4m ago•1 comments

Show HN: A workflow for publishing AI-assisted content without manual rewrites

https://plagiarismremover.ai/
1•PlagiarismRem•4m ago•0 comments

Scott Adams Dead at 68

https://en.wikipedia.org/wiki/Scott_Adams
1•kolektiv•4m ago•1 comments

Show HN: Aristotle, an AI-powered e-reader that helps you read deeper

https://www.aristotlereader.com
1•smahendrakar•4m ago•0 comments

Reducing Hip Strain with High-Density Cushioning

https://dreamhomestore.co.uk/collections/gaming-chairs
1•lewisrichson•5m ago•1 comments

Gh Account Permabanned – Help?

1•nicomeemes•5m ago•0 comments

Nvidia Rubin's Network Doubles Bandwidth

https://spectrum.ieee.org/nvidia-rubin-networking
1•rbanffy•5m ago•0 comments

Show HN: Kalshi Market Intelligence and AI Signal Analyst

https://apify.com/brazen_vanguard/kalshi-market-intelligence-signal-analyst
1•founder_mode•6m ago•0 comments

Show HN: Verdic Guard – Deterministic guardrails to prevent LLM hallucinations

2•kundan_s__r•6m ago•0 comments

Show HN: Hivinq – Copilot for customer support teams

https://www.hivinq.com/
1•vishalds•6m ago•0 comments

Show HN: Chains – Word association puzzles that form a loop

https://puzzles.madebynathan.com/chains
1•nathan_f77•6m ago•0 comments

EdiNation is the new EDI Notepad

https://edination.edifabric.com/
1•donzog•6m ago•0 comments

Show HN: Zero to One but it's a game

https://zerooneterminal.com
1•stellarcat•8m ago•0 comments

Headroom – context optimization layer for tool-using agents

https://github.com/chopratejas/headroom
1•chopratejas•9m ago•2 comments

The Cognitive Edge: Why Silicon Valley Can't Stop Taking White Powder

https://twitter.com/unicodeveloper/status/2011102214576799935
1•unicodeveloper•9m ago•0 comments

Temperature Effects in Watches

https://www.vintagewatchstraps.com/temperatureeffects.php
1•pillars•11m ago•0 comments

VLLM Large Scale Serving: DeepSeek 2.2k Tok/S/H200 with Wide-EP

https://blog.vllm.ai/2025/12/17/large-scale-serving.html
1•robertnishihara•11m ago•0 comments

Show HN: I built Gridfy – live website widgets from Airtable, Notion and Sheets

https://gridfy.io
1•jumagrande•12m ago•0 comments

Show HN: I Will Do Whatever to Get Primeagen to My Hackathon Stream

https://vibe.devpost.com/
1•abdibrokhim•12m ago•0 comments

Show HN: Term.stream – Stream your terminal to any device via URL

https://term.stream
1•zero_dev•15m ago•1 comments

Show HN: RSS Reader using the browser's local storage

https://github.com/travisred/rss-local-storage
1•travisr•15m ago•0 comments

Show HN: WordsUnite – Synchronized Crowd Chants at Scale

https://wordsunite.us/
1•wordsunite•15m ago•1 comments

Tools for AI Collaboration Are a Different Design Problem

https://michaelhegner.com/blog/tools-for-ai-collaboration-are-a-different-design-problem
1•shellDev•16m ago•0 comments

Every GitHub Object Has Two IDs

https://www.greptile.com/blog/github-ids
1•dakshgupta•18m ago•0 comments

Show HN: AI Mime – Record and parameterize workflows for Computer Use agents

https://github.com/prakhar1114/ai_mime
1•prakharjain•1h ago
Hi HN,

I’ve been experimenting with the latest "computer use" models and tools (like Gemini 3 Flash, Qwen 3 VL Plus, and Browser Use), and while they are impressive, I hit a wall with reliability in production use cases.

The main issue I found is context. When we give agents simple natural language prompts (e.g., "download the invoice"), they often lack the nuance to handle edge cases or specific UI quirks. They try to be "creative" when they should be deterministic.

I built AI Mime to solve this by shifting from "prompting" to "demonstrating." It’s an open-source macOS tool that lets you record a workflow, parameterize it, and replay it using computer-use agents.

How it works:

Record: It captures native macOS events (mouse, keyboard, window states) to create a ground-truth recording of the task.

Refine (the interesting part): It uses an LLM to parse that raw recording into parameterized instructions. Instead of a static macro, you get manageable subtasks with inputs/variables you define. This constrains the agent to a specific "happy path" while still allowing it to handle dynamic elements.

Replay: The agent executes the subtasks using the computer-use interface, but with significantly higher success rates because it has "seen" the exact steps required.
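To make the record/refine/replay idea concrete, here is a minimal Python sketch of what a parameterized workflow could look like. It is not AI Mime's actual code or API: `Subtask`, `Workflow`, `replay`, and the `run_step` callback are hypothetical names made up for illustration, and the instructions are hand-written rather than parsed from a real recording by an LLM.

```python
from dataclasses import dataclass, field
from string import Template
from typing import Callable


def placeholders(instruction: str) -> set:
    """Return the $name / ${name} placeholders used in an instruction."""
    found = set()
    for match in Template.pattern.finditer(instruction):
        name = match.group("named") or match.group("braced")
        if name:
            found.add(name)
    return found


@dataclass
class Subtask:
    """One step of a recorded workflow (hypothetical structure)."""
    name: str
    instruction: str  # may contain $placeholders for variable inputs

    def render(self, params: dict) -> str:
        # substitute() raises KeyError on a missing value, so a replay
        # fails fast instead of leaving the agent to improvise.
        return Template(self.instruction).substitute(params)


@dataclass
class Workflow:
    name: str
    subtasks: list = field(default_factory=list)

    def required_params(self) -> set:
        names = set()
        for subtask in self.subtasks:
            names |= placeholders(subtask.instruction)
        return names


def replay(workflow: Workflow, params: dict,
           run_step: Callable[[str], bool]) -> list:
    """Run subtasks in order; stop at the first step that reports failure.

    run_step stands in for the computer-use agent call (an assumption).
    Returns a log of (subtask name, rendered instruction, success) tuples.
    """
    missing = workflow.required_params() - set(params)
    if missing:
        raise ValueError(f"missing parameters: {sorted(missing)}")
    log = []
    for subtask in workflow.subtasks:
        prompt = subtask.render(params)
        ok = run_step(prompt)
        log.append((subtask.name, prompt, ok))
        if not ok:
            break
    return log


# A hand-written stand-in for what the "refine" step might produce.
invoice_flow = Workflow(
    name="download_invoice",
    subtasks=[
        Subtask("open_portal", "Open the billing portal in the browser"),
        Subtask("find_invoice", "Search for invoice $invoice_id for ${customer}"),
        Subtask("download", "Click Download and save the PDF as $invoice_id.pdf"),
    ],
)
```

Calling `replay(invoice_flow, {"invoice_id": "INV-42", "customer": "Acme"}, run_step)` would hand the agent one fully rendered instruction at a time. The per-step log is the kind of thing that would make a run observable, and stopping at the first failed step keeps the agent on the recorded path instead of letting it continue from a bad state.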

The goal is to make these agents observable and repeatable enough for actual RPA work.

The repo is here: https://github.com/prakhar1114/ai_mime

I’d love to hear your thoughts on the approach or how you are currently handling state/reliability with computer-use models.