frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Open-source dashboard for your domain experts to improve your AI Agents

https://github.com/getevalkit/evalkit
3•mellowcookie•3h ago
Hey HN! We built EvalKit, a library you embed to capture agent actions and a UI where domain experts give feedback, evaluate and improve AI agents.

We experienced, in large agentic systems, prompt-engineering or auto-prompt improvement tool can get accuracy from 0 to 50% but for increasing accuracy to 100% we had to work with domain experts. Example -> In a law ai agent, lawyers are needed because law is complex and lawyers have a deeper context compared to non-lawyers.

Other evaluation tools in the market focus on the experience of the developer and we are focusing on making as easy as possible for your domain experts to improve agentic systems on their own. Each agent action automatically routes a feedback request to the appropriate domain expert; once they respond, the system pinpoints the responsible agent and applies the necessary change which they can test.

We’re borrowing our OSS business model from Supabase who makes it easy to self-host with features reserved for enterprise and a paid version for managed cloud service. Right now, all of our code is available under a permissive license (MIT).

We’re admittedly early, and many features are in the process of being built. We would really appreciate a star and feedback on how we can make it useful to you. Thanks!

Morphology of a Marvel Movie

https://github.com/dhealy05/morphology_of_a_marvel_movie
1•higuidebot•38s ago•0 comments

Kelp – simple replacement for homebrew on macOS

https://github.com/crhuber/kelp
1•amai•3m ago•0 comments

Show HN: Let Me Prompt It for You

https://lmpify.com
2•janwilmake•3m ago•0 comments

China says U.S. undermined trade talks with Huawei chip warning

https://www.cnbc.com/2025/05/19/china-us-trade-tariffs-chip-huawei.html
2•zerosizedweasle•4m ago•1 comments

I made my own Email Protocol [video]

https://www.youtube.com/watch?v=nALc9GwZdFc
1•hisamafahri•4m ago•0 comments

Async/Await versus the Calloop Model

https://notgull.net/calloop/
1•simonpure•8m ago•0 comments

Remembering the ISP that David Bowie ran for 8 years

https://hackaday.com/2025/05/19/remembering-the-isp-that-david-bowie-ran-for-eight-years/
3•ethanpil•9m ago•0 comments

Launch HN: Better Auth (YC X25) – Authentication Framework for TypeScript

1•bekacru•11m ago•0 comments

Ukraine can move beyond its Soviet architectural legacy

https://www.counteroffensive.news/p/how-ukraine-can-move-beyond-its-soviet
3•dbuxton•13m ago•0 comments

Show HN: Mirror World, create an AI clone of anyone

https://mirr.world/
2•p-sharpe•15m ago•0 comments

Zymtrace AI Flamegraph in Rust and WASM

https://www.zymtrace.com/article/zymtrace-ai-flamegraph
1•iogbole•16m ago•0 comments

Oops, I accidentally vibe-coded a ChatGPT client for my Apple Watch

https://richarddas.com/blog/chatgpt-client-for-apple-watch/
3•cleverbit•17m ago•0 comments

Taiwan to Ramp Up Gas Imports After Shuttering Last Nuclear Plant

https://e360.yale.edu/digest/taiwan-nuclear-power-gas-imports
2•YaleE360•18m ago•1 comments

How Xi sparked China's electricity revolution

https://www.ft.com/content/f86782fa-9f2e-448a-b710-29e787dc9831
1•Traces•18m ago•0 comments

Over 125 DLSS 4 with Multi Frame Generation Games and Apps Available Now

https://www.nvidia.com/en-us/geforce/news/125-dlss-4-multi-frame-gen-games-more-announced-computex-2025/
1•ksec•19m ago•1 comments

A visual guide to Pope Leo XIV

https://multimedia.scmp.com/infographics/news/world/article/3310236/habemus-papa/index.html
1•gmays•20m ago•0 comments

Show HN: I Built a Framework That Makes LLMs Think Like Heinlein's Fair Witness

https://fairwitness.bot/
1•9wzYQbTYsAIc•20m ago•1 comments

Show HN: Distill – Automated company and industry monitoring

https://www.distillintelligence.com/
1•gustavfridell•20m ago•1 comments

Tesla Regret Syndrome

https://www.seattletimes.com/seattle-news/protests-take-a-satirical-approach-in-tesla-regret-syndrome-ad/
9•dxs•21m ago•2 comments

What MCP Is Missing: UI Components

https://www.maximepeabody.com/blog/mcp-missing-ui
2•peab•22m ago•0 comments

A Dialogue on Agentic Coding

https://substack.com/home/post/p-163894567
1•beala•23m ago•0 comments

JavaScript Algorithms – Clean and beginner-friendly implementations

https://github.com/AllThingsSmitty/javascript-algorithms
1•AllThingsSmitty•23m ago•1 comments

We Need Lisp Machines

https://fultonsramblings.substack.com/p/why-we-need-lisp-machines
3•irthomasthomas•23m ago•0 comments

In the Future, China Will Be Dominant. The U.S. Will Be Irrelevant

https://www.nytimes.com/2025/05/19/opinion/china-us-trade-tariffs.html
4•Traces•24m ago•2 comments

Integrated Brillouin photonics in thin-film lithium niobate

https://www.science.org/doi/10.1126/sciadv.adv4022
1•PaulHoule•24m ago•0 comments

How many satellites orbit Earth?

https://www.livescience.com/how-many-satellites-orbit-earth
1•Brajeshwar•24m ago•0 comments

Still booting after all these years: The people using ancient Windows computers

https://www.bbc.com/future/article/20250516-the-people-stuck-using-ancient-windows-computers
2•graeme•24m ago•0 comments

Metabolic Pathways

https://faculty.cc.gatech.edu/~turk/bio_sim/articles/metabolic_pathways.png
1•Tomte•24m ago•0 comments

Regeneron agrees to buy 23andMe, promises ethical use of customers' DNA data

https://www.reuters.com/business/healthcare-pharmaceuticals/regeneron-buy-bankrupt-genetic-testing-firm-23andme-256-million-2025-05-19/
2•c420•24m ago•0 comments

Writing a Technical Book for Manning (2022)

https://www.tunetheweb.com/blog/writing-a-technical-book-for-manning/
1•Tomte•25m ago•0 comments