frontpage.

Auto-harness: Self improving agentic systems with auto-evals (open-sourced)

https://twitter.com/gauri__gupta/status/2040251170099524025
2•gauri1902•1h ago

Comments

gauri1902•1h ago
Hey all, we just released our work on self-improving AI systems at NeoSigma. We demonstrate our automatic agent-harness improvement system on Tau3 benchmark tasks, where the agent's score improves from 0.56 to 0.78 (a ~40% relative jump) while the system mines failures and automatically maintains live evals. A lot of people asked to try the self-improving loop on their own agents, so we open-sourced our setup as auto-harness: an open-source library for self-improving agentic systems with auto-evals. Connect your agent and let it cook over the weekend. Watch it go brrrr!! Link to the article: https://x.com/gauri__gupta/status/2040251170099524025
deadinator•1h ago
Point it at your agent. Leave it running. Come back to a better agent with evals!!

FDA had already warned the self-proclaimed 'fastest growing company in history'

https://www.drugdiscoverytrends.com/the-new-york-times-spotlighted-medvi-the-fda-had-already-warn...
1•thm•1m ago•0 comments

Emotion concepts and their function in a large language model

https://www.anthropic.com/research/emotion-concepts-function
1•dnw•1m ago•0 comments

Show HN: Cursor Cmd+K like and macOS spotlight like TUI for all terminals

https://github.com/64bit/commandOK
1•gigapotential•4m ago•0 comments

Anyone switch accounts for Claude Code, did you lose everything?

1•dpark2026•5m ago•0 comments

¡Haciendo Historia! Celebrating PyCon US's First-Ever Spanish-Language Keynote

https://pycon.blogspot.com/2026/04/haciendo-historia-celebrating-pycon-uss.html
2•lumpa•19m ago•0 comments

Auralo: A nice new radio app, check it out

https://testflight.apple.com/join/mEtdrzZ5
1•marc0janssen•24m ago•2 comments

Trump fires Pam Bondi as US Attorney General

https://www.reuters.com/world/trump-fires-pam-bondi-us-attorney-general-cnn-fox-2026-04-02/
2•mgh2•28m ago•3 comments

AgentShift – One command migrates your OpenClaw agents to NemoClaw

https://agentshift.sh/
1•ogkranthi•36m ago•0 comments

Uber engineer alleges hostile 'boys club' culture, firing after cancer leave

https://archive.org/details/10127280
3•nickvec•39m ago•0 comments

Spath and Splan

https://sumato.ai/posts/2026-04-04-spath-and-splan.html
1•jasonmoo•40m ago•0 comments

Ask HN: Interactive Car Mechanics Guide?

1•id00•44m ago•0 comments

Scientists are working on "everything vaccines"

https://economist.com/science-and-technology/2026/04/01/scientists-are-working-on-everything-vacc...
5•andsoitis•44m ago•1 comment

Donald Knuth: Open Letter to Condoleezza Rice (2002)

https://www-cs-faculty.stanford.edu/~knuth/rice.html
3•car•45m ago•1 comment

100 Years of the Iron Ring

https://engineerscanada.ca/news-and-events/news/100-years-of-the-iron-ring-a-symbol-of-an-enginee...
1•jruohonen•51m ago•0 comments

Vibe coded a design tool for a client handover as a non-technical founder

https://www.ugh.design
2•jayantrao94•56m ago•1 comment

Video Friday: Digit Learns to Dance

https://spectrum.ieee.org/video-humanoid-dancing
2•jruohonen•58m ago•0 comments

AdGuard ad trackers: What ad-based surveillance does to your traffic

https://adguard.com/en/blog/adguard-ad-tracker-report-2025.html
2•XzetaU8•1h ago•0 comments

Pale Blue Dot

https://en.wikipedia.org/wiki/Pale_Blue_Dot
3•thunderbong•1h ago•1 comment

EU cyber agency attributes major data breach to TeamPCP hacking group

https://therecord.media/european-commission-cyberattack-teampcp
4•jruohonen•1h ago•1 comment

Show HN: AI Dev Board – Job Board for AI Developers with a Full REST API

https://aidevboard.com/
2•8bitconcepts•1h ago•0 comments

Ask HN: Why still embed heavy 3rd-party iFrames for simple social proof?

2•LordKode•1h ago•0 comments

Show HN: HyprMac – I missed Hyprland after switching to Mac, so I built it

https://github.com/zacharytgray/HyprMac
2•zachtgray•1h ago•0 comments

Thoughts on AI and Research [pdf]

https://economics.mit.edu/sites/default/files/2026-04/IA%20AI%20note_1.pdf
2•jxmorris12•1h ago•0 comments

Jungle old school drum and bass radio

https://radio.aklein.studio/public/lounge24_radio
3•misterthp•1h ago•0 comments

It's open season for refusing AI

https://www.bloodinthemachine.com/p/its-open-season-for-refusing-ai
7•HotGarbage•1h ago•2 comments

No luck for Broadcom as Netflix and Quinn Emanuel succeed in nullity claim

https://www.juve-patent.com/cases/no-luck-for-broadcom-as-netflix-and-quinn-emanuel-succeed-in-an...
2•breve•1h ago•0 comments

How to Evaluate Claude Skill Output Quality for Prompt-to-SQL Scenarios

https://dekart.xyz/blog/how-to-evaluate-claude-skill-output-quality-for-prompt-to-sql-scenarios/
2•delfrrr•1h ago•0 comments

Mcpx: a Rust proxy that catches MCP schema changes and tool poisoning at runtime

https://github.com/MeghP89/mcpx
2•meghp89•1h ago•0 comments

Naming rights to street auctioned in San Francisco

https://paintastreet.com/auction
3•18nleung•1h ago•1 comment

Show HN: Clangine-de-Poitrine

https://github.com/jerpint/clangine-de-poitrine
1•jerpint•1h ago•0 comments