frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Adding verification steps to AI agents made them worse. I tested it 29 times

https://substack.com/@krzysztofdudek/p-192580258
2•chrisdudek•1h ago

Comments

nareyko•1h ago
Small observation. Verification often looks like an obvious improvement, but it changes the reward structure of the system.

Agents start optimizing for passing the verification step, not necessarily for solving the task itself.

You sometimes see a similar effect in recommender systems: optimizing secondary metrics shifts system behavior.

chrisdudek•1h ago
I suspected that given some responses the agent was giving during the experimentation. Looked sometimes tendentious when measuring.
nareyko•14m ago
Right - the verifier effectively becomes part of the reward function.

So the system starts optimizing for passing the verifier, not necessarily for solving the task.

You're right to be anxious about AI: This is how much we are building

https://www.dumky.net/posts/youre-right-to-be-anxious-about-ai-this-is-how-much-we-are-building/
2•dmkii•7m ago•2 comments

Mathematical methods and human thought in the age of AI

https://terrytao.wordpress.com/2026/03/29/mathematical-methods-and-human-thought-in-the-age-of-ai/
1•jjgreen•9m ago•0 comments

Swift SDK for Android

https://www.swift.org/documentation/articles/swift-sdk-for-android-getting-started.html
1•devy•14m ago•0 comments

Polygraphs have major flaws. Are there better options?

https://undark.org/2026/03/25/lie-detection-polygraph-accuracy/
2•Tijana329•18m ago•0 comments

Stripe Is Down

https://downdetector.fr/en/status/stripe/
6•pinter69•20m ago•0 comments

Show HN: IsDisposable – Open-source disposable email detection (160K+ domains)

https://www.npmjs.com/package/@isdisposable/js
1•junaidshaukat•20m ago•0 comments

Mistral raises $830M to build Nvidia-powered AI centres in Europe

https://www.ft.com/content/229f4f59-d518-4e00-abd6-5a5b727cd2aa
1•macleginn•20m ago•2 comments

Show HN: I'm a Happy Engineer [video]

https://www.youtube.com/watch?v=f1a_MRLibqU
1•denysvitali•23m ago•0 comments

How to Survive in the Tech industry in 2026

https://blog.phuaxueyong.com/post/2026-03-23-how-to-survive-tech-in-2026/
5•xueyongg•29m ago•1 comments

How Can Universities Value-Add Their Alumni?

https://blog.phuaxueyong.com/post/2025-06-27-university-role-in-alumni-engagement/
2•xueyongg•29m ago•0 comments

The CTO's Burden: Building What the World Doesn't See

https://blog.phuaxueyong.com/post/2025-04-29-questions-for-cto/
1•xueyongg•29m ago•0 comments

Show HN: Travel app that replaces trip research with a 30s briefing (TestFlight)

https://globallybased.com
2•ilyagruzhevski•31m ago•0 comments

Sad Story of Soviet Compact Disc Players

https://sovietrock.com/mediums/cd/sad-story-of-soviet-compact-disc-players/
2•thenthenthen•33m ago•0 comments

Credential Broker for Agents (CB4A)

https://datatracker.ietf.org/doc/draft-hartman-credential-broker-4-agents/
1•jruohonen•35m ago•0 comments

We tricked 1M+ bots and hackers with our honeypot

https://github.com/BlessedRebuS/Krawl
1•blessedrebus•36m ago•0 comments

Every Package You Install Can Read Your Secrets

https://www.eliranturgeman.com/2026/03/28/supply-chain-attacks/
1•gsky•38m ago•1 comments

Copilot Adverts in Pull Requests

https://github.com/search
2•tomwphillips•38m ago•3 comments

Show HN: A curated list of plugins,themes, agents,projects, for OpenCode

https://github.com/awesome-opencode/awesome-opencode
3•ishqdehlvi•48m ago•0 comments

Yahoo turns to AI-powered answer engine Scout to lead back to online search

https://isp.netscape.com/tech/story/0001/20260327/a9ec7ff0f7af72662b6d98ddd9c5280d
2•Imustaskforhelp•49m ago•1 comments

The Missing Equation of Quantum Biology

https://sectio-aurea-q.github.io/emc2-of-quantum-biology.html
2•sectio-aurea-q•59m ago•0 comments

Show HN: Veil – A Minimal Neovim GUI for macOS with Metal Rendering

https://github.com/rainux/Veil
3•rainux•1h ago•0 comments

AI tool that scores your job's displacement risk by role and skills

https://careerrisk.ee/
4•Equitis•1h ago•2 comments

Selling to AI Agents

https://mattgiustwilliamson.substack.com/p/selling-to-ai-agents
4•MattSWilliamson•1h ago•3 comments

Introduction to Gaussian Splats [video]

https://www.youtube.com/watch?v=X8yRlA7jqEQ
2•Khaine•1h ago•0 comments

Google's Larry Page Won the Bidding War for DeepMind

https://www.wsj.com/tech/ai/deepmind-google-demis-hassabis-5bd6de54
2•yihongs•1h ago•0 comments

Ask HN: Who needs contributors? (March 2026)

5•Kathan2651•1h ago•0 comments

Show HN: TermCanvas – An infinite canvas for your terminals

https://github.com/blueberrycongee/termcanvas
1•blueberrycongee•1h ago•0 comments

Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models

https://dani2442.github.io/posts/continuous-rl/
18•sebzuddas•1h ago•5 comments

Fast Image AI Image Enhancer

https://fastimage.ai/ai-image-enhancer
1•lucas0953•1h ago•0 comments

Twitching Before You Sprint

https://mikefisher.substack.com/p/twitching-before-you-sprint
1•kiyanwang•1h ago•0 comments