frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

We hit a wall testing AI agents, agents simulations works better

1•draismaa•4h ago
We've been working with teams building AI agents (agentic systems, with actual execution) But here's the thing: everyone says “agents are the future,” yet no one really knows how to test them. Some teams are manually walking through conversations, others are just shipping and "vibe checking" what comes back. Both break down at scale. The real problem? We’re testing agents like software, but agents don’t behave like software. They make decisions, adapt, escalate, reason across contexts. They're more like processes than functions. Rogerio, our CTO, wrote up a deeper dive on how we see the future of agent testing, and why agent simulations (not hardcoded flows) are becoming the new unit tests for AI systems. We built LangWatch scenario to let teams simulate real-world agent behavior and catch regressions early on. Would love feedback from folks who’ve been burned by this or hacked together their own simulation setups.

Myths and mythconceptions: what does it mean to be a programming language?(2021)

https://dl.acm.org/doi/10.1145/3480947
1•mpweiher•1m ago•0 comments

DeepMind Close to Solving the Navier-Stokes Millenium Prize Problem

https://english.elpais.com/science-tech/2025-06-24/spanish-mathematician-javier-gomez-serrano-and-google-deepmind-team-up-to-solve-the-navier-stokes-million-dollar-problem.html
1•sajid•1m ago•0 comments

The Wheel (Direction)

1•santiviquez•2m ago•0 comments

Tesla head of manufacturing Omead Afshar fired by Elon Musk

https://www.cnbc.com/2025/06/26/tesla-head-of-manufacturing-omead-afshar-fired-by-elon-musk.html
2•sundaeofshock•2m ago•0 comments

VMware perpetual license holder receives audit letter from Broadcom

https://arstechnica.com/information-technology/2025/06/vmware-perpetual-license-holder-receives-audit-letter-from-broadcom/
1•TMWNN•3m ago•0 comments

`blaze-install` is a drop-in CLI that installs NPM packages

https://github.com/TrialLord/Blazed-install
1•teckmill•3m ago•0 comments

Elastic's journey to build Elastic Cloud Serverless

https://www.elastic.co/blog/journey-to-build-elastic-cloud-serverless
1•dpifke•5m ago•0 comments

The Washington Post Will Ask Some Sources to Annotate Its Stories

https://www.nytimes.com/2025/06/25/business/washington-post-annotations-comments.html
1•jaredwiener•6m ago•0 comments

Marge Simpson isn't dead yet, so everyone can calm down

https://www.cnn.com/2025/06/26/entertainment/marge-simpson-isnt-dead-yet
1•austinallegro•10m ago•0 comments

Apple's Swift coding language is working on Android support

https://9to5google.com/2025/06/26/swift-coding-language-android-support/
1•rbanffy•10m ago•0 comments

Show HN: Chisel – Profile GPU Kernels Without a GPU (Nvidia and AMD)

https://github.com/Herdora/chisel
1•technoabsurdist•14m ago•0 comments

Salesforce CEO Says 30% of Internal Work Is Being Handled by AI

https://www.bloomberg.com/news/articles/2025-06-26/salesforce-ceo-says-30-of-internal-work-is-being-handled-by-ai
3•petethomas•15m ago•0 comments

A.I. Is Homogenizing Our Thoughts

https://www.newyorker.com/culture/infinite-scroll/ai-is-homogenizing-our-thoughts
3•thoughtpeddler•15m ago•0 comments

Lalo Schifrin, Film Composer Who Wrote 'Mission: Impossible' Theme, Dies at 93

https://variety.com/2025/music/news/lalo-schifrin-dead-mission-impossible-film-composer-1236442000/
2•rb2e•16m ago•1 comments

Coloring.app – Custom AI Coloring Pages and Books

https://coloring.app
1•presson•18m ago•1 comments

Simulating a neural operating system with Gemini 2.5 Flash-Lite

https://developers.googleblog.com/en/simulating-a-neural-operating-system-with-gemini-2-5-flash-lite/
2•lastdong•18m ago•0 comments

Can a Brain Be Preserved and Uploaded? Neuroscience Reveals 40% Chance It Could

https://www.iflscience.com/can-a-brain-be-preserved-and-uploaded-neuroscientist-survey-reveals-surprising-40-percent-probability-that-yes-it-could-79775
4•Bluestein•21m ago•0 comments

I started writing the hono.js of Golang

https://github.com/buildwithgo/amaro
2•bernaforcillo•21m ago•0 comments

BIS: Stablecoins Fail Key Tests of Real Money

https://cointelegraph.com/news/stablecoins-fail-money-bis-report
3•walterbell•22m ago•0 comments

Show HN: Built a Food Scanner for Longevity

https://www.getbiohack.app
2•Fbue•25m ago•0 comments

Understanding the sport viewership experience using functional IR spectroscopy

https://www.nature.com/articles/s41598-025-96895-7
2•PaulHoule•26m ago•0 comments

Built something to help with panic attacks – what am I missing?

https://abler.health
2•Kreshnik•27m ago•2 comments

Britain shuns $34B Morocco-UK subsea power project

https://www.reuters.com/business/energy/britain-rejects-morocco-uk-green-energy-cable-project-2025-06-26/
1•zekrioca•27m ago•0 comments

Bluefishjs: Composing Diagrams in with Declarative Relations

https://dl.acm.org/doi/10.1145/3654777.3676465
1•fanf2•27m ago•0 comments

Stryker is a new generation mobile pentest application

https://github.com/Stryker-Defense-Inc/strykerapp
1•mooreds•28m ago•0 comments

RFK Jr's new vaccine panel votes against preservative in flu shots in shock move

https://www.theguardian.com/us-news/2025/jun/26/rfk-flu-shot-vaccines-panel
12•voxadam•28m ago•0 comments

Carrot Cache: High-Performance, SSD-Friendly Caching Library for Java

https://medium.com/carrotdata/carrot-cache-high-performance-ssd-friendly-caching-library-for-java-30bf2502ff76
2•mooreds•29m ago•0 comments

Genomics coordinate systems

https://docs.rs/omics-coordinate/latest/omics_coordinate/
1•clmcleod•29m ago•0 comments

Everything We Just Learned About the Ordnance Penetrator Strikes on Iran

https://www.twz.com/air/gbu-57-massive-ordnance-penetrator-strikes-on-iran-everything-we-just-learned
3•2OEH8eoCRo0•30m ago•3 comments

Mammal evolution of upright posture was no cake walk

https://cosmosmagazine.com/history/palaeontology/mammal-evolution-upright-posture/
2•Bluestein•30m ago•0 comments