news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

The Genus Amanita

https://www.mushroomexpert.com/amanita.html

1•rolph•2m ago•0 comments

We have broken SHA-1 in practice

https://shattered.io/

1•mooreds•3m ago•1 comments

Ask HN: Was my first management job bad, or is this what management is like?

1•Buttons840•4m ago•0 comments

Ask HN: How to Reduce Time Spent Crimping?

1•pinkmuffinere•5m ago•0 comments

KV Cache Transform Coding for Compact Storage in LLM Inference

https://arxiv.org/abs/2511.01815

1•walterbell•10m ago•0 comments

A quantitative, multimodal wearable bioelectronic device for stress assessment

https://www.nature.com/articles/s41467-025-67747-9

1•PaulHoule•12m ago•0 comments

Why Big Tech Is Throwing Cash into India in Quest for AI Supremacy

https://www.wsj.com/world/india/why-big-tech-is-throwing-cash-into-india-in-quest-for-ai-supremac...

1•saikatsg•12m ago•0 comments

How to shoot yourself in the foot – 2026 edition

https://github.com/aweussom/HowToShootYourselfInTheFoot

1•aweussom•12m ago•0 comments

Eight More Months of Agents

https://crawshaw.io/blog/eight-more-months-of-agents

3•archb•14m ago•0 comments

From Human Thought to Machine Coordination

https://www.psychologytoday.com/us/blog/the-digital-self/202602/from-human-thought-to-machine-coo...

1•walterbell•14m ago•0 comments

The new X API pricing must be a joke

https://developer.x.com/

1•danver0•15m ago•0 comments

Show HN: RMA Dashboard fast SAST results for monorepos (SARIF and triage)

https://rma-dashboard.bukhari-kibuka7.workers.dev/

1•bumahkib7•16m ago•0 comments

Show HN: Source code graphRAG for Java/Kotlin development based on jQAssistant

https://github.com/2015xli/jqassistant-graph-rag

1•artigent•21m ago•0 comments

Python Only Has One Real Competitor

https://mccue.dev/pages/2-6-26-python-competitor

3•dragandj•22m ago•0 comments

Tmux to Zellij (and Back)

https://www.mauriciopoppe.com/notes/tmux-to-zellij/

1•maurizzzio•23m ago•1 comments

Ask HN: How are you using specialized agents to accelerate your work?

1•otterley•24m ago•0 comments

Passing user_id through 6 services? OTel Baggage fixes this

https://signoz.io/blog/otel-baggage/

1•pranay01•25m ago•0 comments

DavMail Pop/IMAP/SMTP/Caldav/Carddav/LDAP Exchange Gateway

https://davmail.sourceforge.net/

1•todsacerdoti•26m ago•0 comments

Visual data modelling in the browser (open source)

https://github.com/sqlmodel/sqlmodel

1•Sean766•28m ago•0 comments

Show HN: Tharos – CLI to find and autofix security bugs using local LLMs

https://github.com/chinonsochikelue/tharos

1•fluantix•28m ago•0 comments

Oddly Simple GUI Programs

https://simonsafar.com/2024/win32_lights/

1•MaximilianEmel•28m ago•0 comments

The New Playbook for Leaders [pdf]

https://www.ibli.com/IBLI%20OnePagers%20The%20Plays%20Summarized.pdf

1•mooreds•29m ago•1 comments

Interactive Unboxing of J Dilla's Donuts

https://donuts20.vercel.app

1•sngahane•30m ago•0 comments

OneCourt helps blind and low-vision fans to track Super Bowl live

https://www.dezeen.com/2026/02/06/onecourt-tactile-device-super-bowl-blind-low-vision-fans/

1•gaws•32m ago•0 comments

Rudolf Vrba

https://en.wikipedia.org/wiki/Rudolf_Vrba

1•mooreds•32m ago•0 comments

Autism Incidence in Girls and Boys May Be Nearly Equal, Study Suggests

https://www.medpagetoday.com/neurology/autism/119747

1•paulpauper•33m ago•0 comments

Wellness Hotels Discovery Application

https://aurio.place/

1•cherrylinedev•34m ago•1 comments

NASA delays moon rocket launch by a month after fuel leaks during test

https://www.theguardian.com/science/2026/feb/03/nasa-delays-moon-rocket-launch-month-fuel-leaks-a...

1•mooreds•35m ago•0 comments

Sebastian Galiani on the Marginal Revolution

https://marginalrevolution.com/marginalrevolution/2026/02/sebastian-galiani-on-the-marginal-revol...

2•paulpauper•38m ago•0 comments

Ask HN: Are we at the point where software can improve itself?

1•ManuelKiessling•38m ago•2 comments

Open in hackernews

Hard part about building AI Agents isn't planning it's making them stick to plan

https://sia.build/blog/production-ai-agents

9•anup_sia•3mo ago

Comments

anup_sia•3mo ago

LLMs are great at creating plans but terrible at following them. I've seen agents claim to create 5 files but only make 2, repeat API calls 3x, skip error handling, then report success anyway. The fix: treat execution like todo management—track every step, block the agent if it tries tools not in the current step, and verify completion (don't trust its word, actually check if the file exists). This plus guardrails and git-like versioning improved the reliability siginificantly

verdverm•3mo ago

seems reasonable and resonates with the approach I plan to take when I start building my agent

sunir•3mo ago

If the plan is too big to fit into context or requires too much attention it overwhelms the llm. You need to decompose into tasks and todos aggressively.

mayankd•3mo ago

For sure, agents tend to bang their heads against a wall, and can deviate in surprising ways to attempt to escape that wall. Balancing the scope of a plan and making agents stick to it is a tricky balance to strike