frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: I built a clawdbot that texts like your crush

https://14.israelfirew.co
1•IsruAlpha•2m ago•0 comments

Scientists reverse Alzheimer's in mice and restore memory (2025)

https://www.sciencedaily.com/releases/2025/12/251224032354.htm
1•walterbell•5m ago•0 comments

Compiling Prolog to Forth [pdf]

https://vfxforth.com/flag/jfar/vol4/no4/article4.pdf
1•todsacerdoti•6m ago•0 comments

Show HN: Cymatica – an experimental, meditative audiovisual app

https://apps.apple.com/us/app/cymatica-sounds-visualizer/id6748863721
1•_august•7m ago•0 comments

GitBlack: Tracing America's Foundation

https://gitblack.vercel.app/
2•martialg•7m ago•0 comments

Horizon-LM: A RAM-Centric Architecture for LLM Training

https://arxiv.org/abs/2602.04816
1•chrsw•8m ago•0 comments

We just ordered shawarma and fries from Cursor [video]

https://www.youtube.com/shorts/WALQOiugbWc
1•jeffreyjin•9m ago•1 comments

Correctio

https://rhetoric.byu.edu/Figures/C/correctio.htm
1•grantpitt•9m ago•0 comments

Trying to make an Automated Ecologist: A first pass through the Biotime dataset

https://chillphysicsenjoyer.substack.com/p/trying-to-make-an-automated-ecologist
1•crescit_eundo•13m ago•0 comments

Watch Ukraine's Minigun-Firing, Drone-Hunting Turboprop in Action

https://www.twz.com/air/watch-ukraines-minigun-firing-drone-hunting-turboprop-in-action
1•breve•14m ago•0 comments

Free Trial: AI Interviewer

https://ai-interviewer.nuvoice.ai/
1•sijain2•14m ago•0 comments

FDA Intends to Take Action Against Non-FDA-Approved GLP-1 Drugs

https://www.fda.gov/news-events/press-announcements/fda-intends-take-action-against-non-fda-appro...
11•randycupertino•15m ago•3 comments

Supernote e-ink devices for writing like paper

https://supernote.eu/choose-your-product/
3•janandonly•18m ago•0 comments

We are QA Engineers now

https://serce.me/posts/2026-02-05-we-are-qa-engineers-now
1•SerCe•18m ago•0 comments

Show HN: Measuring how AI agent teams improve issue resolution on SWE-Verified

https://arxiv.org/abs/2602.01465
2•NBenkovich•18m ago•0 comments

Adversarial Reasoning: Multiagent World Models for Closing the Simulation Gap

https://www.latent.space/p/adversarial-reasoning
1•swyx•19m ago•0 comments

Show HN: Poddley.com – Follow people, not podcasts

https://poddley.com/guests/ana-kasparian/episodes
1•onesandofgrain•27m ago•0 comments

Layoffs Surge 118% in January – The Highest Since 2009

https://www.cnbc.com/2026/02/05/layoff-and-hiring-announcements-hit-their-worst-january-levels-si...
9•karakoram•27m ago•0 comments

Papyrus 114: Homer's Iliad

https://p114.homemade.systems/
1•mwenge•27m ago•1 comments

DicePit – Real-time multiplayer Knucklebones in the browser

https://dicepit.pages.dev/
1•r1z4•27m ago•1 comments

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

https://arxiv.org/abs/2601.14340
2•PaulHoule•29m ago•0 comments

Show HN: AI Agent Tool That Keeps You in the Loop

https://github.com/dshearer/misatay
2•dshearer•30m ago•0 comments

Why Every R Package Wrapping External Tools Needs a Sitrep() Function

https://drmowinckels.io/blog/2026/sitrep-functions/
1•todsacerdoti•30m ago•0 comments

Achieving Ultra-Fast AI Chat Widgets

https://www.cjroth.com/blog/2026-02-06-chat-widgets
1•thoughtfulchris•32m ago•0 comments

Show HN: Runtime Fence – Kill switch for AI agents

https://github.com/RunTimeAdmin/ai-agent-killswitch
1•ccie14019•35m ago•1 comments

Researchers surprised by the brain benefits of cannabis usage in adults over 40

https://nypost.com/2026/02/07/health/cannabis-may-benefit-aging-brains-study-finds/
2•SirLJ•36m ago•0 comments

Peter Thiel warns the Antichrist, apocalypse linked to the 'end of modernity'

https://fortune.com/2026/02/04/peter-thiel-antichrist-greta-thunberg-end-of-modernity-billionaires/
4•randycupertino•37m ago•2 comments

USS Preble Used Helios Laser to Zap Four Drones in Expanding Testing

https://www.twz.com/sea/uss-preble-used-helios-laser-to-zap-four-drones-in-expanding-testing
3•breve•42m ago•0 comments

Show HN: Animated beach scene, made with CSS

https://ahmed-machine.github.io/beach-scene/
1•ahmedoo•43m ago•0 comments

An update on unredacting select Epstein files – DBC12.pdf liberated

https://neosmart.net/blog/efta00400459-has-been-cracked-dbc12-pdf-liberated/
3•ks2048•43m ago•0 comments
Open in hackernews

Show HN: How to stop Claude Code hallucinations using a CLI Truth Layer

https://apidog.com/blog/apidog-cli-claude-skills-api-test-automation-guide/
4•themast•2w ago

Comments

themast•2w ago
I’ve been using Claude Code since the early beta, and while the "99% of my code is AI-written" claims are technically true for boilerplate, I kept hitting a wall with complex backend refactors. The agent would get stuck in a "hallucination loop"—fixing a bug, breaking a route, then suggesting the same fix again.

I finally hit a 10x velocity boost by moving away from "chatting" and implementing a Deterministic Truth Layer.

The Problem: The Vibe Coding Ceiling "Vibe coding" is great until you hit a 50-80% probabilistic success rate. If the agent doesn't have a way to verify its own logic against a real server state, it’s just guessing.

The Solution: Apidog CLI as a Claude Skill Instead of letting Claude "think" it fixed the API, I linked my terminal to the Apidog CLI guide via a custom SKILL.md.

The 10x Workflow:

The Skill: I defined my API test suite as a core Claude Skill.

The Execution: When I tell Claude to "Refactor the Auth logic," it doesn't just write code. It triggers an Automated API Testing Guide scenario through the CLI.

The Self-Correction: Claude captures the CLI output (actual JSON responses, status codes). If a test fails, the agent performs a root-cause analysis on the logs and auto-corrects before I ever see the PR.

This moves the agent from "Fuzzy Reasoning" to "Deterministic Execution." It’s the only way I’ve found to actually reach 100% autonomous code generation for production-grade APIs.

I’ve put together a quick Claude Code Tutorial on how to map these CLI skills in the comments/link below. Curious if anyone else is using MCP or custom skills to provide a "ground truth" to their terminal agents?

verdverm•2w ago
I just let my agent run the tests directly, no need for a skill that I need to maintain, just put that shit in the AGENTS/CLAUDE.md

The 10x differentiator is giving the agents the language tools for the code they are working on. Compiler, linters, formatters, LSP, testing, running dev servers, logs... these allow the agent to get feedback without the human in the loop and self adjust