frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN : Pilot – System to improve dramatically your AI coding

https://github.com/clementrog/pilot
1•crog•2h ago
I'm a non-technical guy who spent 2 months trying to ship software with AI tools. Not toy projects — real things I wanted to use. Finance analyzers, productivity tools, dev utilities.

The models are incredible. But the loop was broken.

Every session started from zero. Context would explode. The AI would hallucinate with confidence. And because I can't read code, I had no way to verify when something was wrong. I just knew it was broken. So I stopped fighting the model and started building the system around it.

Pilot is a /pilot folder you drop into any repo. It's emergent complexity from simple primitives — markdown files that give AI tools:

Persistent state (STATE.md tracks where you are in the workflow) Scoped tasks (TASK.md defines boundaries before implementation) Evidence capture (real terminal output via MCP, not generated text) Protected paths (red zones require human approval) Recovery (LKG commit auto-updated after health passes)

The core insight: split the AI into two roles. Orchestrator (Claude/ChatGPT) — high reasoning, low volume. Writes specs, reviews evidence, manages flow. Builder (Cursor/Claude Code) — high volume, lower cost. Implements, provides proof. The Orchestrator defines scope before the Builder touches anything. The Builder works within boundaries. The Orchestrator reviews after. Two models, two verification passes. It's moving from "trust me" to "show me the terminal."

Why I needed this: I wanted to program by intuition, not by syntax. I can design systems. I can spec features. I can verify that tests pass and URLs work. What I can't do is read 200 lines of generated TypeScript and know if it's correct. So the system had to prove correctness without requiring code review. Evidence-based commits. Scope contracts. Clear rejection criteria. It's shared intuition for messy realities. Not a sandbox — I know markdown isn't a firewall. It's defense in depth: separation of concerns, multi-model review, explicit rules, human gates.

Technical notes: The workflow is a state machine: idle → building → verifying → done. Evidence comes from MCP-captured terminal output. The Orchestrator validates Builder output against TASK.md constraints. Red zone violations trigger automatic escalation. The /pilot folder is just markdown. Any MCP-enabled tool can read it. No vendor lock-in.

Limitations (being honest): Solo builder workflow. Team use needs merge strategy for state files. Convention-based, not filesystem-enforced. If you need true isolation, run in a container. Context can still drift if you skip the workflow. Health checks help, but it's not foolproof. Token overhead exists. Trading cost for correctness insurance.

What I've built with it: Private projects mostly — finance analyzer, productivity tools, Framer components, and Pilot itself. Iterating on the workflow every time I hit a wall until the walls stopped appearing.

Now using it on bigger things I plan to release.

Felt too good not to share.

Happy to discuss the architecture, failure modes, or specific edge cases.

Someone Has to Fly the Plane

https://www.raptitude.com/2026/01/someone-has-to-fly-the-plane/
1•crescit_eundo•1m ago•0 comments

TimeCapsuleLLM: LLM trained only on data from 1800-1875

https://github.com/haykgrigo3/TimeCapsuleLLM
1•admp•1m ago•0 comments

Show HN: Reversing YouTube's "Most Replayed" Graph

https://priyavr.at/blog/reversing-most-replayed/
1•prvt•1m ago•0 comments

Playwriter: Browser extension MCP uses 90% fewer tokens than Playwright

https://github.com/remorses/playwriter
1•Areibman•2m ago•0 comments

Show HN: AI Motion Control – Transfer any motion to any character with Kling AI

https://aimotioncontrol.app
1•sunpy•2m ago•0 comments

VeilNet: A Different Approach to Overlay Networks for Dynamic Infrastructure

https://veilnet.net
1•ulfaric•3m ago•1 comments

I put Economic rules within silicon

https://codeberg.org/ErickAlexander/Primal-Origins-SoC-IP-Core
1•PrimalOrigins•3m ago•1 comments

Apple Warning–iPhones Must Now Restart

https://www.forbes.com/sites/zakdoffman/2026/01/12/apple-warning-hundreds-of-millions-of-iphones-...
1•aq3cn•4m ago•0 comments

New York begins cargo drone trial between Manhattan and Brooklyn

https://blog.adafruit.com/2026/01/12/new-york-begins-cargo-drone-trial-between-manhattan-and-broo...
1•ptorrone•5m ago•0 comments

Show HN: ETLR – An alternative to Zapier/Make/n8n for developers (YAML-based)

https://etlr.io/
1•liambre•5m ago•1 comments

Dead shrimp. Living machines. Necrobotics is here

https://blog.adafruit.com/2026/01/12/dead-shrimp-living-machines-necrobotics-is-here-at-adafruit/
1•ptorrone•5m ago•0 comments

Founding AE/GTM Role

1•clelands•5m ago•0 comments

Are you hiring designers & engineers?

https://x2talent.com/
1•carlwheatley10•5m ago•0 comments

LLM remembers every past conversation (no embeddings, no RAG) [video]

https://www.youtube.com/watch?v=bOTivKyNH-E
1•chatman1•6m ago•1 comments

Things that Janet and Clojure do better than each other

https://janet.zulipchat.com/#narrow/channel/399615-general/topic/Things.20that.20janet.20and.20cl...
1•amano-kenji•8m ago•0 comments

Show HN: Convert bank statement PDFs to Tables

https://www.bankpdfstatementconverter.com
2•yiyiyayo•11m ago•0 comments

Streamer Spend to Top $100B for First Time in 2026 – Report

https://deadline.com/2026/01/streamer-spend-landmark-figure-2026-ampere-analysis-1236680312/
1•smurda•11m ago•0 comments

Interactive SHA-256 Visualizer

https://hashexplained.com/
1•jrakibi•11m ago•0 comments

Keyword-Match Domains Still Matter for SEO

https://www.wannabe-entrepreneur.com/post/keyword-match-domains-google
1•wbemaker•13m ago•0 comments

Python UCP Client (Universal Commerce Protocol)

https://github.com/Upsonic/ucp-client
1•onuratakan•13m ago•0 comments

Show HN: Create LLM-optimized random identifiers

https://github.com/blixt/tokeydokey
1•blixt•14m ago•0 comments

Influencers and OnlyFans models are dominating O-1 visa requests

https://www.theguardian.com/us-news/2026/jan/11/onlyfans-influencers-us-o-1-visa
1•randycupertino•14m ago•1 comments

Building a No-Tracking Newsletter

https://philippdubach.com/posts/building-a-no-tracking-newsletter-from-markdown-to-distribution/
2•homo_economicus•15m ago•1 comments

Betterment Unauthorized Access

https://www.betterment.com/customer-update
1•gregsadetsky•15m ago•0 comments

Show HN: From Weekend Hack to the Serial Console Adapter We Enjoy Using

https://www.intertooth.com/en/about-us
1•bojan-ch•18m ago•0 comments

The Middle Binomial Coefficient

https://www.johndcook.com/blog/2026/01/12/the-middle-binomial-coefficient/
1•ibobev•19m ago•0 comments

Combining In-Shuffles and Out-Shuffles

https://www.johndcook.com/blog/2026/01/12/in-out-shuffle/
1•ibobev•20m ago•0 comments

Resilience vs. Fault Tolerance

https://www.ufried.com/blog/resilience_vs_fault_tolerance/
1•sylvainkalache•20m ago•0 comments

Toward single-cell control: noise-robust perfect biomolecular adaptation

https://www.nature.com/articles/s41467-025-67736-y
1•PaulHoule•21m ago•0 comments

Windows 2000 still earning its keep running a rail ticket machine in Portugal

https://www.theregister.com/2026/01/12/windows_2000_portugal_rail/
3•pjmlp•22m ago•0 comments