Hi HN I built Bauform to generate production-ready code, not demos

https://bauform-beta.fly.dev/

1•tekodu•3mo ago

Comments

tekodu•3mo ago

Tested against GPT-5 and Claude 4.5 on 10 production specifications:

- Bauform: 10/10 pass all validation gates
- GPT-5: 0/10 (generates Streamlit UIs instead of REST APIs)
- Claude 4.5: 0/10 (same failure)

The problem: frontier models pattern-match "build a validator" → "create Streamlit demo", even when the requirements explicitly ask for a production API.

Try it yourself:

- Live beta: https://bauform-beta.fly.dev/
- Benchmark with all results: https://github.com/tekodu/bauform-evals
- Quick API test:

  curl -X POST https://bauform-beta.fly.dev/v1/engine/generate \
    -H "Content-Type: application/json" \
    -d '{"spec": "CSV validator with REST API", "params": {}}'

- Analysis paper (under peer review): https://www.dropbox.com/scl/fi/vtmztpdkm0ns86qapxp5p/bauform...
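If you'd rather script the quick API test than paste curl, here is a rough Python equivalent using only the standard library. The URL, header, and payload are copied from the curl command above; the function names are made up for illustration, and since the response format isn't described in this post, the sketch just returns the raw body without parsing it.

```python
import json
import urllib.request

# Endpoint copied from the curl command in the post.
GENERATE_URL = "https://bauform-beta.fly.dev/v1/engine/generate"


def build_generate_request(spec, params=None):
    """Build (but do not send) the POST request for a given spec string.

    Hypothetical helper: mirrors the curl flags -X POST, -H, and -d.
    """
    body = json.dumps({"spec": spec, "params": params or {}}).encode("utf-8")
    return urllib.request.Request(
        GENERATE_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(spec, params=None, timeout=60):
    """Send the request and return the raw response body as text.

    The response schema is not documented here, so nothing is parsed.
    """
    request = build_generate_request(spec, params)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling generate("CSV validator with REST API") should send the same request as the curl line; the 60-second timeout is an arbitrary choice.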

We use 5-gate validation: functional, security, limits, latency, stability. Binary pass/fail - production either works or doesn't.
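To make "binary pass/fail" concrete, here is a minimal sketch of how five gate outcomes could collapse into one verdict. Only the gate names come from the description above; GateResult and overall_pass are hypothetical names for illustration, not Bauform's actual code, and treating a missing gate as a failure is an assumption.

```python
from dataclasses import dataclass

# Gate names as listed in the post.
GATES = ("functional", "security", "limits", "latency", "stability")


@dataclass(frozen=True)
class GateResult:
    """Outcome of one validation gate (hypothetical structure)."""
    name: str
    passed: bool


def overall_pass(results):
    """Return True only if every one of the five gates is present and passed.

    Assumption: a gate with no reported result counts as a failure.
    """
    by_name = {result.name: result.passed for result in results}
    return all(by_name.get(gate, False) for gate in GATES)
```

There is no partial credit in this sketch: a single failing or missing gate makes the whole run a fail.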

The results are cryptographically signed (Ed25519) and fully reproducible.

Happy to answer questions about the methodology or system architecture.