Toolkit to help you get started with Spec-Driven Development

https://github.com/github/spec-kit
33•mooreds•6d ago

Comments

sebast_bake•3h ago
Is it good?
lngr•3h ago
Yes, I love it. I have used it for a while with Claude Code, Codex CLI and Windsurf. It's awesome with Claude Code. Codex CLI produces just garbage. Windsurf results vary, even when I use it with Claude models. I now use it with Windsurf for the specify and plan modes, and Claude for the implementation.
rendall•1h ago
I'm a bit confused by this. Codex does not appear to be one of the options?
trjordan•2h ago
I don't think we ever get away from the code being the source of truth. There has to be one source of truth.

If you want to go all in on specs, you must fully commit to allowing the AI to regenerate the codebase from scratch at any point. I'm an AI optimist, but this is a laughable stance with current tools.

That said, the idea of operating on the codebase as a mutable, complex entity, at arm's length, makes a TON of sense to me. I love touching and feeling the code, but as soon as there's 1) schedule pressure and 2) a company's worth of code, operating at a systems level of understanding just makes way more sense. Defining what you want done, using a mix of user-centric intent and architecture constraints, seems like a super high-leverage way to work.

The feedback mechanisms are still pretty tough, because you need to understand what the AI is implicitly doing as it works through your spec. There are decisions you didn't realize you needed to make, until you get there.

We're thinking a lot about this at https://tern.sh, and I'm currently excited about the idea of throwing an agentic loop around the implementation itself. Adversarially have an AI read through that huge implementation log and surface where it's struggling. It's a model that gives real leverage, especially over the "watch Claude flail" mode that's common in bigger projects/codebases.

Marazan•1h ago
> There are decisions you didn't realize you needed to make, until you get there.

Is the key insight and biggest stumbling block for me at the moment.

At the moment (encouraged by my company) I'm experimenting with as hands-off as possible Agent usage for coding. And it is _unbelievably_ frustrating to see the Agent get 99% of the code right in the first pass, only to misunderstand why a test is now failing and then completely mangle both its own code and the existing tests as it tries to "fix" the "problem". And if I'd just given it a better spec to start with, it probably wouldn't have started producing garbage.

But I didn't know that before working with the code! So to develop a good spec, I either have to have the agent stop all the time so I can intervene, or dive into the code myself to begin with. At that point I may as well write the code anyway, as writing the code is not the slow bit.

trjordan•1h ago
For sure. One of our first posts was called "You Have To Decide" -- https://tern.sh/blog/you-have-to-decide/

And my process now (and what we're baking into the product) is:

- Make a prompt

- Run it in a loop over N files. Full agentic toolkit, but don't be wasteful (no "full typecheck, run the test suite" on every file).

- Have an agent check the output. Look for repeated exploration, look for failures. Those imply confusion.

- Iterate the prompt to remove the confusion.

First pass on the current project (a Vue 3 migration) went from 45 min of agentic time on 5 files to 10 min on 50 files, and the latter passed tests/typecheck/my own scrolling through it.
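The "check the output" step above can be sketched roughly as follows. This is only an illustration, not Tern's implementation: it swaps the reviewing agent for a plain heuristic, and the `Step` log format, the `confusion_signals` and `review_run` names, and the `repeat_threshold` default are all assumptions made up for the example.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Step:
    """One entry in a hypothetical agent log (this format is an assumption)."""
    action: str      # e.g. "read", "search", "edit", "test"
    target: str      # e.g. a file path
    ok: bool = True  # whether the step succeeded


def confusion_signals(log, repeat_threshold=3):
    """Return human-readable signs of confusion found in one file's log."""
    signals = []
    # Repeated exploration: the same read-only action on the same target.
    looks = Counter(
        (s.action, s.target) for s in log if s.action in {"read", "search"}
    )
    for (action, target), count in looks.items():
        if count >= repeat_threshold:
            signals.append(f"repeated {action} of {target} x{count}")
    # Failures: any step that did not succeed.
    for s in log:
        if not s.ok:
            signals.append(f"failed {s.action} on {s.target}")
    return signals


def review_run(logs_by_file, repeat_threshold=3):
    """Map each file to its confusion signals, skipping files with none."""
    report = {}
    for path, log in logs_by_file.items():
        found = confusion_signals(log, repeat_threshold)
        if found:
            report[path] = found
    return report


if __name__ == "__main__":
    logs = {
        "a.vue": [Step("read", "a.vue"), Step("edit", "a.vue"), Step("test", "a.vue")],
        "b.vue": [Step("read", "store.ts")] * 3 + [Step("test", "b.vue", ok=False)],
    }
    # Only b.vue is flagged: three reads of store.ts, then a failing test.
    print(review_run(logs))
```

Whatever this report surfaces is what would feed the "iterate the prompt" step: if many files show the same repeated read, the prompt is probably missing context about that target.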

isodev•1h ago
Can I use it without the uv tool? I’d rather my open source projects remain open as in libre.
JimDabell•1h ago
uv is Apache and MIT-licensed. It’s as “open as in libre” as it gets.
shubhamjain•1h ago
> Spec-Driven Development changes this: specifications become executable, directly generating working implementations rather than just guiding them.

Reminds me of the TDD bandwagon, which was all the rage when I started programming. It took years for it to slowly die out and for people to realize how overhyped it really was. Nothing against AI, I love it as a tool, but this "you-don't-need-code" approach shows similar signs. Quick wins at first, lots of hype because of those wins, and then reaching a point where doing even tiny changes becomes absurdly difficult.

You need code. You will need it for a long time.

discreteevent•1h ago
There was a really good article on this here a few days ago that didn't get much traction. It was about how programming is a learning feedback loop and because of that there are good and bad ways to use LLMs:

"The readymade components we use are essentially compressed bundles of context—countless design decisions, trade-offs, and lessons are hidden within them. By using them, we get the functionality without the learning, leaving us with zero internalized knowledge of the complex machinery we've just adopted. This can quickly lead to sharp increase in the time spent to get work done and sharp decrease in productivity."

https://martinfowler.com/articles/llm-learning-loop.html

lloydatkinson•37m ago
Well done on conflating BDD and TDD then, I suppose.
insin•1h ago
This isn't just me not reading your comprehensive guide [1]. It's me recognising you couldn't even be bothered to write it yourself.

[1] https://github.com/github/spec-kit/blob/main/spec-driven.md

Visualize FastAPI endpoints with FastAPI-Voyager

https://www.newsyeah.fun/voyager/
41•tank-34•2h ago•8 comments

Reviving Classic Unix Games: A 20-Year Journey Through Software Archaeology

https://vejeta.com/reviving-classic-unix-games-a-20-year-journey-through-software-archaeology/
31•mwheeler•2h ago•7 comments

Email verification protocol

https://github.com/WICG/email-verification-protocol
25•sgoto•1w ago•8 comments

Alive internet theory

https://alivetheory.net/
72•manbitesdog•2h ago•34 comments

I Am Mark Zuckerberg

https://iammarkzuckerberg.com/
667•jb1991•8h ago•233 comments

Ironclad – formally verified, real-time capable, Unix-like OS kernel

https://ironclad-os.org/
307•vitalnodo•15h ago•87 comments

Ask HN: How would you set up a child’s first Linux computer?

51•evolve2k•3h ago•79 comments

Show HN: Pipeflow-PHP – Automate anything with pipelines even non-devs can edit

https://github.com/marcosiino/pipeflow-php
14•marcosiino•1h ago•2 comments

Reverse engineering Codex CLI to get GPT-5-Codex-Mini to draw me a pelican

https://simonwillison.net/2025/Nov/9/gpt-5-codex-mini/
104•simonw•10h ago•49 comments

The overengineered solution to my pigeon problem (2022)

https://maxnagy.com/posts/pigeons/
34•cyb0rg0•6d ago•21 comments

Largest cargo sailboat completes first Atlantic crossing

https://www.marineinsight.com/shipping-news/worlds-largest-cargo-sailboat-completes-historic-firs...
304•defrost•18h ago•203 comments

Show HN: I built a self-hosted error tracker in Rails

https://telebugs.com
63•kyrylo•1w ago•33 comments

Drax: Speech Recognition with Discrete Flow Matching

https://huggingface.co/papers/2510.04162
16•cliffly•1h ago•0 comments

Marko – A declarative, HTML‑based language

https://markojs.com/
318•ulrischa•20h ago•153 comments

Toolkit to help you get started with Spec-Driven Development

https://github.com/github/spec-kit
33•mooreds•6d ago•12 comments

How Airbus took off

https://worksinprogress.co/issue/how-airbus-took-off/
102•JumpCrisscross•13h ago•91 comments

Study identifies weaknesses in how AI systems are evaluated

https://www.oii.ox.ac.uk/news-events/study-identifies-weaknesses-in-how-ai-systems-are-evaluated/
370•pseudolus•1d ago•178 comments

Defeating KASLR by doing nothing at all

https://googleprojectzero.blogspot.com/2025/11/defeating-kaslr-by-doing-nothing-at-all.html
66•aa_is_op•5d ago•5 comments

Open-source communications by bouncing signals off the Moon

https://open.space/
204•fortran77•1w ago•57 comments

Forth – Is it still relevant?

https://github.com/chochain/eforth
73•lioeters•9h ago•45 comments

Tabloid: The Clickbait Headline Programming Language

https://tabloid.vercel.app/
260•sadeshmukh•11h ago•37 comments

Control structures in programming languages: from goto to algebraic effects

http://xavierleroy.org/control-structures/
165•SchwKatze•6d ago•28 comments

Show HN: Hephaestus – Autonomous Multi-Agent Orchestration Framework

https://github.com/Ido-Levi/Hephaestus
59•idolevi•6d ago•9 comments

Avería: The Average Font (2011)

http://iotic.com/averia/
193•JoshTriplett•19h ago•33 comments

Study finds memory decline surge in young people

https://onepercentrule.substack.com/p/under-40s-declining-memory
140•drcwpl•9h ago•75 comments

Show HN: Sparktype – a CMS and SSG that runs entirely in the browser

https://app.sparktype.org
26•mattkevan•5d ago•3 comments

Opencloud – An alternative to Nextcloud written in Go

https://github.com/opencloud-eu/opencloud
158•todsacerdoti•22h ago•54 comments

How to build your own VPN, or: the history of WARP

https://blog.cloudflare.com/how-to-build-your-own-vpn-or-the-history-of-warp/
69•yla92•6d ago•22 comments

Grok 4 Fast now has 2M context window

https://docs.x.ai/docs/models
132•hereme888•10h ago•209 comments

IRIX Introduction

http://www.sgistuff.net/software/irixintro/index.html
46•naves•13h ago•29 comments