frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

East Germany balloon escape

https://en.wikipedia.org/wiki/East_Germany_balloon_escape
217•robertvc•6h ago•65 comments

Cloudflare acquires Astro

https://astro.build/blog/joining-cloudflare/
668•todotask2•9h ago•316 comments

Left in the cold: Study finds most renters shut out of energy-saving upgrades

https://www.binghamton.edu/news/story/6011/left-in-the-cold-study-finds-most-renters-shut-out-of-...
6•hhs•15m ago•0 comments

Releasing rainbow tables to accelerate Net-NTLMv1 protocol deprecation

https://cloud.google.com/blog/topics/threat-intelligence/net-ntlmv1-deprecation-rainbow-tables
38•linolevan•2h ago•22 comments

6-Day and IP Address Certificates Are Generally Available

https://letsencrypt.org/2026/01/15/6day-and-ip-general-availability
308•jaas•8h ago•190 comments

LLM Structured Outputs Handbook

https://nanonets.com/cookbooks/structured-llm-outputs
49•vitaelabitur•1d ago•9 comments

Cursor's latest “browser experiment” implied success without evidence

https://embedding-shapes.github.io/cursor-implied-success-without-evidence/
343•embedding-shape•9h ago•151 comments

Michelangelo's first painting, created when he was 12 or 13

https://www.openculture.com/2026/01/discover-michelangelos-first-painting.html
281•bookofjoe•10h ago•154 comments

Just the Browser

https://justthebrowser.com/
464•cl3misch•11h ago•232 comments

HTTP RateLimit Headers

https://dotat.at/@/2026-01-13-http-ratelimit.html
16•zdw•2d ago•5 comments

Lock-Picking Robot

https://github.com/etinaude/Lock-Picking-Robot
237•p44v9n•4d ago•110 comments

Patching the Wii News Channel to serve local news (2025)

https://raulnegron.me/2025/wii-news-pr/
31•todsacerdoti•10h ago•5 comments

Reading across books with Claude Code

https://pieterma.es/syntopic-reading-claude/
44•gmays•5h ago•16 comments

STFU

https://github.com/Pankajtanwarbanna/stfu
571•tanelpoder•6h ago•405 comments

Launch HN: Indy (YC S21) – A support app designed for ADHD brains

https://www.shimmer.care/indy-redirect
61•christalwang•7h ago•72 comments

Why DuckDB is my first choice for data processing

https://www.robinlinacre.com/recommend_duckdb/
191•tosh•12h ago•70 comments

Slop is everywhere for those with eyes to see

https://www.fromjason.xyz/p/notebook/slop-is-everywhere-for-those-with-eyes-to-see/
148•speckx•3h ago•86 comments

Dev-owned testing: Why it fails in practice and succeeds in theory

https://dl.acm.org/doi/10.1145/3780063.3780066
98•rbanffy•10h ago•131 comments

Elasticsearch was never a database

https://www.paradedb.com/blog/elasticsearch-was-never-a-database
87•jamesgresql•5d ago•71 comments

Zep AI (Agent Context Engineering, YC W24) Is Hiring Forward Deployed Engineers

https://www.ycombinator.com/companies/zep-ai/jobs/
1•roseway4•6h ago

CLI's completion should know what options you've typed

https://hackers.pub/@hongminhee/2026/optique-context-aware-cli-completion
9•dahlia•3d ago•2 comments

We Gave Our Browser Agent a 3MB Data Warehouse

https://100x.bot/a/we-gave-our-browser-agent-a-3mb-data-warehouse
20•shardullavekar•1d ago•3 comments

An Ode to the Return of Wysiwyg

https://jeffverkoeyen.com/blog/2026/01/13/WYSIWYG/
4•featherless•3d ago•0 comments

Brain: PC virus [audio]

https://www.bbc.com/audio/play/w3ct7479
14•andsoitis•4d ago•1 comments

Show HN: 1Code – Open-source Cursor-like UI for Claude Code

https://github.com/21st-dev/1code
45•Bunas•1d ago•23 comments

Independent Guest Virtual Machine (IGVM) File Format

https://github.com/microsoft/igvm
18•ingve•1d ago•1 comments

Read_once(), Write_once(), but Not for Rust

https://lwn.net/SubscriberLink/1053142/8ec93e58d5d3cc06/
102•todsacerdoti•8h ago•33 comments

Drawbot: Let's hack something cute (2025)

https://www.atredis.com/blog/2025/9/30/drawbot-lets-hack-something-cute
14•notmine1337•1h ago•4 comments

Our approach to advertising

https://openai.com/index/our-approach-to-advertising-and-expanding-access/
195•rvz•5h ago•161 comments

LWN is currently under the heaviest scraper attack seen yet

https://social.kernel.org/notice/B2JlhcxNTfI8oDVoyO
137•luu•3h ago•86 comments
Open in hackernews

LLM Structured Outputs Handbook

https://nanonets.com/cookbooks/structured-llm-outputs
49•vitaelabitur•1d ago

Comments

tehnub•1h ago
This is a nice guide. I especially like the masked decoding diagrams on this page https://nanonets.com/cookbooks/structured-llm-outputs/basic-....

edit: Somehow that link doesn't work... It's the diagram on the "constrained method" page

prats226•1h ago
One of the authors here, will checkout the diagram link.

Every commercial model provider is adding structured outputs so will keep updating the guide.

HanClinto•54m ago
This is a seriously beautiful guide. I really appreciate you putting this together! I especially love the tab-through animations on the various pages, and this is one of the best explanations that I've seen. I generally feel I understand grammar-constrained generation pretty well (I've merged a handful of contributions to the llama.cpp grammar implementation), and yet I still learned some insights from your illustrations -- thank you!

I'm also really glad that you're helping more people understand this feature, how it works, and how to use it effectively. I strongly believe that structured outputs are one of the most underrated features in LLM engines, and people should be using this feature more.

Constrained non-determinism means that we can reliably use LLMs as part of a larger pipeline or process (such as an agent with tool-calling) and we won't have failures due to syntax errors or erroneous "Sure! Here's your output formatted as JSON with no other text or preamble" messages thrown in.

Your LLM output might not be correct. But grammars ensure that your LLM output is at least _syntactically_ correct. It's not everything, but it's not nothing.

And especially if we want to get away from cloud deployments and run effective local models, grammars are an incredibly valuable piece of this. For practical examples, I often think of Jart's example in her simple LLM-based spam-filter running on a Raspberry Pi [0]:

> llamafile -m TinyLlama-1.1B-Chat-v1.0.f16.gguf \ > --grammar 'root ::= "yes" | "no"' --temp 0 -c 0 \ > --no-display-prompt --log-disable -p "<|user|> > Can you say for certain that the following email is spam? ...

Even though it's a super-tiny piece of hardware, by including a grammar that constrains the output to only ever be "yes" or "no" (it's impossible for the system to produce a different result), then she can use a super-small model on super-limited hardware, and it is still useful. It might not correctly identify spam, but it's never going to break for syntactic reasons, which gives a great boost to the usefulness of small, local models.

* [0]: https://justine.lol/matmul/

shmolyneaux•50m ago
Are there output formats that are more reliable (better adherence to the schema, easier to get parse-able output) or cheaper (fewer tokens) than JSON? YAML has its own problems and TOML isn't widely adopted, but they both seem like they would be easier to generate.

What have folks tried?

marquesine•47m ago
Yes, that's the purpose of TOON.

https://github.com/toon-format/toon

prats226•1m ago
Nice, it would be good idea to develop CFG for this as well so can embed it into all these constrained decoding libraries
roywiggins•24m ago
> We use a lenient parser like ast.literal_eval instead of the standard json.loads(). It will handle outputs that deviate from strict JSON format. (single quotes, trailing commas, etc.)

A nitpick: that's probably a good idea and I've used it before, but that's not really a lenient json parser, it's a Python literal parser and they happen to be close enough that it's useful.

earth2mars•17m ago
BAML
dfajgljsldkjag•13m ago
I agree that building agents is basically impossible if you cannot trust the model to output valid json every time. This seems like a decent collection of the current techniques we have to force deterministic structure for production systems.