The missed opportunity of constrained decoding

https://michaelorenstein.com/blog/zero-entropy-tokens/
2•killcoder•1h ago

Comments

killcoder•1h ago
I was working on a speculative decoding optimisation and its accompanying blog post. Explaining the more basic concepts filled so much of the post that I decided to pull them out into this article.

I had a bit too much fun with the tokenisation diagrams / animations. The raw text is provided to an Astro component, which tokenises it and builds the individual DOM elements for the tokens. I find it really hard to read 'tokenised' text, so I figured some consistent colouring would help. The 'Probabilities' component is a trivial grid, but all the other components support 'word wrap'.

I ended up writing a 'responsive design aware graph colouring solver'.

Multiple screen widths ('desktop' and 'mobile') are 'simulated', forming an adjacency graph of tokens that touch. Colours are greedily allocated, then optimised per page over a few hundred iterations, swapping allocations to enforce a minimum hue distance between touching tokens at those common screen sizes. The optimising value function prioritises an even distribution of colours, because it looks nicer than maximal hue difference.
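
A minimal sketch of that idea, in Python rather than the author's actual Astro build code. The token widths, container widths, palette, wrap model, and scoring below are all assumptions made up for illustration; the real solver is not shown in the thread.

```python
import random
from collections import Counter

def hue_distance(a, b):
    """Distance between two hues on the 360-degree colour wheel."""
    d = abs(a - b) % 360
    return min(d, 360 - d)

def touching_pairs(widths, container_width):
    """Simulate simple word wrap and return index pairs (i, j) of tokens that touch."""
    boxes, x, line = [], 0, 0                      # each box: (line, x_start, x_end)
    for w in widths:
        if x > 0 and x + w > container_width:      # token does not fit: wrap
            x, line = 0, line + 1
        boxes.append((line, x, x + w))
        x += w
    pairs = set()
    for i, (li, si, ei) in enumerate(boxes):
        for j in range(i + 1, len(boxes)):
            lj, sj, ej = boxes[j]
            beside = li == lj and ei == sj                   # neighbours on one line
            stacked = lj == li + 1 and si < ej and sj < ei   # overlap on the next line
            if beside or stacked:
                pairs.add((i, j))
    return pairs

def clashes(pairs, colours, min_distance):
    """Count touching pairs whose hues are closer than min_distance."""
    return sum(1 for i, j in pairs
               if hue_distance(colours[i], colours[j]) < min_distance)

def greedy(n, pairs, palette, min_distance):
    """Colour tokens in order, preferring the least-used hue that does not clash."""
    colours, usage = {}, Counter()
    for i in range(n):
        near = [colours[a] for a, b in pairs if b == i]      # coloured neighbours
        def gap(hue):
            return min((hue_distance(hue, u) for u in near), default=360)
        ok = [h for h in palette if gap(h) >= min_distance]
        hue = min(ok, key=lambda h: usage[h]) if ok else max(palette, key=gap)
        colours[i] = hue
        usage[hue] += 1
    return colours

def optimise(pairs, colours, min_distance, iterations=300, seed=0):
    """Swap colours between random token pairs, keeping swaps that do not add clashes."""
    rng = random.Random(seed)
    colours = dict(colours)
    tokens = list(colours)
    best = clashes(pairs, colours, min_distance)
    for _ in range(iterations):
        if best == 0:
            break
        a, b = rng.sample(tokens, 2)
        colours[a], colours[b] = colours[b], colours[a]
        candidate = clashes(pairs, colours, min_distance)
        if candidate <= best:
            best = candidate
        else:
            colours[a], colours[b] = colours[b], colours[a]  # undo the swap
    return colours

widths = [40, 25, 60, 30, 45, 20, 55, 35]          # hypothetical token widths in px
palette = [0, 60, 120, 180, 240, 300]              # hypothetical hues
pairs = touching_pairs(widths, 120) | touching_pairs(widths, 320)  # 'mobile' | 'desktop'
start = greedy(len(widths), pairs, palette, min_distance=90)
final = optimise(pairs, start, min_distance=90)
```

In this sketch the optimiser only swaps colours between tokens, so the per-hue usage counts chosen by the greedy pass stay fixed. That is one cheap way to keep the palette evenly used while the clash count goes down.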

Originally I naively output the palette styles per component, but found the CSS post-processing optimisers didn't handle that as well as I'd have thought. So I wrote a little 'CSS compiler' that takes the high-level palette and timing concepts of the animations and optimally merges rule declarations.
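
As a rough illustration of merging rule declarations, the sketch below groups selectors that share identical declarations into one rule. It is a simplification and not the 'CSS compiler' described above; the selector names and colour values are hypothetical, and it makes no claim to produce an optimal merge.

```python
from collections import defaultdict

def merge_rules(rules):
    """Merge selectors with identical declarations into a single comma-separated rule.

    rules maps a selector string to a dict of property -> value.
    """
    groups = defaultdict(list)
    for selector, declarations in rules.items():
        key = tuple(sorted(declarations.items()))   # order-independent identity
        groups[key].append(selector)
    lines = []
    for declarations, selectors in groups.items():
        body = ";".join(f"{prop}:{value}" for prop, value in declarations)
        lines.append(f"{','.join(selectors)}{{{body}}}")
    return "\n".join(lines)

# Hypothetical per-token palette rules, as a per-component emitter might produce them.
rules = {
    ".t0": {"background": "hsl(0 70% 85%)"},
    ".t1": {"background": "hsl(120 70% 85%)"},
    ".t2": {"background": "hsl(0 70% 85%)"},
}
css = merge_rules(rules)
print(css)
```

Here `.t0` and `.t2` collapse into one rule because their declarations match exactly. Merging partially overlapping declarations is a harder problem that this sketch does not attempt.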

The start of the post really relies on the animation occurring while fully in view, so I set up some IntersectionObservers that do the 'please scroll' text.

I tried my best to have it all work when JS is disabled on the client. I tried to get the 'hovering' to be CSS-only, but found the JS solution much more performant.

The DAG diagrams are formed with this neat Needleman-Wunsch algorithm from the bioinformatics field. The Astro component accepts several 'examples' then aligns common subsequences, producing the CSS grid and the 'basic SVG' on the server. The responsive nature meant I had to move the final 'allow' generation to the client.
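
For readers who have not met it, Needleman-Wunsch is a dynamic-programming global alignment. The sketch below aligns two token sequences and marks gaps with `None`. The scoring values and the example sentences are assumptions, and the sketch handles only two sequences, whereas the component described above accepts several examples.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align sequences a and b; returns a list of (a_item, b_item) columns."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def pair_score(i, j):
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill the table: each cell is the best of diagonal, up (gap in b), left (gap in a).
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + pair_score(i, j),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    aligned, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair_score(i, j):
            aligned.append((a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            aligned.append((a[i - 1], None))
            i -= 1
        else:
            aligned.append((None, b[j - 1]))
            j -= 1
    aligned.reverse()
    return aligned

columns = needleman_wunsch(["The", "cat", "sat"], ["The", "big", "cat", "sat"])
print(columns)
# [('The', 'The'), (None, 'big'), ('cat', 'cat'), ('sat', 'sat')]
```

Columns where both sides agree can become shared nodes, and columns with a gap or a mismatch can become branches, which is one plausible way to turn the alignment into grid columns for a DAG diagram.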

Some browsers seem to throttle the token animations sometimes, but I haven't figured out what causes that. This is my first time leaning hard on CSS variables.

Agent_Builder•1h ago
This resonates. What we saw in practice is that most failures don’t come from models being too dumb, but from being given too much freedom.

While using GTWY.ai, the biggest reduction in hallucinations came from constraining what an agent was allowed to do at each step, not from better prompts or verification layers.

Once inputs, tools, and outputs were explicit, the model stopped confidently inventing things. It felt less “creative”, but far more useful.

Fewer degrees of freedom beat smarter models, at least in production.

killcoder•1h ago
I don't buy the "any constraints cause lower performance via being out of distribution" idea. Sure, if you ask the model to output 'reasoning' in JSON steps, that is a completely different 'channel' to its trained 'reasoning' output. For real tasks though, I think it's more about picking the _right_ context-free grammar to enforce format correctness. You can enforce an in-distribution format and get the best of both worlds. I don't think the industry should settle so hard on JSON-for-everything.
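
To make "enforce format correctness with a grammar" concrete, here is a toy sketch of grammar-constrained greedy decoding. The vocabulary, the grammar `S -> "x" | "(" S ")"`, and the `toy_logits` stand-in for a model are all invented for illustration; no real inference library is used.

```python
import math

VOCAB = ["(", ")", "x", "<eos>"]

def allowed_tokens(prefix):
    """Tokens that keep prefix inside the toy grammar S -> "x" | "(" S ")"."""
    if "x" not in prefix:
        return {"(", "x"}                       # still opening, or emit the atom
    depth = prefix.count("(") - prefix.count(")")
    return {")"} if depth > 0 else {"<eos>"}    # must close every open bracket

def constrained_greedy_decode(logits_fn, max_steps=20):
    """Greedy decoding where tokens the grammar forbids are masked to -inf."""
    prefix = []
    for _ in range(max_steps):
        logits = logits_fn(prefix)
        allowed = allowed_tokens(prefix)
        masked = [l if tok in allowed else -math.inf
                  for tok, l in zip(VOCAB, logits)]
        token = VOCAB[max(range(len(VOCAB)), key=masked.__getitem__)]
        if token == "<eos>":
            break
        prefix.append(token)
    return prefix

def toy_logits(prefix):
    """Hypothetical model: opens two brackets, emits x, then wrongly wants to stop."""
    if len(prefix) < 2:
        return [2.0, 0.1, 1.0, 0.0]
    if "x" not in prefix:
        return [0.5, 0.1, 2.0, 0.0]
    return [0.0, 1.0, 0.0, 3.0]                 # prefers <eos> over ")"

result = constrained_greedy_decode(toy_logits)
print("".join(result))  # ((x))
```

After the `x` is emitted, this grammar allows exactly one token at every remaining step, so the mask alone decides the output regardless of what the toy model prefers.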

Agent_Builder•1h ago
I think we’re mostly aligned. The constraints we’re talking about weren’t about forcing everything into JSON or limiting reasoning bandwidth.

Inside a step, the model still reasons freely in plain language. The constraint is on what authority exists at that step.

The failures we saw came from permissions and assumptions silently carrying over between steps, not from the model “thinking wrong”. Once a step ended, any authority it had ended too.

So it’s less “constrain decoding” and more “constrain capability scope over time”. Free reasoning within a step, hard boundaries between steps.

That separation is what removed a lot of surprising behavior for us.
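
One way to picture "authority ends when the step ends" is a scoped grant, sketched below with a context manager. This is not GTWY.ai's implementation; the `ToolBox` class, its methods, and the tool names are hypothetical.

```python
from contextlib import contextmanager

class ToolBox:
    """Holds tools, but only lets a caller use the ones granted for the current step."""

    def __init__(self, tools):
        self._tools = dict(tools)
        self._granted = frozenset()

    @contextmanager
    def step(self, *names):
        """Grant access to the named tools for the duration of one step."""
        previous = self._granted
        self._granted = frozenset(names)
        try:
            yield self
        finally:
            self._granted = previous            # authority ends with the step

    def call(self, name, *args):
        if name not in self._granted:
            raise PermissionError(f"tool {name!r} is not granted in this step")
        return self._tools[name](*args)

# Hypothetical tools for illustration only.
tools = ToolBox({
    "read_file": lambda path: f"contents of {path}",
    "send_email": lambda to: f"sent to {to}",
})

with tools.step("read_file") as t:
    text = t.call("read_file", "notes.txt")     # allowed inside the step

try:
    tools.call("read_file", "notes.txt")        # the step is over, so this fails
    carried_over = True
except PermissionError:
    carried_over = False
```

The point of the design is that nothing has to remember to revoke access: leaving the `with` block restores the previous grant, so permissions cannot silently carry into the next step.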
