frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Beaver: An Efficient Deterministic LLM Verifier

https://arxiv.org/abs/2512.05439
1•tshanmu•7h ago

Comments

tshanmu•7h ago
As large language models (LLMs) transition from research prototypes to production systems, practitioners often need reliable methods to verify that model outputs satisfy required constraints. While sampling-based estimates provide an intuition of model behavior, they offer no sound guarantees. We present BEAVER, the first practical framework for computing deterministic, sound probability bounds on LLM constraint satisfaction. Given any prefix-closed semantic constraint, BEAVER systematically explores the generation space using novel token trie and frontier data structures, maintaining provably sound bounds at every iteration. We formalize the verification problem, prove soundness of our approach, and evaluate BEAVER on correctness verification, privacy verification and secure code generation tasks across multiple state of the art LLMs. BEAVER achieves 6 to 8 times tighter probability bounds and identifies 3 to 4 times more high risk instances compared to baseline methods under identical computational budgets, enabling precise characterization and risk assessment that loose bounds or empirical evaluation cannot provide.

Space Data Center SIM

https://astrocompute.dev/
1•printerlover•1m ago•0 comments

Learning a new programming language with an LLM

https://feeding.cloud.geek.nz/posts/learning-new-programming-language-with-ai/
1•edward•2m ago•0 comments

Role of anthropogenic climate change in wildfire smoke concentrations in the US

https://www.pnas.org/doi/10.1073/pnas.2421903122
1•bikenaga•3m ago•0 comments

Microplastic exposure is associated with epigenomic effects in model organism

https://pubmed.ncbi.nlm.nih.gov/38742563/
2•donsupreme•4m ago•0 comments

Dafny: Verification-Aware Programming Language

https://dafny.org/
1•handfuloflight•5m ago•0 comments

Efficient Dockerfile templating for complex build scenarios

https://gagor.pro/2025/01/efficient-dockerfile-templating-for-complex-build-scenarios/
1•___timor___•7m ago•0 comments

I Ported JustHTML from Python to JavaScript with Codex CLI and GPT-5.2 in 4.5h

https://simonwillison.net/2025/Dec/15/porting-justhtml/
2•pbowyer•7m ago•0 comments

Google Fi Web Calls

https://fi.google.com/webcalls/calls
1•pcvetkovski•8m ago•0 comments

Launching ChinaRxiv, an automated translation pipeline of all Chinese preprints

https://twitter.com/seconds_0/status/2000606845644505093
1•Anon84•15m ago•0 comments

The "Commons Clause" License Condition

https://commonsclause.com/
1•Kerrick•23m ago•0 comments

Show HN: BoardSpace – AI that draws on a whiteboard in realtime for Calculus

https://www.useboardspace.com/
1•jonnotdoe•23m ago•1 comments

Texas sues biggest TV makers, alleging smart TVs spy on users without consent

https://arstechnica.com/tech-policy/2025/12/texas-sues-biggest-tv-makers-alleging-smart-tvs-spy-o...
9•c420•25m ago•7 comments

The Disappointing Truth About Wi-Fi 7: Multi-Link Operation Isn't Here Yet

https://www.rtings.com/router/learn/research/wifi-7-mlo
1•dokeeffe•25m ago•1 comments

Using Cursor's Bugbot to Spot Issues Early in Pull Requests

https://medium.com/@ali-dev/using-cursor-bugbot-to-spot-issues-early-0cdc142fbaff
1•stringtoint•26m ago•0 comments

The Writer Who Dared Criticize Silicon Valley

https://www.nytimes.com/2025/11/27/technology/writer-silicon-valley-criticism.html
3•petethomas•30m ago•0 comments

Show HN: Calm Companies – Businesses where less is more

https://calmcompanies.club
3•RaulOnRails•30m ago•1 comments

Glycemic index, glycemic load, and risk of dementia

https://academic.oup.com/ije/article-abstract/54/6/dyaf182/8313011?redirectedFrom=fulltext
1•bikenaga•32m ago•1 comments

What the Soviets Found on Venus

https://vinyasi.substack.com/p/what-the-soviets-found-on-venus
2•vinyasi•32m ago•0 comments

Write a Simple Code Agent using moonbitlang/async

https://www.moonbitlang.com/blog/moonbit-async-code-agent
1•necrodome•33m ago•0 comments

Read and Learn: open-source language learning app

https://readandlearn.app/
1•waveywaves•36m ago•1 comments

Breach at South Korea's Equivalent of Amazon Exposed Data of Almost Every Adult

https://www.wsj.com/world/asia/breach-at-south-koreas-equivalent-of-amazon-exposed-data-of-almost...
5•bookofjoe•37m ago•1 comments

Nicholas Deak

https://en.wikipedia.org/wiki/Nicholas_Deak
1•petethomas•37m ago•0 comments

Show HN: The Mirsky Ratio–Measuring R&D vs. SG&A as a predictor of S&P 100

https://substack.com/inbox/post/181826707
2•TheMirskyLimit•38m ago•1 comments

Who has enjoyed using PR code reviewers? What worked and what didn’t?

2•yashwantphogat•38m ago•1 comments

UK to rejoin EU's Erasmus student exchange programme

https://www.theguardian.com/world/2025/dec/16/uk-to-rejoin-eu-erasmus-student-exchange-programme
5•sandbach•39m ago•0 comments

Wall Street banks prepare for round-the-clock stock trading, reluctantly

https://www.reuters.com/business/finance/wall-street-banks-prepare-round-the-clock-stock-trading-...
3•gardncl•39m ago•0 comments

Director of MIT's Plasma and Fusion Center, Dies at 47

https://news.mit.edu/2025/nuno-loureiro-professor-director-plasma-science-and-fusion-center-dies-...
3•jacobedawson•42m ago•1 comments

Manifesto for AI Software Development: Code Is Cattle, Not Pets

https://metamagic.substack.com/p/manifesto-for-ai-software-development
1•r0ze-at-hn•44m ago•1 comments

Adding type-safe structs to Lua

https://if-not-nil.github.io/lua-structs/
1•qwool•44m ago•0 comments

Classify website content using text and screenshot

https://github.com/themains/piedomains
1•neehao•46m ago•0 comments