frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: A tiny reasoning layer that steadies LLM outputs (MIT; +22.4% accuracy)

https://github.com/onestardao/WFGY
5•TXTOS•2h ago
We kept shipping “simple” LLM features that were fluent-but-wrong. After too many postmortems we wrote down the failure patterns and added a small reasoning layer in front of the model. It’s model-agnostic, sits beside your existing stack, and you can implement it from a single PDF (MIT).

What’s inside the PDF

A problem map of 16 failure modes we kept hitting in real systems (OCR/layout drift, table-to-question mismatches, embedding≠meaning, pre-deploy collapse, etc.).

Four lightweight gates you can add today:

Knowledge-boundary canaries (empty/adversarial/known-fact probes).

ΔS “semantic jump” check to catch fluent nonsense when the draft answer drifts from retrieved context.

Layout-aware anchoring so chunking across PDFs/tables doesn’t silently break routing.

A minimal semantic trace for incident review (tiny, not full transcripts).

Bench snapshot (same model, with vs. without gates): Semantic Accuracy ↑ 22.4% · Reasoning Success Rate ↑ 42.1% · Stability ↑ 3.6×.

Traction (last ~50 days)

~2,400 downloads of the PDF.

~300 cold GitHub stars on related material (no marketing burst).

Also received a star from the creator of tesseract.js, which was nice validation from the OCR world.

Why this might be useful to you

You don’t need to swap models or vendors. The PDF describes checks you can drop into any RAG/agent/service pipeline.

No servers, SDKs, or proxy layers—just logic you can copy.

Link is Git Repo

Happy to answer HN-style questions (what breaks, where it fails, ablations, how we compute ΔS, etc.). If you try it and it doesn’t help, I’m also interested in the counter-examples.

with Terrseract (OCR legend) starred it verify it, we are WFFY on top1 https://github.com/bijection?tab=stars

Using Git Worktrees for Development

https://blog.kulman.sk/git-worktree/
1•ig0r0•1m ago•0 comments

Diagrammatic algebra: On the road to category theory

https://chalkdustmagazine.com/features/diagrammatic-algebra-on-the-road-to-category-theory/
1•adamnemecek•1m ago•0 comments

True Unidirectional WiFi broadcasting of video data for FPV Drones

https://befinitiv.wordpress.com/2015/01/25/true-unidirectional-wifi-broadcasting-of-video-data-for-fpv/
1•tak2hu•2m ago•0 comments

Red-teaming a RAG app: What happens?

http://blog.pamelafox.org/2025/08/red-teaming-rag-app-what-happens.html
1•pamelafox•2m ago•0 comments

A modest proposal for new holidays to manage your digital life

https://daverupert.com/2025/08/digital-holidays/
1•tobr•2m ago•0 comments

Show HN: Host local-only MCP tools in the cloud with Streamable HTTP

1•frafdez•3m ago•0 comments

Ask HN: How to build a 2D wave-like line graph that responds to keyboard events?

1•absoluteunit1•6m ago•0 comments

Tandy Corporation, Part 4 – By Bradford Morgan White

https://www.abortretry.fail/p/tandy-corporation-part-4
1•rbanffy•6m ago•0 comments

I Asked Four Former Friends Why We Stopped Speaking-Here's What I Learned (2023)

https://www.vogue.com/article/reconnecting-with-ex-friends
2•mooreds•7m ago•0 comments

Qwen-Image – a 20B MMDiT model for next-gen text-to-image generation

https://twitter.com/Alibaba_Qwen/status/1952398250121756992
1•tosh•8m ago•0 comments

Show HN: Modos Developer Kit Live on Crowd Supply

https://www.crowdsupply.com/modos-tech/modos-paper-monitor
1•alex-a-soto•9m ago•0 comments

OpenAI Transparency Letter

https://www.openai-transparency.org/
2•fzliu•10m ago•0 comments

Castro Podcasts – iPad and Device Sync

https://castro.fm/blog/device-sync-and-ipad
1•dabluck•11m ago•0 comments

Evaluation Algorithms for Parametric Curves and Surfaces

https://www.mdpi.com/2227-7390/13/14/2248
1•PaulHoule•14m ago•0 comments

Squashing my dumb bugs and why I log build IDs

https://rachelbythebay.com/w/2025/08/03/scope/
1•zdw•15m ago•0 comments

LLMs Aren't Just for Sissies

https://mattsayar.com/llms-arent-just-for-sissies/
1•MattSayar•16m ago•0 comments

Staan : European Search Index and API

https://staan.ai
1•maelito•17m ago•0 comments

Robin Berjon: Web Standards

https://protocol.ecologies.info/interviews/berjon-web_standards/
1•ntnsndr•17m ago•0 comments

JavaOne 2026 Dates Announced

https://inside.java/2025/08/04/javaone-returns-2026/
1•Sharat_Chander•19m ago•1 comments

A proof is that which is convincing

https://substack.com/inbox/post/170099481
1•mathattack•19m ago•0 comments

Updated Portal Map Editor in Battlefield 6 Runs on Godot Engine

https://80.lv/articles/updated-portal-map-editor-in-battlefield-6-runs-on-godot-engine
1•pjmlp•19m ago•0 comments

AI Embiggens the Big Clouds, Especially Microsoft

https://www.nextplatform.com/2025/08/01/ai-embiggens-the-big-clouds-especially-microsoft/
1•rbanffy•21m ago•0 comments

Firefox Has a New Home

https://windowsreport.com/firefox-has-a-new-home-mozilla-launches-dedicated-firefox-com-download-hub/
2•gwerbret•21m ago•0 comments

Leading phone repair and insurance firm collapses after paying ransomware demand

https://www.tomshardware.com/tech-industry/cyber-security/leading-phone-repair-and-insurance-firm-collapses-after-paying-crippling-ransomware-demand-cutting-100-employees-to-just-eight-wasnt-enough
2•speckx•21m ago•0 comments

What We're Optimizing ChatGPT For

https://openai.com/index/how-we%27re-optimizing-chatgpt
3•meetpateltech•24m ago•1 comments

Zed Shaw's Utu: Saving the internet with hate · weblog.masukomi.org

https://weblog.masukomi.org/2018/03/25/zed-shaws-utu-saving-the-internet-with-hate/
2•janandonly•28m ago•0 comments

You Should Probably Leave Substack

https://leavesubstack.com/
2•aintitthetruitt•28m ago•0 comments

Musk says he's bringing back Vine's archive

https://techcrunch.com/2025/08/04/elon-musk-says-hes-bringing-back-vines-archive/
2•thm•30m ago•1 comments

Tiger Mask Donation Phenomenon

https://en.wikipedia.org/wiki/Tiger_Mask_donation_phenomenon
2•thunderbong•31m ago•0 comments

Lyft Partners with Baidu to Deploy Autonomous Rides Across Europe

https://www.lyft.com/blog/posts/lyft-partners-with-baidu-to-deploy-autonomous-rides-across-europe
4•thm•33m ago•0 comments