frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Structured data extraction using local quantized LLMs

https://github.com/nxank4/loclean
1•nxank4•1h ago
Hi HN,

I built this library because I wanted a way to clean messy text data and extract PII without sending sensitive information to cloud APIs or dealing with brittle regex patterns.

The tool runs quantized models locally via llama.cpp and uses GBNF grammars generated from Pydantic models. This forces the LLM to output valid JSON strictly adhering to the schema, which solves the reliability issues common with small models. It currently supports Pandas and Polars dataframes and works with any GGUF model.

It is still an early alpha, so performance on older CPUs might be a bottleneck compared to standard string manipulation, but I found it useful for semantic extraction tasks where regex fails. I would appreciate any feedback on the implementation or suggestions for optimization.

A multi-entry CFG design conundrum

https://bernsteinbear.com/blog/multiple-entry/
2•fbuilesv•4m ago•0 comments

A Massacre in Mashhad

https://www.newyorker.com/news/as-told-to/a-massacre-in-mashhad
3•Tomte•6m ago•1 comments

Disney to Pay $10M in FTC Children's Data Settlement

https://natlawreview.com/article/court-approves-order-requiring-disney-pay-10mm-settle-ftc-attorn...
2•petethomas•8m ago•0 comments

Show HN: Lumina – Open-source observability for LLM applications

https://github.com/use-lumina/Lumina
3•iggycodexs•19m ago•0 comments

Hacker turned WiFi airwaves into LED art with a Raspberry Pi

https://www.theregister.com/2026/01/23/raspberry_pi_wifi_wall_art/
3•ghendelf•23m ago•0 comments

What Is Clawdbot? (and Why People Are Losing Their Minds over It)

https://twitter.com/noahepstein_/status/2015073824799371370
3•taubek•29m ago•0 comments

Kitty Cards

https://lmno.lol/alvaro/introducing-kitty-cards
3•todsacerdoti•29m ago•0 comments

'Fundamental reset': Scott Bessent has a plan to free the nation's banks

https://www.politico.com/news/2026/01/24/scott-bessent-banks-00744468
1•scrubs•32m ago•1 comments

Show HN: Markdown Viewer with LaTeX Math Support and Export to PDF/Word/HTML

https://markdownviewer.cc
1•LuckyBuddy•35m ago•0 comments

Box64 Expands into RISC-V and LoongArch territory

https://boilingsteam.com/box64-expands-into-risc-v-and-loong-arch-territory/
1•ekianjo•36m ago•0 comments

Neveragain.tech

https://neveragain.tech/
3•m-hodges•37m ago•0 comments

Show HN: Run world-class focus groups in minutes

https://chatgpt.com/g/g-6835306f9c48819185d3a665c09cc5d2-focus-groups-like-a-pro
1•vassilbek•38m ago•0 comments

Part 1: IndiaAI mission does not need compute, it needs data

https://gpt3experiments.substack.com/p/part-1-indiaai-mission-does-not-need
1•nutanc•38m ago•0 comments

JVic – The web-based VIC 20 emulator built with libGDX

https://vic20.games/
1•duck•46m ago•0 comments

The most precise mechanical indicators ever made – The Mikrokator [video]

https://www.youtube.com/watch?v=_HIKmxHcxkg
1•pillars•53m ago•0 comments

Apt-bundle: brew bundle for apt

https://github.com/apt-bundle/apt-bundle
1•sadeshmukh•53m ago•0 comments

Show HN: Document your tRPC API without mapping to OpenAPI

1•liorcodev•53m ago•0 comments

Deaths and deportations of citizens by Trump administration

https://en.wikipedia.org/wiki/Deaths,_detentions_and_deportations_of_American_citizens_in_the_sec...
12•praptak•1h ago•0 comments

Jeffrey Way: I'm Done

https://www.youtube.com/watch?v=g_Bvo0tsD9s
3•doppp•1h ago•0 comments

Ask HN

2•mekod•1h ago•0 comments

Fast Joystick Read – Elon Musk on Usenet, 1994

https://twitter.com/UsenetGems/status/2004876626161721467
1•nomilk•1h ago•0 comments

World’s most powerful literary critic is on TikTok

https://www.newstatesman.com/culture/books/2026/01/the-worlds-most-powerful-literary-critic-is-on...
2•insistey•1h ago•0 comments

Non-Functional Requirements: The Secret Sauce Nobody Wants to Season

https://blog.hermesc.gr/non-functional-requirements-the-secret-sauce-nobody-wants-to-season/
1•puppion•1h ago•0 comments

Show HN: Voice to Text– Free browser-based speech-to-text with local projects

https://www.voicetotextonline.com/
1•digi_wares•1h ago•0 comments

Show HN: Structured data extraction using local quantized LLMs

https://github.com/nxank4/loclean
1•nxank4•1h ago•0 comments

Jurassic Park: SGI Computers (2010)

http://www.sgistuff.net/funstuff/hollywood/jpark.html
1•exvi•1h ago•0 comments

Earth's Rotation Limits IBIS Performance to 6.3 Stops

https://thecentercolumn.com/2020/01/17/earths-rotation-limits-ibis-performance-to-6-3-stops/
4•Geo_ge•1h ago•0 comments

Lawsuit Claims Meta Can See WhatsApp Chats in Breach of Privacy

https://www.bloomberg.com/news/articles/2026-01-25/lawsuit-claims-meta-can-see-whatsapp-chats-in-...
4•g-b-r•1h ago•0 comments

The Brain of the Greatest Solo Climber

https://nautil.us/the-strange-brain-of-the-worlds-greatest-solo-climber-236051/
1•blondie9x•1h ago•0 comments

'Jurassic Park' at 30: The Gift That Keeps on Giving (2023)

https://scriptmag.com/screenplays/jurassic-park-at-30-the-gift-that-keeps-on-giving
3•exvi•1h ago•0 comments