xinghaohuang•1h ago
I built this because I was tired of my RAG contexts getting clogged with HTML tags, invisible characters, and excessive whitespace from web scrapers.
It's a zero-dependency (well, almost) library designed to sit between your data source and your prompt construction.
Key features:
* Standardizes whitespace and strips HTML.
* Smart truncation (middle-out) to fit context windows.
* PII redaction for privacy.
* A "TokenPacker" to manage budget across multiple inputs (coming soon).
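For anyone unfamiliar with the term, "middle-out" truncation keeps the head and tail of a document and drops the middle, on the intuition that instructions and conclusions carry more signal than the body. This is not the library's actual implementation, just a minimal sketch of the idea, assuming token-level truncation with a placeholder marker:

```python
def truncate_middle(tokens: list[str], budget: int, marker: str = "[...]") -> list[str]:
    """Middle-out truncation: keep the head and tail, drop the middle.

    `tokens` is any pre-tokenized sequence; `budget` is the max length
    of the result, including one slot reserved for the marker.
    """
    if len(tokens) <= budget:
        return tokens  # already fits, nothing to do
    keep = budget - 1          # reserve one slot for the marker
    head = (keep + 1) // 2     # slightly favor the head on odd budgets
    tail = keep - head
    return tokens[:head] + [marker] + (tokens[-tail:] if tail else [])
```

A real implementation would also want to cut on sentence or paragraph boundaries rather than mid-token, but the budget accounting is the same.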
Fun fact: I just renamed it from "prompt-groomer" to "prompt-refiner" yesterday based on Reddit feedback (long story, lesson learned about naming!).
Benchmarks show it adds under 3ms of latency for a typical 10k-token context, which is negligible compared to the 20%+ token savings.
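If you want to sanity-check the latency claim on your own corpus, a generic harness along these lines works; the `refine` function here is a stand-in whitespace cleaner, not the library's API, so swap in the real call:

```python
import re
import statistics
import time

def refine(text: str) -> str:
    # Stand-in cleaner: collapse runs of whitespace.
    # Replace this body with the actual library call.
    return re.sub(r"\s+", " ", text).strip()

def bench(text: str, runs: int = 100) -> float:
    """Return the median per-call latency in milliseconds over `runs` iterations."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        refine(text)
        samples.append((time.perf_counter() - start) * 1000)
    return statistics.median(samples)

# Roughly 10k tokens of noisy, scraped-looking input.
doc = "Some   scraped\ttext  with\n\n junk   " * 2000
print(f"median latency: {bench(doc):.2f} ms")
```

Median is a better summary than mean here, since a single GC pause or cache miss can skew the average on short timings.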
Happy to answer any questions!