
Show HN: Sempress – 2× better compression for numeric data

https://sempress.net
4•jalyper•2d ago

Comments

jalyper•2d ago
I built a compression system specifically for numeric-heavy tables (IoT sensors, ML features, financial data). It uses learned vector quantization per column instead of treating tables as byte streams.

Key results on 100K-row datasets:
- IoT Telemetry: 8.08× (Sempress) vs 3.58× (Gzip) = +125%
- Sensor Physics: 5.88× vs 2.76× = +113%
- ML Features: 5.46× vs 3.09× = +77%
- Financial: 3.80× vs 2.51× = +51%
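For readers checking the arithmetic: the percentages above read as the relative gain in compression ratio over Gzip. A small sketch, assuming that interpretation (the function name `relative_gain` is just illustrative), recomputes them from the rounded ratios quoted in the post. The results land within about one percentage point of the stated figures, which is what you would expect if the originals were computed from unrounded ratios.

```python
def relative_gain(ours: float, baseline: float) -> float:
    """Percent improvement of one compression ratio over another."""
    return (ours / baseline - 1.0) * 100.0


# (Sempress ratio, Gzip ratio, gain reported in the post)
results = {
    "IoT Telemetry": (8.08, 3.58, 125),
    "Sensor Physics": (5.88, 2.76, 113),
    "ML Features": (5.46, 3.09, 77),
    "Financial": (3.80, 2.51, 51),
}

for name, (sempress, gzip_ratio, reported) in results.items():
    gain = relative_gain(sempress, gzip_ratio)
    print(f"{name}: recomputed {gain:.1f}% (post reports +{reported}%)")
```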

How it works:
- Auto-detects numeric vs categorical columns
- Learns K-Means codebook (k=64) per numeric column
- Encodes values as nearest centroid indices
- Optional residuals for precision-critical columns
- Packages with msgpack + zstd
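The steps above can be sketched roughly as follows for a single numeric column. This is not the Sempress code, only a minimal stdlib-only illustration of the idea under some assumptions: a plain Lloyd's k-means in one dimension, `struct` + `zlib` standing in for msgpack + zstd, and no residuals, so decoding is lossy and returns the nearest centroid. All function names (`kmeans_1d`, `nearest`, `encode_column`, `decode_column`) are hypothetical.

```python
import random
import struct
import zlib
from bisect import bisect_left


def nearest(centroids, v):
    """Index of the centroid closest to v (centroids must be sorted)."""
    j = bisect_left(centroids, v)
    if j == 0:
        return 0
    if j == len(centroids):
        return j - 1
    return j if centroids[j] - v < v - centroids[j - 1] else j - 1


def kmeans_1d(values, k=64, iters=20, seed=0):
    """Lloyd's algorithm on one column; returns a sorted list of centroids."""
    distinct = sorted(set(values))
    if len(distinct) <= k:
        return distinct  # few distinct values: the codebook is exact
    centroids = sorted(random.Random(seed).sample(distinct, k))
    for _ in range(iters):
        sums, counts = [0.0] * k, [0] * k
        for v in values:
            i = nearest(centroids, v)
            sums[i] += v
            counts[i] += 1
        # Empty clusters keep their previous centroid.
        centroids = sorted(
            sums[i] / counts[i] if counts[i] else centroids[i] for i in range(k)
        )
    return centroids


def encode_column(values, k=64):
    """Codebook + one-byte centroid indices, then general-purpose compression."""
    codebook = kmeans_1d(values, k)
    indices = bytes(nearest(codebook, v) for v in values)  # k <= 256 fits a byte
    header = struct.pack(f"<I{len(codebook)}d", len(codebook), *codebook)
    return zlib.compress(header + indices, 9)  # zlib here; the post uses zstd


def decode_column(blob):
    """Lossy reconstruction: each value becomes its centroid."""
    payload = zlib.decompress(blob)
    (n,) = struct.unpack_from("<I", payload, 0)
    codebook = struct.unpack_from(f"<{n}d", payload, 4)
    return [codebook[i] for i in payload[4 + 8 * n:]]


if __name__ == "__main__":
    rng = random.Random(1)
    column = [round(rng.gauss(20.0, 2.0), 2) for _ in range(2000)]
    blob = encode_column(column)
    restored = decode_column(blob)
    worst = max(abs(a - b) for a, b in zip(column, restored))
    print(f"{len(column)} values -> {len(blob)} bytes, max abs error {worst:.4f}")
```

With residuals left out, the reconstruction error is bounded by the distance to the nearest centroid; storing the per-value difference for precision-critical columns, as the post describes, would trade some of the compression ratio for accuracy.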

Paper: https://sempress.net/paper.pdf
Code: https://github.com/jalyper/sempress-core (MIT license, ~500 LOC)
Install: pip install -e .

Best for: 60%+ numeric columns, >10K rows, IoT/ML/finance
Still use gzip for: text-heavy tables, small files, real-time streaming

Independent research with AI coding assistance. All algorithmic decisions and experimental design are mine. Open to feedback and collaborators!

What would you use this for? Any datasets you'd like me to benchmark?
