frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Diagram as code tool with draggable customizations

https://github.com/RohanAdwankar/oxdraw
177•RohanAdwankar•12h ago•40 comments

Show HN: Chonky – a neural text semantic chunking goes multilingual

https://huggingface.co/mirth/chonky_mmbert_small_multilingual_1
29•hessdalenlight•21h ago•2 comments

Show HN: Shadcn/UI theme editor – Design and share Shadcn themes

https://shadcnthemer.com
109•miketromba•13h ago•34 comments

Show HN: LLM Rescuer – Fixing the billion dollar mistake in Ruby

https://github.com/barodeur/llm_rescuer
78•barodeur•1d ago•12 comments

Show HN: Zoto – low-level audio playback in Zig

https://github.com/braheezy/zoto
3•braheezy•5h ago•0 comments

Show HN: Piping in and Out of Emacs

https://github.com/agzam/mx-piper
3•iLemming•5h ago•0 comments

Show HN: Random Makers – Show HN and Product Hunt, but Faster and Not Corporate

https://makers.random.gg/
14•waynerd•15h ago•1 comments

Show HN: Status of my favorite bike share stations

https://blog.alexboden.ca/toronto-bike-share-status/
12•alexboden•13h ago•4 comments

Show HN: Dictly – Local, real‑time voice‑to‑text for macOS (sub‑100ms, no cloud)

https://dictly.app/
5•JannikJung•11h ago•2 comments

Show HN: MacOS Live Screensaver – A screensaver that plays live video streams

https://github.com/hauxir/macos-live-screensaver
61•hauxir•4d ago•40 comments

Show HN: NickelJoke – Pay a Nickel to Get a Joke Using X402 Micropayments

https://github.com/btahir/nickeljoke
2•bilater•9h ago•2 comments

Show HN: Git for LLMs – A context management interface

https://twigg.ai
98•jborland•2d ago•36 comments

Show HN: Sempress – 2× better compression for numeric data

https://sempress.net
4•jalyper•11h ago•1 comments

Show HN: LeafTok – Applied TikTok's Swipe UX to ePub/PDF Reading

https://leaftok.github.io/site/
4•iago-cavalcante•11h ago•1 comments

Show HN: Deta Surf – An open source and local-first AI notebook

https://github.com/deta/surf
134•mxek•2d ago•39 comments

Show HN: Tommy – Turn ESP32 devices into through-wall motion sensors

https://www.tommysense.com
101•mike2872•2d ago•78 comments

Show HN: A fast, privacy-first image converter that runs in browser

https://imageconverter.dev/
44•wainguo•1d ago•37 comments

Show HN: Path-security – Comprehensive path validation with 62 attack vectors

https://github.com/redasgard/path-security
2•redasgard•12h ago•0 comments

Show HN: Circalify – 10KB circular timeline library for annual planning

https://mahmoodseoud.github.io/circalify/
3•Matooize•13h ago•0 comments

Show HN: I created a small 2D game about an ant

https://github.com/aanthonymax/ant-and-apples
4•aanthonymax•14h ago•2 comments

Show HN: OpenSnowcat – A fork of Snowplow to keep open analytics alive

https://opensnowcat.io/
75•joaocorreia•2d ago•18 comments

Show HN: Nostr Web – decentralized website hosting on Nostr

https://nweb.shugur.com
101•karihass•2d ago•27 comments

Show HN: Centia.io – Open PostgreSQL/PostGIS back end for developers

https://centia.io/
4•mhoegh•22h ago•0 comments

Show HN: Sqlite3-dump - a fast SQLite to CSV and parquet

https://github.com/i64/sqlite3-dump
17•Gave4655•1d ago•3 comments

Show HN: Pyxis CodeCanvas a lightweight, client-side IDE for iPad and browsers

https://github.com/Stasshe/Pyxis-CodeCanvas
2•Stasshe•16h ago•0 comments

Show HN: I built a tech news aggregator that works the way my brain does

https://deadstack.net/recent
184•dreadsword•2d ago•97 comments

Show HN: Cuq – Formal Verification of Rust GPU Kernels

https://github.com/neelsomani/cuq
93•nsomani•3d ago•63 comments

Show HN: Gisia – A Lightweight Self-Hosted DevOps Platform

https://github.com/gisiahq/gisia
2•okoddcat•18h ago•1 comments

Show HN: Katakate – Dozens of VMs per node for safe code exec

https://github.com/Katakate/k7
122•gbxk•4d ago•53 comments

Show HN: Inspec – Specification scheduling software for interior designers

https://inspec.design
13•nick_cook•2d ago•0 comments
Open in hackernews

Show HN: Sempress – 2× better compression for numeric data

https://sempress.net
4•jalyper•11h ago

Comments

jalyper•11h ago
I built a compression system specifically for numeric-heavy tables (IoT sensors, ML features, financial data). Uses learned vector quantization per column instead of treating tables as byte streams.

Key results on 100K row datasets: - IoT Telemetry: 8.08× (Sempress) vs 3.58× (Gzip) = +125% - Sensor Physics: 5.88× vs 2.76× = +113% - ML Features: 5.46× vs 3.09× = +77% - Financial: 3.80× vs 2.51× = +51%

How it works: - Auto-detects numeric vs categorical columns - Learns K-Means codebook (k=64) per numeric column - Encodes values as nearest centroid indices - Optional residuals for precision-critical columns - Packages with msgpack + zstd

Paper: https://sempress.net/paper.pdf Code: https://github.com/jalyper/sempress-core (MIT license, ~500 LOC) Install: pip install -e .

Best for: 60%+ numeric columns, >10K rows, IoT/ML/finance Still use gzip for: Text-heavy tables, small files, real-time streaming

Independent research with AI coding assistance. All algorithmic decisions and experimental design are mine. Open to feedback and collaborators!

What would you use this for? Any datasets you'd like me to benchmark?