frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Searchable compression for JSON (p50≈0.18 ms; 10-min demo)

https://github.com/kodomonocch1/see_proto
3•kodomonocch1•2h ago
Hi! I built SEE (Semantic Entropy Encoding) because the “data tax” (storage/egress) and the “CPU tax” (decompress/parse) keep rising together.

Tradeoff: it’s not always smaller than Zstd, but it stays searchable while compressed and minimizes I/O. Key numbers (demo): combined≈19.5% of raw, skip≈99%, lookup p50≈0.18 ms (bloom≈0.30).

10-min reproduction (no marketing): 1) Download the Demo ZIP (Release). 2) Follow README_FIRST.md. 3) Run `python samples/quick_demo.py` → prints ratio/skip/bloom + p50/p95/p99.

ROI quick math: Savings/TB ≈ (1 − 0.195) × Price_per_GB × 1000 (e.g., $0.05/GB → ~$40/TB). NDA/VDR (private, no confidential info in public): [https://docs.google.com/forms/d/e/1FAIpQLScV2Ti592K3Za2r_WLU...]

Happy to answer technical questions (schema-aware layout, delta strategy, bloom density, skip heuristics, failure modes).

Comments

kodomonocch1•2h ago
“Why not just Zstd?” Short: Zstd-only can be smaller, but it isn’t searchable; you still pay I/O + CPU to decompress and parse JSON. SEE trades a bit of size for millisecond lookups and ~99% skipping, which often wins on TCO at scale.

“Will it hold on real data?” Short: Best on repetitive JSON/NDJSON (logs, events, telemetry). We provide a 10-minute demo so anyone can reproduce KPIs and stress it with their own patterns.

“Why not keep a separate index?” Short: Separate indexes add I/O/space and consistency overhead. SEE keeps searchability in the storage format, reducing random I/O and parse costs.

“Are the numbers cherry-picked?” Short: We publish p50/p95/p99, skip (present/absent), and bloom density. The demo script prints them all, along with raw and combined sizes.

Best of times, worst of times: record fossil-fuel profits inflation & inequality

https://www.sciencedirect.com/science/article/pii/S2214629625003020
1•mdhb•40s ago•0 comments

Arduino App Lab: Integrated Development Environment (IDE) for Arduino UNO Q

https://docs.arduino.cc/software/app-lab/
1•teleforce•2m ago•0 comments

How to Run WordPress completely from RAM

https://rickconlee.com/how-to-run-wordpress-completely-from-ram/
1•indigodaddy•2m ago•0 comments

Man accused of intentionally starting fire that destroyed Pacific Palisades

https://www.latimes.com/california/story/2025-10-08/palisades-fire-arrest
1•jaredwiener•3m ago•0 comments

Zcash Price Doubled

https://www.johndcook.com/blog/2025/10/08/zcash-price-doubled/
1•ibobev•5m ago•0 comments

I made a web tool that turns Markdown into presentation slides instantly

https://deckless.app
1•dkimster•7m ago•1 comments

The Most Important Invention Ever Is Glue [video]

https://www.youtube.com/watch?v=n1-5-O6IAWo
1•gmays•9m ago•0 comments

Enshitification with Cory Doctorow [YouTube] [video]

https://www.youtube.com/watch?v=P1EKQidRooc&list=PLet00UQnlQoUKqSB5-oFmrwpnnVc4C4A8&index=1
1•_joel•12m ago•0 comments

The Scaling Era: An Oral History of AI, 2019–2025

https://press.stripe.com/scaling
1•brandonb•12m ago•0 comments

Glue raises $20M Series A for agentic team chat

https://glue.ai/blog/20m-to-build-agentic-team-chat
7•kainosnoema•14m ago•0 comments

Hacking GTA V RP Servers Using Web Exploitation Techniques

https://nullpt.rs/hacking-gta-servers-using-web-exploitation
2•ibobev•15m ago•0 comments

Rendu: A JavaScript Hypertext Preprocessor

https://github.com/h3js/rendu
1•randomuxx•16m ago•1 comments

Show HN: Magic Vizion – highlight anything, visualize instantly with one click

https://chromewebstore.google.com/detail/columnsai/hfgfkpoildikklbmjnkedmapiopeacga
2•caoxhua•18m ago•0 comments

Show HN: KI Song Erstellen Kostenlos – AI Music Generator FüR Deutsche Musik

https://kisongerstellen.com/
2•kevinhacker•18m ago•0 comments

SoftBank to buy ABB robotics unit for $5.4B as it boosts its AI play

https://www.cnbc.com/2025/10/08/softbank-to-buy-abb-robotics-unit-for-5point4-billion-in-ai-push....
3•voxadam•20m ago•0 comments

Building What Matters in Product and Experience

https://comuniq.xyz/post?t=414
1•01-_-•21m ago•0 comments

Microsoft's Fluid Icons, Figma's ChatGPT Diagrams and Okay DEV's Creative Beta

https://uibits.co/p/microsoft-s-fluid-icons-figma-s-chatgpt-diagrams-okay-dev-s-creative-beta
3•Kristaps90•22m ago•0 comments

Women portrayed as younger than men online, and AI amplifies the bias

https://newsroom.haas.berkeley.edu/news-release/women-portrayed-as-younger-than-men-online-and-ai...
7•geox•22m ago•0 comments

Show HN: Solving the cluster 1 problem with vCluster standalone

https://www.vcluster.com/blog/vcluster-standalone-multi-tenancy-kubernetes
5•saiyampathak•25m ago•0 comments

What fully automated firms will look like

https://www.dwarkesh.com/p/ai-firm
1•rzk•25m ago•0 comments

Doctorow: American Tech Cartels Use Apps to Break the Law

https://lithub.com/how-american-tech-cartels-use-apps-to-break-the-law/
41•ohjeez•28m ago•3 comments

Show HN: I built a local-first podcast app

https://wherever.audio
3•aegrumet•29m ago•0 comments

Rebuild the World

http://www.rebuildworld.net/
1•infovi•29m ago•2 comments

Major protests against corruption in the Philippines

https://www.wsws.org/en/articles/2025/09/22/zhyf-s22.html
2•PaulHoule•30m ago•0 comments

3rd Circuit: CFAA Does Not Turn Workplace Policy Infractions into Federal Crimes [pdf]

https://www2.ca3.uscourts.gov/opinarch/241123ppan.pdf
5•ivl•31m ago•3 comments

From Zero Code to Live DApp: Why We Built an AI Launchpad for Web3 Founders

https://0xminds.com/
1•silasomen•32m ago•1 comments

What RSS is and why we should keep using it (2022)

https://harisont.github.io/l-informatico-di-famiglia/2022/03/05/rss-en.html
2•linhns•34m ago•0 comments

An Event Mikeal Would Have Liked

https://an-event-mikeal-would-have-liked.com/
1•neom•36m ago•0 comments

Show HN: Autocache – Cut Claude API costs 90% (for n8n, Flowise, etc.)

https://github.com/montevive/autocache
1•jmrobles•36m ago•1 comments

JSON River – Parse JSON incrementally as it streams in

https://github.com/rictic/jsonriver
1•rickcarlino•36m ago•0 comments