Show HN: I made a new compression algorithm

https://github.com/BrowserBox/LZW-X
4•keepamovin•3h ago

Comments

cranberryturkey•2h ago
Middle out?
keepamovin•1h ago
Working on the Weissman score as we speak ;)
forgotpwd16•2h ago
You or Claude? Have you verified or reasoned through the claims made in the README? For starters, ZIP doesn't use LZW. It initially used an LZW-derived method with a reset mechanism, but v2 (early 90s) introduced DEFLATE, combining dictionary-based LZ77 and Huffman coding, which has become the de facto ZIP compression. And even this method, superior to LZW, is not considered state-of-the-art nowadays.
keepamovin•1h ago
You got me on the ZIP/LZW mix-up -- that was a mistake in the readme drafting. I'll fix that.

Regarding 'Me or Claude': The core concept (applying bioinformatics edit-distance/alignment to compression rather than just exact prefix matching) is something I worked on back in 2013. The implementation in this repo was heavily assisted by Claude, yes.

You're right that DEFLATE and modern algos (Zstd, Brotli) are the production standard. This project isn't trying to replace Zstd tomorrow; it's a research prototype testing the hypothesis that fuzzy matching + edit scripts can squeeze out entropy that exact-match dictionaries miss. The 8-10x slowdown means it's definitely experimental, but as a starting point for further exploration? That's what I want.

forgotpwd16•1h ago
This is a better presentation than the README, which is currently marketing-heavy and technically weak. The project as an experiment is acceptable and interesting, but it certainly isn't "next-generation" when it has (assuming the benchmarks are valid) a <0.2% ratio improvement over an outdated algorithm, at the expense (assuming the description is valid) of much worse compression/decompression speed. Note that such a slowdown isn't an implementation detail but expected by design; the neighbor graph, Levenshtein distance, edit scripts, etc. kill speed. In the end, compression is a trade-off between ratio and speed, and methods are benchmarked on both rather than one.
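To make the ratio-versus-speed point concrete, here is a minimal sketch of a harness that reports both numbers for each codec. It uses only Python's standard-library `zlib` (DEFLATE), `bz2`, and `lzma` modules as stand-ins; the names `benchmark`, `run_benchmarks`, and `CODECS` are hypothetical, and nothing here is taken from the LZW-X repository.

```python
import bz2
import lzma
import time
import zlib

# Stand-in codecs from the standard library: name -> (compress, decompress).
CODECS = {
    "zlib (DEFLATE)": (lambda d: zlib.compress(d, 9), zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
    "lzma": (lzma.compress, lzma.decompress),
}


def benchmark(name, compress, decompress, data):
    """Time one round trip and report ratio plus both speeds."""
    start = time.perf_counter()
    packed = compress(data)
    mid = time.perf_counter()
    restored = decompress(packed)
    end = time.perf_counter()
    if restored != data:
        raise ValueError(f"{name}: round trip did not reproduce the input")
    return {
        "name": name,
        "ratio": len(data) / len(packed),  # higher is better
        "compress_s": mid - start,         # lower is better
        "decompress_s": end - mid,         # lower is better
    }


def run_benchmarks(data):
    """Benchmark every codec in CODECS on the same input."""
    return [benchmark(name, c, d, data) for name, (c, d) in CODECS.items()]


if __name__ == "__main__":
    sample = b"the quick brown fox jumps over the lazy dog\n" * 20000
    for row in run_benchmarks(sample):
        print(f"{row['name']:<15} ratio={row['ratio']:.1f} "
              f"c={row['compress_s']:.3f}s d={row['decompress_s']:.3f}s")
```

A single highly repetitive sample like this one flatters every codec, so a fair comparison would run the same harness over a standard corpus and report all three columns side by side.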

As an overall note, AIs, when you prompt "apply concept X in Y" (or anything really), will tell you what a great idea it is and then output something that, without domain knowledge, you have no idea whether it's correct or even makes sense at all. If you don't want to do a literature search/study, I recommend at least throwing the design back to the machine and asking for a critique.

keepamovin•19m ago
I told the AI to make the README more enthusiastic; it turned out hyped and threw folks off. I wasn't intending to hit any bs detector, but I see now why some would think that, and it is what it is.

Here's what actually happened: the path to get here was about as far from a 'one-shot' as you can get. The first iteration (Basic LZW + unbounded edit scripts + Huffman) was roughly 100x slower. I spent hours guiding the implementation through specific optimization attempts:

- BK-trees for lookups (eventually discarded as slow).

- Then moving to arithmetic coding: first coding both codes and scripts together, later splitting them.

- Various strategies for pruning/resetting unbounded dictionaries.

- Finally landing on a fixed dict size with a Gray-Code-style nearest neighbor search to cap the exploration.
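For readers unfamiliar with the first item in the list above, a BK-tree indexes words by their edit distance to a parent node, so a fuzzy lookup can skip whole subtrees using the triangle inequality. The sketch below is a generic textbook version, not the code that was tried and discarded in the repo; the names `BKTree`, `add`, and `search` are illustrative.

```python
def levenshtein(a, b):
    """Classic edit distance using two rolling rows."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        current = [i]
        for j, cb in enumerate(b, 1):
            current.append(min(previous[j] + 1,                # deletion
                               current[j - 1] + 1,             # insertion
                               previous[j - 1] + (ca != cb)))  # substitution
        previous = current
    return previous[-1]


class BKTree:
    """Each node is (word, {distance_to_child: child_node})."""

    def __init__(self, distance=levenshtein):
        self.distance = distance
        self.root = None

    def add(self, word):
        if self.root is None:
            self.root = (word, {})
            return
        node = self.root
        while True:
            d = self.distance(word, node[0])
            if d == 0:
                return  # word is already stored
            child = node[1].get(d)
            if child is None:
                node[1][d] = (word, {})
                return
            node = child

    def search(self, query, max_distance):
        """Return sorted (distance, word) pairs within max_distance of query."""
        if self.root is None:
            return []
        matches, stack = [], [self.root]
        while stack:
            word, children = stack.pop()
            d = self.distance(query, word)
            if d <= max_distance:
                matches.append((d, word))
            # Triangle inequality: only edges in [d - k, d + k] can hold matches.
            for edge, child in children.items():
                if d - max_distance <= edge <= d + max_distance:
                    stack.append(child)
        return sorted(matches)
```

Every visited node still costs a full distance computation, which is one plausible reason a structure like this can end up too slow inside a compressor's inner loop.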

The AI suggested some tactical fixes (like capping the Levenshtein table, splitting edits/codes in arithmetic coding), but the architectural pivots came from me. I had to find the winning path. I stopped when the speed hit 'sit-there-and-watch-it-able' (approx 15s for 2MB) and the ratio consistently beat LZW (interestingly, for smaller dicts, which makes sense, as the edit scripts make each word more expressive).
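One possible reading of "capping the Levenshtein table" is to stop the distance computation as soon as the result can no longer fall under a threshold. The sketch below shows that idea in general; whether the repo caps the table this way is an assumption, and `bounded_levenshtein` and `limit` are made-up names.

```python
def bounded_levenshtein(a, b, limit):
    """Return the edit distance if it is <= limit, otherwise None.

    Row minima never decrease from one row to the next, so once an
    entire row exceeds the limit the final distance must exceed it too.
    """
    if abs(len(a) - len(b)) > limit:
        return None  # the length gap alone already costs more than the limit
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        current = [i]
        for j, cb in enumerate(b, 1):
            current.append(min(previous[j] + 1,                # deletion
                               current[j - 1] + 1,             # insertion
                               previous[j - 1] + (ca != cb)))  # substitution
        if min(current) > limit:
            return None  # early exit: no path can recover
        previous = current
    return previous[-1] if previous[-1] <= limit else None
```

With a small `limit`, most non-matching candidates are rejected after a few rows instead of filling the whole table.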

That was my bar: Is it real? Does it work? Can it beat LZW? Once it did, I shared it. I was focused on the bench accuracy, not the marketing copy. I let the AI write the hype readme - I didn't really think it mattered.

keepamovin•2h ago
LZW is the algorithm used in compress and also in GIF. It is a beautifully elegant and simple algorithm (based on learning a dictionary of words, and encoding the source as their indices) that converges in the limit on the Shannon entropy of the source.
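As a reference point for the description above, here is a minimal sketch of textbook LZW: the dictionary starts with all 256 single bytes, grows by one entry per emitted code, and the output is simply the list of dictionary indices. It leaves out code-width packing and dictionary resets, and it is not the LZW-X implementation.

```python
def lzw_compress(data: bytes) -> list[int]:
    """Encode bytes as a list of dictionary indices (no bit packing)."""
    table = {bytes([i]): i for i in range(256)}
    out = []
    current = b""
    for byte in data:
        candidate = current + bytes([byte])
        if candidate in table:
            current = candidate              # keep extending the match
        else:
            out.append(table[current])       # emit the longest known word
            table[candidate] = len(table)    # learn the word plus one byte
            current = bytes([byte])
    if current:
        out.append(table[current])
    return out


def lzw_decompress(codes: list[int]) -> bytes:
    """Rebuild the same dictionary on the fly and invert lzw_compress."""
    if not codes:
        return b""
    table = {i: bytes([i]) for i in range(256)}
    previous = table[codes[0]]
    out = [previous]
    for code in codes[1:]:
        if code in table:
            entry = table[code]
        elif code == len(table):
            # The code refers to the entry that is about to be created.
            entry = previous + previous[:1]
        else:
            raise ValueError(f"invalid LZW code: {code}")
        out.append(entry)
        table[len(table)] = previous + entry[:1]
        previous = entry
    return b"".join(out)
```

Note that new words are only ever formed by appending one byte to an existing word, which is the restriction the edit-script idea tries to relax.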

In 2013, I was studying bioinformatics and had an idea to apply something like sequence alignment and edit scripts to compression instead of, as in LZW, only appending to the end of the string. So, the idea for LZW-X was born long ago, but it wasn't until recently, by the power of AI, that I could implement and test it properly.
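To illustrate what an edit script is in this context, the sketch below computes a minimal Levenshtein edit script between two strings and replays it. The hypothetical connection to compression is that a near-miss string could be sent as "dictionary word + short script" rather than as a fresh entry. The op format and the names `edit_script` and `apply_script` are assumptions for illustration, not the encoding used in the repo.

```python
def edit_script(source: str, target: str) -> list[tuple]:
    """Return a minimal list of edits that turns source into target.

    Ops refer to positions in the source and are ordered right to left:
    ("sub", i, ch), ("del", i), ("ins", i, ch) where ins goes before index i.
    """
    n, m = len(source), len(target)
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = source[i - 1] != target[j - 1]
            dist[i][j] = min(dist[i - 1][j] + 1,         # delete
                             dist[i][j - 1] + 1,         # insert
                             dist[i - 1][j - 1] + cost)  # match or substitute
    ops = []
    i, j = n, m
    while i > 0 or j > 0:  # trace back from the bottom-right corner
        differs = i > 0 and j > 0 and source[i - 1] != target[j - 1]
        if i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + differs:
            if differs:
                ops.append(("sub", i - 1, target[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            ops.append(("del", i - 1))
            i -= 1
        else:
            ops.append(("ins", i, target[j - 1]))
            j -= 1
    return ops


def apply_script(source: str, ops: list[tuple]) -> str:
    """Replay ops in order; right-to-left ordering keeps indices valid."""
    chars = list(source)
    for op in ops:
        if op[0] == "sub":
            chars[op[1]] = op[2]
        elif op[0] == "del":
            del chars[op[1]]
        else:
            chars.insert(op[1], op[2])
    return "".join(chars)
```

For example, `apply_script("kitten", edit_script("kitten", "sitting"))` reproduces `"sitting"` from three edits; whether a script like that costs fewer bits than a new dictionary entry is exactly the question the benchmarks have to answer.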

This is that proper implementation and it reveals what I intuited: that there are gains to be had using a method like this. I consider this a first rung, a starting point for further exploration.

Check it out: https://github.com/BrowserBox/LZW-X
