frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN:Novel Compression Algorithm Based on Pattern Similarity Unlike the Other

1•Forgret•3h ago
I've developed a fundamentally different compression algorithm called FSP (Find Similar Patterns) that breaks from traditional dictionary and statistical compression methods. While most compression algorithms struggle with small files due to inherent overhead, FSP takes a completely different approach that makes it uniquely effective for specific use cases.

How FSP Differs from Traditional Compression:

Unlike LZ77/LZ78 (used in ZIP), LZMA, Huffman coding, or other common algorithms, FSP doesn't rely on:

· Sliding window dictionary approaches · Frequency statistics or entropy coding · Block-based compression with fixed headers · Move-to-front transformations or Burrows-Wheeler transforms

Instead, FSP uses a pattern similarity approach that:

1. Identifies similar patterns across data chunks 2. Stores only differential changes between patterns 3. Uses position-based editing rather than token replacement 4. Has virtually no overhead for highly similar data

The Unique Advantages:

What makes FSP novel isn't just that it handles small files well – it's that it represents a different philosophical approach to compression:

1. No Dictionary Overhead: Unlike LZ variants that must store dictionaries, FSP only stores differences 2. Position-Aware Editing: Rather than replacing tokens, FSP uses exact positional editing 3. Similarity-Based: Excels where data has structural similarity rather than just byte-level repetition 4. Stream-Compatible: Processes data in logical chunks rather than fixed blocks

Technical Differentiators:

· Unlike LZ77: No sliding window or distance-length encoding · Unlike Huffman/Arithmetic: No frequency tables or probability models · Unlike RLE: Handles non-sequential patterns and similarities · Unlike BWT: Doesn't require full data transformation and rearrangement

Where FSP Excels:

The algorithm particularly shines in:

· Versioned data: Where consecutive versions have minor changes · Structured records: Database entries with similar schema but different values · Sensor data: Regular readings with small fluctuations · Log files: Similar log entries with varying parameters · Genomic data: Sequences with localized variations

Performance Characteristics:

In testing, FSP achieves what traditional compressors cannot: consistent compression of very small data chunks without the overhead that typically plagues small-file compression. Where ZIP might add 50+ bytes of overhead, FSP adds virtually none for similar patterns.

Open Questions for Discussion:

I'm particularly interested in the HN community's thoughts on:

1. Theoretical classification of this approach within compression taxonomy 2. Potential hybrid approaches combining FSP with traditional methods 3. Mathematical analysis of the similarity detection problem 4. Applications in distributed systems where small payload compression matters 5. Comparisons to other non-traditional compression approaches

The algorithm is open-source under LGPL 3.0, and I welcome both theoretical and practical contributions. Sometimes innovation comes not from better implementations of existing approaches, but from fundamentally different ways of thinking about problems.

GitHub: https://github.com/Ferki-git-creator/fsp Website(more info): https://ferki-git-creator.github.io/fsp/

Nicholas (Nick) J. Fuentes

https://x.com/NickJFuentes
1•barrister•3m ago•0 comments

Every Commodore Amiga Model Ever Made [video]

https://www.youtube.com/watch?v=JUwpkKVw0Xk
1•rbanffy•4m ago•0 comments

Training to Improve Memory

https://ethz.ch/en/news-and-events/eth-news/news/2025/09/press-release-training-to-improve-memory...
1•geox•7m ago•0 comments

David Baltimore, Nobel-Winning Molecular Biologist, Dies at 87

https://www.nytimes.com/2025/09/07/science/david-baltimore-dead.html
1•mitchbob•9m ago•1 comments

Pre-owned software trial kicks off in UK as Microsoft pushes resale ban

https://www.theregister.com/2025/09/08/microsoft_valuelicensing_latest/
1•beardyw•9m ago•0 comments

Lolgato: Advanced controls for Elgato lights on macOS

https://github.com/raine/Lolgato
1•rane•11m ago•0 comments

Show HN: Search the IndieWeb, one query at a time

https://search.indieblog.page/search
1•splitbrain•12m ago•1 comments

Don't Build an RL Environment Startup

https://benanderson.work/blog/dont-build-rl-env-startup/
1•jxmorris12•12m ago•0 comments

MacBook lid angle sensor sound effects

https://github.com/samhenrigold/LidAngleSensor
1•fanf2•14m ago•0 comments

Show HN: AIHint – Open standard for verifiable website trust metadata

https://github.com/Ai-Hint/aihint-standard
1•aihint•16m ago•1 comments

Show HN: The Daily Word Game Experience

https://wafflegames.net/
1•yangyiming•17m ago•0 comments

TS framework introspectable by AI via GraphQL

https://runner.bluelibs.com/
1•theodordiaconu•19m ago•0 comments

Beyond package management: How Nix refactored my digital life

https://www.jimmyff.co.uk/blog/beyond-package-management-how-nix-refactored-my-digital-life/
1•jimmyff•19m ago•1 comments

Undersea cables cut in Red Sea, disrupting internet access in Asia and Mideast

https://apnews.com/article/red-sea-undersea-cables-cut-internet-disruption-yemen-b79fe7b9764647ac...
1•cobbzilla•21m ago•0 comments

ButterBarTheGr8's Aug 15, 2025 comment in "Unsuitable SSD/NVMe hardware for ZFS"

https://github.com/openzfs/zfs/discussions/14793
1•sipofwater•22m ago•4 comments

Will AI Choke Off the Supply of Knowledge?

https://www.wsj.com/tech/ai/will-ai-choke-off-the-supply-of-knowledge-8a71cbcd
2•throw0101a•26m ago•1 comments

Source Cooperative

https://source.coop/
1•marklit•27m ago•0 comments

Ask HN: What program is running on this 1996 laptop?

1•fcpguru•29m ago•0 comments

Tor VPN Beta (Android)

https://play.google.com/store/apps/details?id=org.torproject.vpn&hl=en_US
2•HelloUsername•30m ago•0 comments

14 Killed in protests in Nepal over social media ban

https://www.tribuneindia.com/news/world/massive-protests-in-nepal-over-social-media-ban/
45•whatsupdog•31m ago•11 comments

Ask HN: Would Windows users want a native multi-model AI client?

1•120-dev•31m ago•0 comments

The Dropshipping Problem: Youth Digital Marketing Gone Wrong

2•haebom•35m ago•0 comments

Trillion Dollar Elephants

https://dbushell.com/2025/09/08/trillion-dollar-elephants/
4•martinhath•36m ago•0 comments

Show HN: Silksong Map Online

https://silksongmap.co
2•restorephotoiip•40m ago•0 comments

Quantum router could speed up quantum computers

https://www.newscientist.com/article/2495375-quantum-router-could-speed-up-quantum-computers/
1•jonbaer•40m ago•0 comments

Alloyed agents: combining LLMs to improve AI code generation

https://www.enginelabs.ai/blog/alloyed-agents-combining-llms-to-improve-ai-code-generation
1•sdspurrier•41m ago•0 comments

OntoMotoOS: An "Operating System" Between Delusion and Scholarship

https://medium.com/@nettalk83/ontomotoos-on-the-borderline-between-delusion-and-scholarship-eb299...
1•nettalk83•42m ago•1 comments

Using Cursor Commands to Onboard a New Developer to a Repository

https://shlep.ai/using-cursor-commands-to-onboard-a-new-developer-to-a-repository/
1•royosherove•43m ago•0 comments

Show HN: I made a tool to turn anxiety-inducing news into short narrated videos

https://explainerslop.xyz/
1•simpnoza•46m ago•0 comments

Signal to start offering 100GB cloud storage

https://community.signalusers.org/t/beta-feedback-for-the-upcoming-android-7-56-release/71434
1•xz18r•55m ago•0 comments