Diff Algorithms

https://flo.znkr.io/diff/

40•znkr•1h ago

Comments

yboris•51m ago

Mildly related: my favorite tool for viewing .git diffs diff2html - a CLI that with one command opens the diff in your browser

https://diff2html.xyz/ -- https://github.com/rtfpessoa/diff2html

ashu1461•44m ago

Apart from source code versioning what are the other most important real world use cases of diff algorithms ?

runningmike•37m ago

- Backup and restore

- integrity checks from security perspective

- nlp, finding same tokens in text

Etc

susam•21m ago

I encountered one about 17 years ago. It was for diffing IP packets, TCP segments, and network event payloads. At the time I worked at RSA Security on a network forensics and security analytics product, written in a mix of C and C++. In one of the projects I worked on, we needed to let users diff the packets, segments, and payloads. Back then we were very conservative about adding third-party libraries to the product. I have written more about that culture here: https://news.ycombinator.com/item?id=39951673

Long story short, due to the conservative culture, most data structures and algorithms were implemented in house. The diff algorithm for packets/segments/payloads was written in house too and I was the one to write it.

If I recall correctly, my implementation was based on a straightforward solution to the longest common subsequence problem. It ran in O(mn) time and O(min(m, n)) space, where m and n are the lengths of the two sequences. I knew there were more efficient algorithms, but this code was not performance critical. I chose to keep the implementation simple so anyone could understand it, learn it quickly and fix bugs if they arose. It served us well for the next seven years until the product was replaced with a new one.

On a related note, I sometimes miss that older style of software development where we would dive deep into a problem domain, master it, and design solutions ourselves. I am not being naively nostalgic though. I am very well aware that modern development, with its reliance on well established libraries, usually delivers much greater productivity and reliability. Still, I think the slower and more deliberate approach of building things from the ground up had a certain charm.

We Can Just Do Things (In ATProto)

FTC Sues Zillow and Redfin

The Network Effect of Intelligence

Google is blocking AI searches for Trump and dementia

Tile trackers are a stalker's dream, say Georgia Tech researchers

Nonprofits' use of flexible labor bad for outcomes, lacks long-term benefit

Show HN: InfiniteGpu, A platform enabling effortless AI compute power exchange

Television and the Public Interest (1961)

Australia offers to sell shares in critical minerals reserve to allies

Bad Productivity Frameworks

Amazon launches Vega OS, its Android replacement for Fire TV with no sideloading

Omarchy Is on the Move

Show HN: JPDB, GDB for Your Waveforms

Texas floods showed why many rural communities feel abandoned in a crisis

Zero-Based Numbering

Amazon Vega OS

Pre-Emptive Multi-Tasking on Arm Cortex-M

Feature documentary about the story of code and its builders (trailer)

Agentic Commerce Protocol

Life-Size Human Statue Found at Göbeklitepe

Meta reportedly buying RISC-V AI GPU firm Rivos

Hawala

Effect of Vitamin D2 Supplementation on 25-Hydroxyvitamin D3 Status

Spec-driven development: Using Markdown as a programming language with AI

Feds cut funding to program that shared cyber threat info with local governments

Show HN: FomoRobo – AI that reads your newsletters so you don't have to

CSS Utility Classes and "Separation of Concerns" (2017)

Our Stewardship: Where We Are, What's Changing and How We'll Engage

U.S. Army confirms Tesla Cybertruck can't be imported in Europe

Claude Sonnet 4.5 and the memory Omni-tool in Letta