frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Changeflow – Giving up on pixel diffs after 10 years of false positives

https://changeflow.com/
1•stevewillbe•1h ago
I've been building website monitoring tools since 2015. The core problem with pixel-diff screenshots: every ad rotation, every layout tweak = alert noise. Legal and compliance teams kept asking "just tell me WHAT changed."

So I rebuilt it. Changeflow extracts semantic changes and summarizes them in plain English:

- "FDA posted new adaptive trial guidance (Jan 15)" - "Competitor raised enterprise pricing 12%" - "9th Circuit issued opinion on arbitration agreements"

Instead of "47 pixels changed in the header region."

THE HARD TECHNICAL PROBLEMS

Scraping any URL (not just specific sites)

Unlike scrapers built for Amazon or LinkedIn, users give us any URL and expect it to work. Our approach:

Delayed-attach pattern: launch Chrome, let page load naturally, poll /json endpoint for title+URL stability, only THEN attach Puppeteer. Bot detection scripts run against a clean browser.

Three-tier fallback: Linux + datacenter proxy (90% of sites) -> Linux + mobile proxy (9%) -> macOS + real hardware (1%). Cache successful routes per-URL. Expensive path rarely fires.

Real Chrome, not Chrome for Testing (fingerprint detectable). On real Mac hardware, disable GPU spoofing entirely - genuine beats fake.

LLM costs at scale

Running AI on every fetch gets expensive. We cut costs 90%:

Strip nav/sidebars/footers before AI call (~60% token reduction). Model tiering: Llama 3.1 8B via Groq for extraction, Gemini Flash Lite for summaries, Claude only when quality matters.

Gemini cache trick: 1024+ token system prompts get 90% discount on repeat calls. Verbose prompts are actually cheaper.

Diffing beyond git diff

Git diff isn't enough. We add MD5 hashes to list items for move detection, use Levenshtein distance to distinguish edits from replacements, and clean temporal noise ("2 days ago") that creates false positives.

STACK

Rails + Postgres, Faktory workers, Node.js browser pool, Claude/Gemini/Llama via OpenRouter, Proxies from GridPanel and SquidProxies.

Happy to answer questions about the scraping, AI, or 10 years of lessons in this space.

Show HN: Webhook testing with instant URLs and real-time viewer

https://tools.pinusx.com/webhooks-home
1•dbhariprakash•1m ago•0 comments

China's Four-Year Energy Spree Has Eclipsed US Power Grid

https://www.bloomberg.com/news/articles/2026-01-28/china-s-four-year-energy-spree-has-eclipsed-en...
2•virgildotcodes•3m ago•0 comments

I've reported on UFO sightings for decades – and come to this conclusion

https://www.washingtonpost.com/opinions/interactive/2026/ufo-upa-sightings/
1•nabla9•4m ago•0 comments

Soil-based method can stop locust swarms from destroying crops

https://phys.org/news/2026-01-soil-based-method-locust-swarms.html
1•PaulHoule•4m ago•0 comments

Opinionated GitHub Action for generating high-quality SBOMs

https://github.com/sbomify/github-action
1•todsacerdoti•4m ago•0 comments

Catching the Next Telnetd-Class Security Bug

https://vartia.ai/posts/telnetd_cve/
1•briandw•6m ago•0 comments

Tmux for Claude Code but accessible from web browser and mobile

https://github.com/kirikov/teleclaude
1•Datkiri•6m ago•0 comments

I Hope You Get to Live Your Life as a Human Being

https://transgamerthoughts.com/post/802327706229456896/i-hope-you-get-to-live-your-entire-life-as...
1•hn_acker•8m ago•0 comments

We're All Beginners Again

https://matthewrocklin.com/ai-beginners/
1•vinhnx•8m ago•0 comments

Modern Pandas (2016)

https://tomaugspurger.net/posts/modern-1-intro/
1•tosh•9m ago•0 comments

The Pacific Northwest Tree Octopus

https://zapatopi.net/treeoctopus/
2•jeffjeffbear•9m ago•0 comments

US trade deficit widens by the most in nearly 34 years in November

https://finance.yahoo.com/news/us-trade-deficit-widens-most-144236696.html
3•thomassmith65•9m ago•2 comments

First Impressions of Readeck

https://www.autodidacts.io/readeck-open-source-read-it-later-app-with-kobo-support/
1•Curiositry•10m ago•0 comments

Data on Neocloud Adoption

https://www.hostingadvice.com/studies/neocloud-adoption/
1•ljh501•11m ago•0 comments

Advancing regulatory variant effect prediction with AlphaGenome

https://www.nature.com/articles/s41586-025-10014-0
1•mellosouls•12m ago•0 comments

US Congress asks Ford for more info on Chinese military battery partnership

https://chinaselectcommittee.house.gov/media/press-releases/moolenaar-questions-ford-about-its-ch...
1•737min•15m ago•1 comments

Create App store and Google Play store screenshots with AppLaunchpad

https://theapplaunchpad.com/
1•applaunchpad•15m ago•0 comments

We may get a trial on whether Elon Musk defrauded Twitter investors

https://bsky.app/profile/annmlipton.bsky.social/post/3mdkowyv7tk2p
4•doener•16m ago•0 comments

The 80% Problem in Agentic Coding

https://addyo.substack.com/p/the-80-problem-in-agentic-coding
1•vinhnx•17m ago•0 comments

New Game Plus

https://mar.coconauts.net/blog/posts/2025-01-29-new-game-plus/
1•marbartolome•17m ago•0 comments

Everyone's okay with their AI, just not yours

https://idiallo.com/blog/ai-is-ok-just-not-yours
1•Brajeshwar•17m ago•0 comments

Recreating the Smells of History

https://knowablemagazine.org/content/article/society/2026/recreating-the-smells-of-the-past
1•Brajeshwar•17m ago•0 comments

Ancient humans were seafaring far earlier than we realised

https://www.newscientist.com/article/2511681-ancient-humans-were-seafaring-far-earlier-than-we-re...
1•Brajeshwar•17m ago•0 comments

ClickBench

https://benchmark.clickhouse.com/
1•tosh•19m ago•0 comments

Don't invert established UX mental models

https://thoughts.wyounas.com/p/dont-invert-established-ux-mental
2•simplegeek•19m ago•0 comments

The Fancy Payment Cards of Taiwan

https://hackaday.com/2026/01/28/the-fancy-payment-cards-of-taiwan/
1•lxm•20m ago•0 comments

Anthropic Is at War with Itself

https://www.theatlantic.com/technology/2026/01/anthropic-is-at-war-with-itself/684892/
1•kerim-ca•23m ago•1 comments

Are Google navigation services getting worse?

https://ilearnt.com/blog/googleworse/
1•speckx•24m ago•1 comments

Something that I used to love

https://andreapivetta.com/posts/something-that-i-used-to-love.html
1•ziggy42•24m ago•0 comments

KiteSQL: Rust-native embedded SQL with TPC-C benchmarks and WASM support

https://github.com/KipData/KiteSQL
1•Jacques2Marais•25m ago•0 comments