frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Queueing Theory v2: DORA metrics, queue-of-queues, chi-alpha-beta-sigma notation

https://github.com/joelparkerhenderson/queueing-theory
1•jph•11m ago•0 comments

Show HN: Hibana – choreography-first protocol safety for Rust

https://hibanaworks.dev/
3•o8vm•13m ago•0 comments

Haniri: A live autonomous world where AI agents survive or collapse

https://www.haniri.com
1•donangrey•14m ago•1 comments

GPT-5.3-Codex System Card [pdf]

https://cdn.openai.com/pdf/23eca107-a9b1-4d2c-b156-7deb4fbc697c/GPT-5-3-Codex-System-Card-02.pdf
1•tosh•27m ago•0 comments

Atlas: Manage your database schema as code

https://github.com/ariga/atlas
1•quectophoton•30m ago•0 comments

Geist Pixel

https://vercel.com/blog/introducing-geist-pixel
2•helloplanets•33m ago•0 comments

Show HN: MCP to get latest dependency package and tool versions

https://github.com/MShekow/package-version-check-mcp
1•mshekow•40m ago•0 comments

The better you get at something, the harder it becomes to do

https://seekingtrust.substack.com/p/improving-at-writing-made-me-almost
2•FinnLobsien•42m ago•0 comments

Show HN: WP Float – Archive WordPress blogs to free static hosting

https://wpfloat.netlify.app/
1•zizoulegrande•43m ago•0 comments

Show HN: I Hacked My Family's Meal Planning with an App

https://mealjar.app
1•melvinzammit•44m ago•0 comments

Sony BMG copy protection rootkit scandal

https://en.wikipedia.org/wiki/Sony_BMG_copy_protection_rootkit_scandal
1•basilikum•46m ago•0 comments

The Future of Systems

https://novlabs.ai/mission/
2•tekbog•47m ago•1 comments

NASA now allowing astronauts to bring their smartphones on space missions

https://twitter.com/NASAAdmin/status/2019259382962307393
2•gbugniot•52m ago•0 comments

Claude Code Is the Inflection Point

https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point
3•throwaw12•53m ago•1 comments

Show HN: MicroClaw – Agentic AI Assistant for Telegram, Built in Rust

https://github.com/microclaw/microclaw
1•everettjf•53m ago•2 comments

Show HN: Omni-BLAS – 4x faster matrix multiplication via Monte Carlo sampling

https://github.com/AleatorAI/OMNI-BLAS
1•LowSpecEng•54m ago•1 comments

The AI-Ready Software Developer: Conclusion – Same Game, Different Dice

https://codemanship.wordpress.com/2026/01/05/the-ai-ready-software-developer-conclusion-same-game...
1•lifeisstillgood•56m ago•0 comments

AI Agent Automates Google Stock Analysis from Financial Reports

https://pardusai.org/view/54c6646b9e273bbe103b76256a91a7f30da624062a8a6eeb16febfe403efd078
1•JasonHEIN•59m ago•0 comments

Voxtral Realtime 4B Pure C Implementation

https://github.com/antirez/voxtral.c
2•andreabat•1h ago•1 comments

I Was Trapped in Chinese Mafia Crypto Slavery [video]

https://www.youtube.com/watch?v=zOcNaWmmn0A
2•mgh2•1h ago•0 comments

U.S. CBP Reported Employee Arrests (FY2020 – FYTD)

https://www.cbp.gov/newsroom/stats/reported-employee-arrests
1•ludicrousdispla•1h ago•0 comments

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
2•vladeta•1h ago•1 comments

Show HN: SVGV – A Real-Time Vector Video Format for Budget Hardware

https://github.com/thealidev/VectorVision-SVGV
1•thealidev•1h ago•0 comments

Study of 150 developers shows AI generated code no harder to maintain long term

https://www.youtube.com/watch?v=b9EbCb5A408
2•lifeisstillgood•1h ago•0 comments

Spotify now requires premium accounts for developer mode API access

https://www.neowin.net/news/spotify-now-requires-premium-accounts-for-developer-mode-api-access/
1•bundie•1h ago•0 comments

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•1h ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
2•birdculture•1h ago•1 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•1h ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
2•ramenbytes•1h ago•0 comments

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•1h ago•0 comments
Open in hackernews

Show HN: Changeflow – Giving up on pixel diffs after 10 years of false positives

https://changeflow.com/
1•stevewillbe•1w ago
I've been building website monitoring tools since 2015. The core problem with pixel-diff screenshots: every ad rotation, every layout tweak = alert noise. Legal and compliance teams kept asking "just tell me WHAT changed."

So I rebuilt it. Changeflow extracts semantic changes and summarizes them in plain English:

- "FDA posted new adaptive trial guidance (Jan 15)" - "Competitor raised enterprise pricing 12%" - "9th Circuit issued opinion on arbitration agreements"

Instead of "47 pixels changed in the header region."

THE HARD TECHNICAL PROBLEMS

Scraping any URL (not just specific sites)

Unlike scrapers built for Amazon or LinkedIn, users give us any URL and expect it to work. Our approach:

Delayed-attach pattern: launch Chrome, let page load naturally, poll /json endpoint for title+URL stability, only THEN attach Puppeteer. Bot detection scripts run against a clean browser.

Three-tier fallback: Linux + datacenter proxy (90% of sites) -> Linux + mobile proxy (9%) -> macOS + real hardware (1%). Cache successful routes per-URL. Expensive path rarely fires.

Real Chrome, not Chrome for Testing (fingerprint detectable). On real Mac hardware, disable GPU spoofing entirely - genuine beats fake.

LLM costs at scale

Running AI on every fetch gets expensive. We cut costs 90%:

Strip nav/sidebars/footers before AI call (~60% token reduction). Model tiering: Llama 3.1 8B via Groq for extraction, Gemini Flash Lite for summaries, Claude only when quality matters.

Gemini cache trick: 1024+ token system prompts get 90% discount on repeat calls. Verbose prompts are actually cheaper.

Diffing beyond git diff

Git diff isn't enough. We add MD5 hashes to list items for move detection, use Levenshtein distance to distinguish edits from replacements, and clean temporal noise ("2 days ago") that creates false positives.

STACK

Rails + Postgres, Faktory workers, Node.js browser pool, Claude/Gemini/Llama via OpenRouter, Proxies from GridPanel and SquidProxies.

Happy to answer questions about the scraping, AI, or 10 years of lessons in this space.