Show HN: Tracking book mentions across podcast episodes

1•steyeomans•51m ago
Hi HN! I built this tool because I wanted to answer a simple question for myself: which books come up most often across the podcasts I listen to?

The pipeline is straightforward at a high level: transcribe episodes with faster_whisper running locally on an RTX 3060, run the text through GPT-5-mini to pull out structured book mentions, and store everything in Azure SQL and Blob storage. The frontend is a small Flask app using HTMX, Tailwind, and some D3 for the visualizations.
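As a rough illustration of that flow, here is a minimal sketch of the three stages wired together. Everything in it is hypothetical: the `BookMention` fields, the `run_pipeline` name, and the three callables stand in for the real transcription step (faster_whisper), the LLM extraction call, and the database write, none of which are shown in this post.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class BookMention:
    """One extracted mention (hypothetical field names)."""
    episode_id: str
    title: str
    author: str
    context: str  # transcript snippet around the mention


def run_pipeline(
    episodes: Iterable[Tuple[str, str]],            # (episode_id, audio_path)
    transcribe: Callable[[str], str],               # audio path -> transcript text
    extract: Callable[[str], List[Dict[str, str]]], # transcript -> raw mention dicts
    store: Callable[[BookMention], None],           # persist one mention
) -> int:
    """Run transcribe -> extract -> store for each episode; return mentions stored."""
    stored = 0
    for episode_id, audio_path in episodes:
        transcript = transcribe(audio_path)
        for raw in extract(transcript):
            mention = BookMention(
                episode_id=episode_id,
                title=raw["title"],
                author=raw.get("author", ""),
                context=raw.get("context", ""),
            )
            store(mention)
            stored += 1
    return stored
```

Passing the stages in as plain callables keeps the sketch runnable without a GPU or an API key: each one can be replaced by a stub during testing and by the real implementation in production.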

The part that turned out to be much more time-consuming than expected was deduping. Everything else scaled nicely, but normalizing book titles is still the one piece I can’t fully automate without quality drifting. Fuzzy matching gets you most of the way, but the long tail of book names is huge. I ended up building a tiny internal Flask UI just to confirm or split fuzzy matches by hand; it also lets me review the context around each book mention to check accuracy. It's the only place in the system where a human is still in the loop.
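To make the "fuzzy match, then human review" idea concrete, here is a minimal sketch using only Python's standard-library `difflib`. It is not the actual dedupe logic from this project. The normalization rules, the two thresholds, and the function names are all assumptions chosen for illustration.

```python
import re
from difflib import SequenceMatcher
from typing import List, Optional, Tuple

# Assumed thresholds: above AUTO_MERGE the pair is merged automatically,
# between the two values it goes to a human, below NEEDS_REVIEW it is distinct.
AUTO_MERGE = 0.92
NEEDS_REVIEW = 0.75


def normalize_title(title: str) -> str:
    """Lowercase, strip punctuation, drop a leading article, collapse spaces."""
    t = re.sub(r"[^a-z0-9 ]+", " ", title.lower())
    t = re.sub(r"\s+", " ", t).strip()
    return re.sub(r"^(the|a|an) ", "", t)


def classify_pair(a: str, b: str) -> Tuple[str, float]:
    """Return ("merge" | "review" | "distinct", similarity score)."""
    score = SequenceMatcher(None, normalize_title(a), normalize_title(b)).ratio()
    if score >= AUTO_MERGE:
        return "merge", score
    if score >= NEEDS_REVIEW:
        return "review", score
    return "distinct", score


def triage(new_title: str, canonical: List[str]) -> Tuple[str, Optional[str], float]:
    """Compare a new title against known titles and report the best candidate."""
    best_title, best_score, best_decision = None, 0.0, "distinct"
    for candidate in canonical:
        decision, score = classify_pair(new_title, candidate)
        if score > best_score:
            best_title, best_score, best_decision = candidate, score, decision
    if best_decision == "distinct":
        return "distinct", None, best_score
    return best_decision, best_title, best_score
```

With these assumed thresholds, `triage("Thinking Fast & Slow", ["Dune", "Thinking, Fast and Slow"])` lands in the "review" band rather than merging automatically, which is the kind of pair a manual confirm-or-split screen would show. The middle band is the design choice that matters: widening it costs reviewer time, narrowing it risks silently merging different books.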

A few other unexpected issues came up: some podcast RSS feeds randomly duplicate or link to broken episodes, CUDA can crash if I’m not careful with garbage collection between Whisper runs, and LLM extraction occasionally fails if the model doesn’t return exactly the JSON shape I expect.
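For the JSON-shape failure in particular, one common mitigation is to validate the model's reply strictly and retry on a mismatch. The sketch below assumes a reply shaped like `{"books": [{"title", "author", "context"}]}`; that shape, the `BadExtraction` exception, and the `call_model` callable are hypothetical and not taken from the actual pipeline.

```python
import json
from typing import Callable, Dict, List

# Assumed schema: each extracted book must carry these string fields.
REQUIRED_FIELDS = {"title": str, "author": str, "context": str}


class BadExtraction(ValueError):
    """Raised when the model reply is not the expected JSON shape."""


def parse_mentions(raw: str) -> List[Dict[str, str]]:
    """Parse and validate one model reply, or raise BadExtraction."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise BadExtraction(f"not valid JSON: {exc}") from exc
    if not isinstance(data, dict) or not isinstance(data.get("books"), list):
        raise BadExtraction('expected an object with a "books" list')
    mentions = []
    for i, item in enumerate(data["books"]):
        if not isinstance(item, dict):
            raise BadExtraction(f"books[{i}] is not an object")
        for field, kind in REQUIRED_FIELDS.items():
            if not isinstance(item.get(field), kind):
                raise BadExtraction(f"books[{i}].{field} is missing or not a string")
        mentions.append({field: item[field].strip() for field in REQUIRED_FIELDS})
    return mentions


def extract_with_retry(
    call_model: Callable[[str], str],  # transcript -> raw reply text (hypothetical)
    transcript: str,
    attempts: int = 3,
) -> List[Dict[str, str]]:
    """Call the model up to `attempts` times until the reply validates."""
    last_error = None
    for _ in range(attempts):
        try:
            return parse_mentions(call_model(transcript))
        except BadExtraction as exc:
            last_error = exc
    raise BadExtraction(f"gave up after {attempts} attempts: {last_error}")
```

Raising a dedicated exception keeps malformed replies out of the database and makes it easy to log which episodes needed retries.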

One surprising pattern emerged: the long tail is enormous. A handful of books are mentioned constantly, but thousands more appear exactly once.
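A quick way to quantify that shape, sketched with the standard-library `Counter`: the function name and the returned keys are hypothetical, and the example input below is toy data, not real numbers from the site.

```python
from collections import Counter
from typing import Dict, Iterable


def long_tail_summary(book_ids: Iterable[str], top_n: int = 10) -> Dict[str, float]:
    """Summarize how concentrated mentions are across books.

    `book_ids` holds one entry per mention, after dedupe.
    """
    counts = Counter(book_ids)
    total_mentions = sum(counts.values())
    singletons = sum(1 for c in counts.values() if c == 1)
    top_mentions = sum(c for _, c in counts.most_common(top_n))
    return {
        "distinct_books": len(counts),
        "total_mentions": total_mentions,
        # Share of books that were mentioned exactly once.
        "single_mention_share": singletons / len(counts) if counts else 0.0,
        # Share of all mentions that go to the top_n most-mentioned books.
        "top_n_mention_share": top_mentions / total_mentions if total_mentions else 0.0,
    }


# Toy input: one book mentioned three times, two books mentioned once each.
summary = long_tail_summary(["a", "a", "a", "b", "c"], top_n=1)
```

A high `single_mention_share` together with a high `top_n_mention_share` is the "few hits, enormous tail" pattern described above.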

If you want to see the current state of it, the reports and visualizations are here: https://www.mavensignal.com

Happy to answer anything about the pipeline, LLM prompting, dedupe logic, or the stack in general.