frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Open database of link metadata for large-scale analysis

https://github.com/rumca-js/RSS-Link-Database-2025
14•renegat0x0•5d ago
I would like to share an open database focused on link-level metadata extraction and aggregation, which may be of interest to researchers.

The project maintains a structured dataset of links enriched with metadata such as:

- page title

- description / summary

- publication date (when available)

- thumbnail / preview image

- etc.

The goal is to provide a reusable, inspectable set of link metadata that can be used for experiments in areas such as:

- RSS and feed analysis

- news analysis

- link rot analysis?

The database is publicly available here:

https://github.com/rumca-js/RSS-Link-Database-2025

There are also databases for previous years

Comments

Aherontas•19h ago
Curious how you handle feed evolution over time. When an RSS source changes structure (fields added/removed, summaries truncated, etc.), do you normalize to a fixed schema or store the raw payload alongside a best-effort normalized version? Longitudinal datasets tend to get tricky there.

Do not mistake a resilient global economy for populist success

https://www.economist.com/leaders/2026/01/08/do-not-mistake-a-resilient-global-economy-for-populi...
48•andsoitis•42m ago•13 comments

Why I left iNaturalist

https://kueda.net/blog/2026/01/06/why-i-left-inat/
159•erutuon•6h ago•68 comments

MIT's "Mathematics for Computer Science" (2018) [pdf]

https://courses.csail.mit.edu/6.042/spring18/mcs.pdf
6•vismit2000•19m ago•0 comments

How to Code Claude Code in 200 Lines of Code

https://www.mihaileric.com/The-Emperor-Has-No-Clothes/
447•nutellalover•11h ago•172 comments

Embassy: Modern embedded framework, using Rust and async

https://github.com/embassy-rs/embassy
167•birdculture•8h ago•63 comments

The No Fakes Act Has a "Fingerprinting" Trap That Kills Open Source

https://old.reddit.com/r/LocalLLaMA/comments/1q7qcux/the_no_fakes_act_has_a_fingerprinting_trap_t...
97•guerrilla•2h ago•34 comments

Sopro TTS: A 169M model with zero-shot voice cloning that runs on the CPU

https://github.com/samuel-vitorino/sopro
209•sammyyyyyyy•10h ago•82 comments

On Getting Hacked

https://ahmeto.com/post/on-getting-hacked
26•ahmetomer•3d ago•12 comments

Anthropic blocks third-party use of Claude Code subscriptions

https://github.com/anomalyco/opencode/issues/7410
261•sergiotapia•3h ago•180 comments

Hacking a Casio F-91W digital watch (2023)

https://medium.com/infosec-watchtower/how-i-hacked-casio-f-91w-digital-watch-892bd519bd15
46•jollyjerry•4d ago•10 comments

Bose has released API docs and opened the API for its EoL SoundTouch speakers

https://arstechnica.com/gadgets/2026/01/bose-open-sources-its-soundtouch-home-theater-smart-speak...
2243•rayrey•16h ago•329 comments

Richard D. James aka Aphex Twin speaks to Tatsuya Takahashi (2017)

https://web.archive.org/web/20180719052026/http://item.warp.net/interview/aphex-twin-speaks-to-ta...
151•lelandfe•10h ago•42 comments

The Unreasonable Effectiveness of the Fourier Transform

https://joshuawise.com/resources/ofdm/
203•voxadam•12h ago•90 comments

The Jeff Dean Facts

https://github.com/LRitzdorf/TheJeffDeanFacts
447•ravenical•18h ago•162 comments

Mysterious Victorian-era shoes are washing up on a beach in wales

https://www.smithsonianmag.com/smart-news/hundreds-of-mysterious-victorian-era-shoes-are-washing-...
17•Brajeshwar•3d ago•1 comments

Show HN: Executable Markdown files with Unix pipes

42•jedwhite•4h ago•36 comments

AI coding assistants are getting worse?

https://spectrum.ieee.org/ai-coding-degrades
287•voxadam•16h ago•448 comments

He was called a 'terrorist sympathizer.' Now his AI company is valued at $3B

https://sfstandard.com/2026/01/07/called-terrorist-sympathizer-now-ai-company-valued-3b/
154•newusertoday•13h ago•178 comments

Google AI Studio is now sponsoring Tailwind CSS

https://twitter.com/OfficialLoganK/status/2009339263251566902
603•qwertyforce•12h ago•199 comments

Ushikuvirus: Newly discovered virus may offer clues to the origin of eukaryotes

https://www.tus.ac.jp/en/mediarelations/archive/20251219_9539.html
91•rustoo•1d ago•16 comments

Logistics Is Dying; Or – Dude, Where's My Mail?

https://lagomor.ph/2026/01/logistics-is-dying-or-dude-wheres-my-mail/
36•ChilledTonic•5h ago•17 comments

Systematically Improving Espresso: Mathematical Modeling and Experiment (2020)

https://www.cell.com/matter/fulltext/S2590-2385(19)30410-2
23•austinallegro•6d ago•7 comments

Fixing a Buffer Overflow in Unix v4 Like It's 1973

https://sigma-star.at/blog/2025/12/unix-v4-buffer-overflow/
109•vzaliva•12h ago•33 comments

Show HN: macOS menu bar app to track Claude usage in real time

https://github.com/richhickson/claudecodeusage
111•RichHickson•13h ago•40 comments

Show HN: A geofence-based social network app 6 years in development

https://www.localvideoapp.com
56•Adrian-ChatLocl•10h ago•36 comments

Pole of Inaccessibility

https://en.wikipedia.org/wiki/Pole_of_inaccessibility
47•benbreen•5d ago•10 comments

Mux (YC W16) is hiring a platform engineer that cares about (internal) DX

https://www.mux.com/jobs
1•mmcclure•10h ago

Digital Red Queen: Adversarial Program Evolution in Core War with LLMs

https://sakana.ai/drq/
114•hardmaru•15h ago•14 comments

Making Magic Leap past Nvidia's secure bootchain and breaking Tesla Autopilots

https://fahrplan.events.ccc.de/congress/2025/fahrplan/event/making-the-magic-leap-past-nvidia-s-s...
55•rguiscard•1w ago•13 comments

I used Lego to design a farm for people who are blind – like me

https://www.bbc.co.uk/news/articles/c4g4zlyqnr0o
119•ColinWright•3d ago•49 comments