frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: RoastDB – A searchable database of 3,800 specialty coffee beans

https://roastdb.com
1•moabdelkader•1h ago
Hi HN! We built RoastDB (https://roastdb.com) – think Discogs for specialty coffee. A searchable index of beans from independent roasters around the world.

*The problem*

We got into specialty coffee gradually. Whenever we tried something we liked — a washed Colombian, a natural Ethiopian — we'd save the bag. At some point we had drawers of empty coffee bags we couldn't bring ourselves to throw away.

Our flow was simple: go to a cafe we liked, drink coffee, buy a bag of whatever they were roasting or stocking. Over time we started noticing patterns — we kept reaching for naturals, for East African origins, for anything with fruity notes. We'd try to seek out similar beans next time. Occasionally we'd fall in love with something and start reordering it online.

But discovery was limited to the few roasters we already knew. There was no easy way to find out that a roaster across town — or in another country — had something we were going to love. We knew great coffee existed out there. We just had no map.

So we built one. RoastDB currently indexes 3,800+ beans from 420+ roasters — and growing every week. Search by origin, process, variety, tasting notes. Save beans you want to try. When you find something, you buy directly from the roaster — we're a discovery engine, not a store.

*How it works*

The hardest part isn't the scraping — it's finding roasters worth indexing. We spend a lot of time hunting for quality third-wave roasters: browsing coffee forums, following competition results, exploring roasters in new cities. The selection is the real work.

Once we've found a roaster, the pipeline runs on a €5/month Hetzner VPS:

1. Scrapers fetch product pages from roaster websites

2. LLMs extract structured data (origin, variety, processing, price, tasting notes)

3. Normalization cleans up inconsistencies ("Äthiopien" → "Ethiopia", "84,25" → 84.25)

4. Non-English descriptions get translated

5. Deduplication scores beans and merges duplicates

6. Human review via an admin dashboard before publishing

The scrapers rerun weekly with content hashing — we only re-extract pages that actually changed, which keeps the data fresh without burning through API costs.

We built an internal tool that gamifies the review process, making it easier to keep up with new beans. And we control the whole pipeline through a Telegram bot — kick off scrapes, approve costs, get notified of failures, all from our phones.

The web app is Next.js + SQLite. The database file is ~15MB and serves directly from disk, no complexity.

*Feedback welcome*

- Roasters we should add (especially outside Europe)

- Filter combinations that would be useful

- Anything broken or confusing

Ask HN: Why is the $0 hijacking of intellectual labor so normalized in OSS?

1•fumi2026•26s ago•0 comments

My Rube Goldberg RSS Pipeline

https://taoofmac.com/space/blog/2026/01/17/2130
1•rcarmo•40s ago•0 comments

Global trust crisis deepfakes AI

https://techfusiondaily.com/global-trust-crisis-deepfakes-ai/
1•nelkazzu•2m ago•0 comments

Ask HN: How AliExpress gets its recommendation as priority in Gmail?

1•RicoElectrico•2m ago•0 comments

Weight-loss drugs could save U.S. airlines more than $500M this year

https://www.latimes.com/business/story/2026-01-15/weight-loss-drugs-help-airlines
2•cwwc•2m ago•0 comments

A.I. and Burnout

https://petersobot.com/blog/on-ai-and-burnout/index.html
3•psobot•3m ago•0 comments

"This is the way" parody Bluesky posts

https://bsky.app/profile/shengokai.blacksky.app/post/3mcndjl5hw22w/quotes
1•mooreds•5m ago•1 comments

Texas A&M university is banning Plato, citing his "gender ideology"

https://lithub.com/texas-am-is-banning-plato-citing-his-gender-ideology/
3•Geekette•7m ago•1 comments

Has AI removed the appeal of vertical SaaS?

https://www.elliotcsmith.com/has-ai-removed-the-appeal-of-vertical-saas/
1•smitec•10m ago•0 comments

Pittsburgh researchers developing lifesaving robot "dogs"

https://www.cbsnews.com/news/pittsburgh-researchers-carnegie-mellon-life-saving-robot-dogs/
1•rolph•10m ago•0 comments

Show HN: Potatoverse, Platform for Apps

https://github.com/blue-monads/potatoverse
5•born-jre•12m ago•1 comments

The Engineering Management Myths Star Trek Teaches (2022)

https://philipotoole.com/star-trek-made-me-a-bad-engineering-manager/
4•otoolep•15m ago•0 comments

Partial reprogramming as a strategy for aging and disease

https://www.sciencedirect.com/science/article/pii/S1568163726000012
1•jjoe•16m ago•0 comments

DIY Potato Aerogel: Free Cooling from Your Kitchen? [video]

https://www.youtube.com/watch?v=J87Qyxzm_fQ
1•znpy•17m ago•0 comments

The EU, spurred by Trump, to sign mega free-trade deal with South America

https://www.washingtonpost.com/world/2026/01/17/eu-trade-deal-mercosur-south-america/
4•stopbulying•19m ago•3 comments

Auto start Next.js dev server when you open the project in VSCode

https://marketplace.visualstudio.com/items?itemName=bullptr.nextjs-code
1•bukharim96•19m ago•0 comments

Show HN: School/ಶಾಲೆ – Agentic Voice Tutor for Students

1•gaganyatri•26m ago•0 comments

Apache Arrow for the Database

https://dataengineeringcentral.substack.com/p/apache-arrow-for-the-database
1•tanelpoder•27m ago•0 comments

New fintech company claims Bitcoin will reach 100M by 2050

https://www.distributedledgertechnologies.com/
1•winnertakeall•28m ago•1 comments

Vibe Coding the Port – I gave up waiting for engine exports

https://benwiser.com/blog/Vibe-Coding-the-Port---I-gave-up-waiting-for-engine-exports.html
1•RupertWiser•32m ago•0 comments

The Computer Chronicles – Artificial Intelligence (1984) [video]

https://www.youtube.com/watch?v=_S3m0V_ZF_Q
2•usernamed7•34m ago•0 comments

Light Mode InFFFFFFlation

https://willhbr.net/2025/10/20/light-mode-infffffflation/
45•Fudgel•35m ago•1 comments

Show HN: Pylot – Jobs API at 1/30th the cost of Coresignal

https://pylothq.com/
1•yadavrh•35m ago•2 comments

Chorex: Restartable, Language-Integrated Choreographies (2025)

https://programming-journal.org/2025/10/20/
1•mpweiher•36m ago•0 comments

Flux 2 Small, from BFL: AI image generation on consumer GPUs

https://the-decoder.com/flux-2-small-brings-ai-image-generation-and-editing-to-consumer-graphics-...
1•dreadsword•42m ago•2 comments

The Bitter Lesson of Agent Frameworks

https://twitter.com/i/status/2012052139384979773
3•pongogogo•45m ago•1 comments

Why Ukraine's Deadly Drone Operation Runs Like a 'McDonald's'

https://www.youtube.com/watch?v=9hzIMI2DLys
4•doener•48m ago•0 comments

Lord of War, Meet Lord of Tokens

https://singhkays.com/blog/lord-war-test-image-models/
1•gmays•48m ago•0 comments

Trump writes partisan plaques for predecessors, fronts gold [video]

https://www.youtube.com/watch?v=y-5ZsaPHv5o
2•stopbulying•48m ago•1 comments

NASA moves Artemis II rocket to launchpad ahead of mission to the moon

https://www.upi.com/Top_News/US/2026/01/17/nasa-rocket-launchpad-moon/7931768668982/
1•bookmtn•52m ago•0 comments