frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: I am building a map of people who lived in the Roman Empire

https://new.roman-names.com/
1•metiscus•1h ago
Driving home from work one day, I wanted to know how many people we knew the names of who lived during the Roman era. Searching around, I found lists of Consuls and officials, but nothing that covered ordinary people or even most people like freedmen and slaves. So I ended up building a pipeline to process the more than 500k Latin inscriptions in the Epigraphic Database Clauss-Slaby https://edcs.hist.uzh.ch/en/ and extract the names of people (and attempt to cluster them, but this is a work in progress).

There are databases where Classicists have done this manually for specific regions, Trismegistos https://www.trismegistos.org/ and Latin Inscriptions of the Roman Empire (LIRE) https://pure.au.dk/portal/en/publications/latin-inscriptions... are two major efforts I found. But there doesn't seem to be a project that did what I set out to do, although I have read in some places that it was believed to be possible.

I am not a classicist or a web developer, but I have Claude and Gemini and I can sort of read basic Latin - so I set to work. I used LIRE and another database as ground truth and built a pipeline to extract and process the inscriptions to recover the names. The process I developed uses a high end LLM like Sonnet or Gemini Pro to supervise the extraction and tuning process on a regional basis until the obvious error rate is reasonable. For this, so far, reasonable to me means less than 1-2% in the smaller initial samples of 100-500 and no observed systemic issues. The different regions often need different prompts, so this basically became an exercise in letting the higher level AI tune the prompt for the lower level AI. The extraction when measured against LIRE produces an F1 score between 0.64 and 0.87, but take this with a grain of salt.

Once I had done a few regions, I wanted to see the work, so I threw together a pretty crude website but as I am not a web developer, it was crude in how it accessed its data. It does look cool and I also added summarization, and machine translation to each entry. I wanted to eventually get feedback from an actual team of classicists and make the website work better, so I am rewriting it as we speak but it is broadly functional now with a few extra bugs but substantially improved performance compared to the old one. All entries link back to the proper sources, and the old web app linked to several additional sources where the data was present, but I haven't gotten that working again just yet on the new one. (The old web interface is still available at https://roman-names.com, but I will warn you it is clunky and not mobile friendly at all)

Key findings so far:

AI supervised AI extraction saved me time. I was manually tuning things for a while and then the runbook became an idea that I feed my instructions in and let the big AI go with sparse oversight from me.

The extraction improved significantly (by about 10 F1 points) when I fed the model the raw text including the markers, vs a cleaned up version of the text.

I just thought it was a cool little project and wanted to share. If you happen to work in any adjacent space and there is something I could do better etc let me know.

Back to the Blog Again

https://www.rahulakira.com/posts/back-to-the-blog-again
2•exochrono•4m ago•0 comments

Will the next high value profession be people who can think independently?

2•ciwolex•6m ago•0 comments

AI hackathons on Devpost feel repetitive and somewhat random in judging

https://gemini3.devpost.com
2•noahnathan25•8m ago•0 comments

BYD is deploying 2.4x more charging power per month than Tesla

https://electrek.co/2026/06/10/byd-coming-for-tesla-supercharger-network-1500-kw-flash-charging/
3•breve•9m ago•0 comments

Gesture-based proof of humanity – W3C DID, on-chain, no biometrics

https://homosapience.org/en
2•tulubyev•10m ago•0 comments

The Roy Lee/Clavicular Balance

https://ferguswhitedev.substack.com/p/the-roy-leeclavicular-balance
2•ferguswhite•10m ago•1 comments

JWST solves decades-long mystery about why Saturn appears to change its spin

https://phys.org/news/2026-03-jwst-decades-mystery-saturn.html
2•tkcashman•12m ago•0 comments

Everyone got excited they can suddenly code, and missed the point

https://kasperjunge.com/blog/should-pms-code-with-agents/
2•juunge•13m ago•0 comments

O'Reilly Animal Menagerie

https://www.oreilly.com/animals.csp
1•skogstokig•14m ago•0 comments

Story Lab

https://story-lab.ai/
1•Aftermidn8•17m ago•0 comments

China plans to spend $295B on AI buildout

https://www.bloomberg.com/news/articles/2026-06-09/china-prepares-295-billion-plan-to-fund-nation...
2•loandbehold•17m ago•0 comments

Plane Launch Week Gallery

https://plane.so/launch-week/q2-2026
1•bbor•17m ago•1 comments

Nearly Everyone, Everywhere, Veers Left When Walking

https://www.nytimes.com/2026/06/10/science/humans-walking-veer-left-counterclockwise.html
1•donohoe•20m ago•0 comments

The Device Paradigm [pdf]

https://web.cs.ucdavis.edu/~rogaway/classes/188/spring04/projects/5.pdf
1•burnto•20m ago•0 comments

Fable 5 creates full Swiss lever watch movement in Three.js

https://twitter.com/quanghuynt14/status/2064509430650065278
2•mhb•20m ago•0 comments

Nuts – pip/NPM for Java with first-class workspaces and JDK provisioning (9y+)

https://github.com/thevpc/nuts
1•thevpc•21m ago•0 comments

AI-review: reviewing AI code before it lands

https://www.jackfranklin.co.uk/blog/ai-review-plan/
1•mooreds•21m ago•0 comments

Whole Earth Garden

https://wholegarden.falso.net/
1•gdss•22m ago•0 comments

How to enter side doors: guide to jobs, cold emails, and making yourself legible

https://velvetnoise.substack.com/p/how-to-enter-side-doors
1•nowflux•22m ago•0 comments

Who runs the ransomware group 'The Gentlemen?'

https://krebsonsecurity.com/2026/06/who-runs-the-ransomware-group-the-gentlemen/
2•krebsonsecurity•23m ago•0 comments

T9 Texting Is Back on iPhone

https://apps.apple.com/us/app/t9-keyboard-text-like-its-04/id6765738931
1•ItsMeDavidV•23m ago•0 comments

Ongoing attempt to standardise a DOM Templating API in browsers

https://github.com/justinfagnani/dom-templating-api-proposal
1•llcooliovice•26m ago•0 comments

Show HN: Kctx – A read-only Kubernetes context engine for SREs and AI Agents

https://github.com/lucasepe/kctx
2•lucasepe•28m ago•0 comments

Anthropic's Self Governance Is an Act of Social Violence

https://cezarbabin.com/notes/anthropic-self-governance-is-an-act-of-social-violence.html
1•nibab•29m ago•1 comments

Google Liable for Hallucinations (In Germany)

https://garymarcus.substack.com/p/breaking-google-liable-for-hallucinations
1•PaulDavisThe1st•29m ago•1 comments

The maths behind a leopards spots

https://www.bbcearth.com/news/the-maths-behind-a-leopards-spots
2•marysminefnuf•32m ago•0 comments

Jumping spiders inspire ultra-efficient 3D camera

https://news.northwestern.edu/stories/2026/06/jumping-spiders-inspire-ultra-efficient-3d-camera
1•gmays•34m ago•0 comments

US stock market to stop shrinking for first time in 23 years

https://www.ft.com/content/f7dae4e1-d650-45ab-ac97-043c7a965d24
5•JumpCrisscross•35m ago•0 comments

Ask HN: What are your thoughts on your critical thinking abilities and AI?

2•ciwolex•36m ago•5 comments

Piano Learning App focused on Sight-reading

http://virtuoso.host.eco.br/app/
2•ltouro•36m ago•2 comments