frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

A Herculaneum scroll has been read for the first time

https://scrollprize.org/firstscroll
359•verditelabs•3h ago

Comments

verditelabs•3h ago
I am on the vesuvius challenge team that did the segmentation, unwrapping, and ink detection, so feel free to ask any questions.
helterskelter•2h ago
Given the current rate of progress, how long do you think it will take to decipher the entire collection?
verditelabs•1h ago
That's a tough one to give a strong estimate of. Some scrolls are easier or harder to unwrap and read for a multitude of different reasons, mostly due to how damaged the scroll was in the eruption, and how easy or not the ink is to read. IIRC from what we've scanned of the herculaneum collection, none of the ink is easily visible via spectrum alone, so we have to use a lot of ML and physically based rendering techniques to be able to find ink. That also requires unwrapping and segmentation _before_ any ink detection.

For iron gall ink with high enough iron concentration, the ink stands out in the xray volume through simply masking off low values, such as was shown in our campfire scroll experiment a few years ago. No herculaneum scrolls show similar ink.

helterskelter•1h ago
Thanks!
pimlottc•1h ago
Do you think this particular scroll is easier or harder to read that the others will be? Or about average?
verditelabs•1h ago
Pherc1667 was quite small and just so happened to have readable ink, so it was easier than I expect most others to be.
superjan•1h ago
Do we known what ink is used?
verditelabs•55m ago
Most of the evidence so far points towards carbon based ink. I am not sure if any of the scrolls we have scanned show strong evidence of iron gall based ink. I know that there are different types and preparation methods for different carbon based inks, but I do not know if it is possible to determine which kind(s) were used solely from inspecting the xrays.

I am, though, not a papyrologist, so historical ink making, preparation, and usage are not my field.

jimbob45•2h ago
Are the fragments destroyed in ‘69 and ‘80 available to be read similarly? Or were they disposed of?
verditelabs•1h ago
I am unaware of those fragments in particular. Though we have scanned a dozen or so fragments, mostly to help guide ink detection, since the ink in them is often more visible in visible and/or near IR light, but can be hard to impossible to detect in the xray spectrum.
adriand•1h ago
What are the wildest, most exciting but plausible things that might be discovered in these documents?
verditelabs•1h ago
I am not a papyrologist or a classicist, rather I'm a computer scientist, so my expertise is unfortunately not in _what_ the scrolls say, rather how we get there. That being said I think and hope that there will be a trove of things that has no known provenance at all, completely lost works that elude the public memory.
arikrahman•40m ago
Well what were your first thoughts when you decoded the script, besides the obvious Eureka, after making some sense of the texts?
readthenotes1•21m ago
Your response reminds me of Nigel Richards :)

https://en.wikipedia.org/wiki/Nigel_Richards

Congratulations, and thank-you!

suddenlybananas•1h ago
Probably a lot more texts of Epicurean philosophy and not a whole lot else unfortunately according to my papyrologist friend.
cwmoore•1h ago
Why would Epicurean philosophy be unfortunate?

I was under the impression that there was almost nothing left of that school of thought, and that it’s writings had been destroyed.

What would you like to have instead?

echelon•1h ago
Did anyone on the team come from a non-science, non-math, non-academia background? Did anyone working on this just teach themselves and start contributing?
verditelabs•1h ago
Yes. Sean, who was a co-winner of the 2024 prize, IIRC has no formal background in ML, computer science, AI, etc. He is one of our core researchers and the most productive team member.
fintechjock•1h ago
I've been on the Discord for a couple of years now, and poking around with submissions as well. Sean and the entire team deserve so much praise for all of this work.

It's easy to just read about the breakthrough and see it as one neat, linear line to get there, and hard to comprehend the hours, months and years that so many spent to get there. Big congrats to you, Sean, Nat and the entire team!

tsol•1h ago
How do get to do that? As in what did you study to get the prerequisite knowledge, and how did you find this particular job? When I see interesting jobs I'm anyways curious what path lead there
verditelabs•1h ago
I am a computer scientist. I studied CS in university, worked in the semiconductor industry for a while, got started as a participant in the challenge aspect of the Vesuivus Challenge. They were hiring, I sent in an application, interviewed, and was offered the job.
matneyx•30m ago
That last sentence is so perfect, like my dad answering the question of how he lost weight. "I ate less and exercised more."
inglor_cz•1h ago
I don't have any questions, just a comment.

You have a potential to rewrite the history of European Antiquity quite substantially. The Herculaneum set of scrolls is enormous and must contain a lot of hitherto unknown.

That comes with a set of peculiar risks. Once your work starts producing something that contradicts previous work of Very Important People, they will lobby to stop you. Be prepared for that.

Science should be neutral and always value new evidence. Scientists as humans are unfortunately subject to all sorts of passions.

TheOtherHobbes•1h ago
No questions, but I just want to say this is really exciting work!
Dzugaru•1h ago
Outstanding work! I've participated in the challenge, but didn't get far. One of the questions I had at the time was - if I'm going to use ML to detect ink, could it invent hallucinated letters, or even parts of text, and how to prevent that?
verditelabs•1h ago
Yes, it's quite possible for ML to hallucinate ink, though it is on a much more local scale, like predicting a slightly longer stroke, filling in more of a character than is actually in the data, etc. Perhaps enough to change a reading of a character or show where ink isnt. It is difficult for ink detection to hallucinate grammatical and idiomatic greek and latin.
im3w1l•1h ago
What is the input to the ML algorithm? Does it know the surrounding context so that it has a chance to deduce "if this stroke is slightly longer then the end result will be idiomatic greek and latin"?
verditelabs•1h ago
The input is 3d chunks of reconstructed CT data from our scans. I can't remember the specifics but maybe enough voxels for .5mm^3 at a time or so? They're all available for free from https://registry.opendata.aws/vesuvius-challenge-herculaneum... . Our trained models are all available at https://huggingface.co/scrollprize
cwnyth•57m ago
BiraIgnacio•52m ago
Amazing work, fantastic!
tomcam•40m ago
Absolutely incredible work. This is one of the most amazing news articles I’ve encountered in decades. Congratulations team!
temp987•37m ago
this is überragend. by many means!
2ap•18m ago
I'm interested to know about the approaches that you tried with the ML, and then decided to not use. In practice, the options are so many. How did you come up with the final approach - and was there a systematic way to decide which options to go for?
verditelabs•5m ago
I am not on the research team, rather on the production side of things, so my knowledge on that is pretty limited. I think one of the main takeaways from a lot of the research, though, on both the segmentation side and the ink detection side, is that it's a lot less about what models and techniques and such you use, but how good your training data is. Gathering ground truth is hard, and if you don't have a lot of good ground truth, it doesn't matter if your code is perfect, you'll never get results.
NooneAtAll3•5m ago
how many scrolls have been scanned so far? what's the main limitation on scan amount?

have any attempts (or just ideas) been made to recreate such charring on known texts?

suddenlybananas•1h ago
Scrolls from Herculaneum have been read for a very long time. Not disputing the achievement of digitally unrolling one, but the scrolls from the library of have been studied since the 18th century.
tokai•1h ago
Sure, but its the potential scale that is important. There are also more scrolls still in the ground, which would make sense to dig out if they could be read.
suddenlybananas•1h ago
Of course! But the title is misleading and gives people the impression that we don't already know the library is just full of Epicurean texts.
verditelabs•1h ago
I think it's a case of HN once again butchering the title. I submitted it as the exact title from our page on scrollprize.org, "An _Entire_ Herculaneum Scroll Has Been Read For The First Time", which is IIRC true.
IAmBroom•1h ago
It's also technically incorrect. The texts have been read; this particular text was read for the first time in the modern era.
tokai•1h ago
I'm really hoping that the library contains some lost older Greek works. But its going to be awesome what ever we find.
helterskelter•1h ago
I'm hoping for a complete(ish) Heraclitus. Also Eratosthenes, whose methods have been described but we don't have the original work where we calculated the circumference of the Earth. Also Hipparchus and Thales.
annodomini2019•51m ago
My pick would easily be the missing books of ab urbe condita by Livy, so much early Roman history that would be wonderfully filled out for us
helterskelter•31m ago
Also, Aristarchus.
bobowzki•1h ago
Very impressive! I also highly recommend visiting Herculaneum.

A thought: I guess the days of scratch off lottery tickets are numbered?

cl3misch•1h ago
The tomography was done at a synchrotron (ESRF), and with beamtime being very expensive it would be a net-negative to scan lottery tickets, unfortunately...
verditelabs•1h ago
Fortunately for anyone wanting to xray lottery tickets, you don't need the IIRC most powerful beamline in the world. A few years ago a Vesuvius Challenge Community member bought a benchtop xray machine for a few grand and scanned pokemon cards and was able to identify them that way.
roflmaostc•1h ago
I found once super old books in our lab (like hundreds of years) and was wondering what they were used for.

Apparently they did CT scans of closed books and read the content. Polevoy, Dmitry V., et al. "From tomographic reconstruction to automatic text recognition: the next frontier task for the artificial intelligence." Fifteenth International Conference on Machine Vision (ICMV 2022). Vol. 12701. SPIE, 2023. https://iris.unive.it/bitstream/10278/3687069/1/Albertin_et-...

So yeah, but lottery companies probably make it harder by engineering against it.

kilroy123•1h ago
For me, this is one of the most exciting things being done with AI right now. (This and medical research)

I'm kind of obsessed with the ancient world. I dream of being able to read entire pages of new text from ~2,000 years ago.

clickety_clack•1h ago
When I read translations like these, I always wonder if the tone is translated. Did the writer mean to convey a very formal “to the utmost”, or was it a more casual “to the max”.

How much of the translators bias makes these seem like academic papers instead of social media posts.

dylan604•1h ago
Sending a tweet is free and takes zero thought to make it (as the vast majority of tweets prove). Writing something on a scroll would take a lot of effort and would not be free. If these were tweet level content in the scrolls, I'd have to totally reevaluate a lot of things to the point I might as well just become MAGA
colechristensen•1h ago
Sometimes there is very little to go on, but we really do have a lot to work with from the late republic and early roman empire.

Latin is also a very rich language and this is no snippet.

Translation is always hard, especially from a couple thousand years ago BUT this kind of translation comes with a lot of confidence.

hyhatqtv•36m ago
It’s in Greek, though. Of course same points apply
forshaper•1h ago
This is why I like literal translations & etymological dives, paired with asking what activities would constitute a life in that time. Ie, you may not need to be a competent archer, but it is a little easier to understand someone who used a particular style of bow if you can play around with that type of bow for a bit.
9dev•1h ago
Every time you feel depressed by the state of tech, and how so many intelligent people seem to work on forcing ever more ads down people's throats (a common trope around these parts), remember that projects like this do exist too!

There are lots of very smart folks working on incredible things, they just aren't as loud.

giancarlostoro•1h ago
This isnt the only incredible thing though, AI is being used to make discoveries in the medial field, and even to prevent sepsis related deaths, cutting down on them by detecting sepsis sooner. There was another that discovered the gene for Alzheimers is what activates it not just a sign of it.
verditelabs•50m ago
There is a large overlap in what we are doing with the medical field as well. A lot of the segmentation methodology and technology we use and adapt originally came out of the medical field for doing things like brain imaging.
_verandaguy•1h ago
I imagine it's not the first time, It must've at least been proofread at the time of writing :)

But really impressive stuff! Between this and (a particularly optimistic outlook on) the Linear-A news from the other week this is an exciting time for linguistics.

josefritzishere•1h ago
This is huge, we're about to learn so much about ancient texts.
lanthissa•1h ago
The person who wrote this was was closer in time to the technology that was able to unwind and read burned fragments of their text, than the technology that build the pyramids. pretty wild to think about.
sevenzero•1h ago
>technology that build the pyramids

You mean ropes and carts?

inglor_cz•53m ago
The stones were cut with enormous precision, at least relative to what we know about the available cutting tools. You cannot still stick a knife between a lot of these stones. Maybe we will learn more about that.
vitally3643•21m ago
I'm pretty sure we've conclusively answered these questions. Hand tools, skill, and absolutely unreasonable amounts of time and patience.

Any master stoneworker from any era should be able to carve stone to that level of precision given enough time and reason. The problem, as always, is that there is usually very little reason to put in that amount of time and effort when you can get 90% as good for 50% the effort.

cwillu•1h ago
Link to the image: https://scrollprize.org/img/firstscroll/banner-full.webp
INTPenis•1h ago
But wait, the work seems to be from the 2nd century, but it was buried during the Vesuvius eruption in the 1st century?

I love stuff like this because it gives a glimpse into Roman society. To me it seems like they were very similar to us today, forever contemplating learning, existence, gods.

verditelabs•1h ago
> places it in a Stoic context and dates it to the 2nd century _BC_.

Emphasis mine.

mattbettinson•1h ago
I wonder what the parellel would be 2,000 years for now:

A Post-Great Solar Flare of 2484 Step Brothers DVD Has Been Decoded

Waterluvian•29m ago
We have successfully uncorrupted audiovisual media of what we believe to be an oral retelling of the long lost ending to Chekhov‘s “The Three Sisters.” It turns out the light was on.
shevy-java•51m ago
Kind of cool. The eruption sort of "froze" some information in time, for later generations to learn from people living ~2000 years in the past.
charcircuit•45m ago
I thought we were able to read some of these scrolls years ago?
tern•40m ago
> "…we will inquire into something, but we will not grasp it, if in some way we depart from ourselves and from our own nature…"

Beautifully ironic, that we find this message.

proee•39m ago
Only about 20% of the Herculaneum site has been excavated, so there is high probability that more scrolls exist. The current scrolls were not part of the main library, but more of a private collection at the time.

So imagine how cool it would be to find a full library with thousand of scrolls across many different topics, that can now be read with this technology.

bambax•18m ago
This could eventually completely transform our understanding of Antiquity. It is estimated that only around 1% of the ancient works in Greek and Latin have survived to the present day, much less in other languages such as Punic [0]. Some works and some authors we only know by name because they were alluded to in later texts.

It's also well known that surviving texts survived because they were copied again and again on costly animal skin during the Middle Ages, by monks who had to make a choice and naturally favored topics that were of most interest to them.

This could quite literally change everything.

[0] https://talesoftimesforgotten.com/2021/09/25/are-there-more-...

cortesoft•25m ago
This is so cool. I feel like it is almost a victory against entropy!
hasteg•25m ago
So far this is some of the best uses of ML I've seen to date! This is one of the few things you can point at and say "AI made the world a better place" IMO (this and medical research).
ur-whale•19m ago
A scroll has been read ... what does it say ?
HarHarVeryFunny•13m ago
This is technology verging on witchcraft!

Amazing!

cwnyth•56m ago
The unfortunate part is the lack of anything else therein, not that it's Epicurean philosophy.
ogogmad•20m ago
The Jewish Talmud uses Epicurus's name as a term meaning "heretic".
Matticus_Rex•22m ago
That's what was thought, but maybe not -- only one of the three so far looks Epicurean, which is not what was expected. Maybe it's a fluke, but historians are buzzing a bit about whether it might be broader than expected.
colechristensen•1h ago
Here's a list. The scrolls are from a library that burned in 79 AD.

https://en.wikipedia.org/wiki/List_of_lost_literary_works

kouru225•42m ago
Woah there was a lost Homer epic comedy about a bumbling fool named Margites?
Not all machine learning is generative AI.
mc32•55m ago
True but like regular document scanning software there can be errors in detection.
dleeftink•37m ago
Just as with redacted documents (consistently blocked terms) or bad OCR jobs (wrong or missing characters), even if only a certain percentage comes out unmangled it is more readable than having no data at all.

A stable base corpus and some dynamic programming will allow you to clean up the remainder[0].

[0]: http://stackoverflow.com/a/11642687/2449774

adrian_b•1h ago
Any useful translation of an ancient text is accompanied by the text in the original language, so that the reader may assess how faithful is the translation.

For anyone who wants to read ancient texts, there are bilingual editions, for example those of the "Loeb library".

The translations that omit the original text are just for the people who want to have some idea about the content, but do not care about the correctness of the translation.

With a bilingual edition, it is easy to understand the original text even with relatively little knowledge about the original language.

The original text is important because frequently the translator is forced to introduce inaccuracies in the translation, because of the absence of exact equivalents in the target language, which would require a long explanation of the original meaning, instead of just a translated sentence.

Especially misleading are translations where several distinct ancient words are translated using the same English word, so some nuances are lost.

Equally confusing are the cases when the translator chooses to translate the same ancient word by different English words, because even if the meaning of a word may depend on the context, many translators fail to judge correctly the context, because they may lack specialized knowledge so their guesses are not necessarily better than of the readers who may be less competent in linguistics, but more competent in the science or technology needed to understand the context. Better translators prefer to use a one-to-one mapping between words, which makes it easier for the readers to discover the meaning intended by the ancient writer, after seeing multiple examples of usage.

charcircuit•48m ago
After sticking it into CharGPT I can tell you it's neither. The word upmost is coming from is a form of the compound verb ἐκπονέω.

* ἐκ- = “out,” “thoroughly,” “to the end”

* πονέω = “to labor,” “to toil,” “to work hard”

kridsdale3•39m ago
I trust a lifelong dedicated Ancient Greek Papyrologist to do a better job here than ChatGPT.
bibimsz•20m ago
let's translate the ancient classic poem Mugger's Paradise by the poet Somewhat Frosty:

While I step through the valley of the shadow of death,

I contemplate my life and perceive that nothing remains.

For I have hurled weapons and laughed for so long that

Even to my mother, my mind appears to have departed.

Yet I have deceived no one except him who was worthy of it;

For me to be held as a coward—that indeed is unheard of.

Beware what you speak and where you set out,

Lest you and your companions be outlined in chalk.

An entire Herculaneum scroll has been read for the first time

https://scrollprize.org/firstscroll
366•verditelabs•3h ago•90 comments

Oxide computer 3D rack guided tour

https://explorer.oxide.computer/
80•darthcloud•3d ago•27 comments

IBM debuts sub-1 nanometer chip technology

https://newsroom.ibm.com/2026-06-25-ibm-debuts-worlds-first-sub-1-nanometer-chip-technology
116•porridgeraisin•3h ago•68 comments

Zig's new bitCast semantics and LLVM back end improvements

https://ziglang.org/devlog/2026/#2026-06-25
157•kouosi•4h ago•49 comments

OS9Map

https://yllan.org/software/OS9Map/
91•LaSombra•4h ago•13 comments

Show HN: Chess-Inspired Roguelike

https://princechazz.com
57•cowboy_henk•4d ago•20 comments

Apple raises prices of MacBooks, iPads

https://www.reuters.com/world/asia-pacific/apple-raises-prices-macbooks-ipads-memory-costs-skyroc...
387•virgildotcodes•6h ago•597 comments

I built a GPU back end for Emacs

https://en.andros.dev/blog/4b707a03/how-i-built-a-gpu-backend-for-emacs/
99•andros•2d ago•50 comments

You can't unit test for taste

https://dev.karltryggvason.com/you-cant-unit-test-for-taste/
198•kalli•1d ago•83 comments

Show HN: I made Google Trends for Hacker News by indexing 18 years of comments

https://hackernewstrends.com
497•ytkimirti•5h ago•130 comments

Besimple AI (YC P25) Is Hiring

https://www.ycombinator.com/companies/besimple-ai/jobs/yWfhhOR-strategic-projects-lead-audio-data
1•yzhong94•2h ago

Half-Life 2 in a Browser

https://hl2.slqnt.dev/
588•panza•13h ago•236 comments

Early adversity leaves lasting molecular imprint across the body: primate study

https://medicalxpress.com/news/2026-06-early-life-adversity-molecular-imprint.html
34•gmays•4d ago•9 comments

52-hertz whale

https://en.wikipedia.org/wiki/52-hertz_whale
58•brightbeige•22h ago•7 comments

Tw-fade: pure CSS scroll-driven edge masking

https://pete.design/tw-fade
58•petekp•3d ago•20 comments

Physicists Track and Trap the Elusive Neutrino

https://www.quantamagazine.org/how-physicists-track-and-trap-the-elusive-neutrino-20260624/
33•ibobev•4h ago•4 comments

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion

https://github.com/inkeep/open-knowledge
22•engomez•3h ago•5 comments

Advanced Nintendo Entertainment System (ANES) – NES Modded to Use 2 PPUs

https://github.com/decrazyo/anes
40•zdw•1d ago•7 comments

Show HN: Turn native language audio into flashcards and shadowing practice

https://lingochunk.com/try
61•alder•7h ago•27 comments

Windows 10 quietly gets one more year of support and updates

https://www.neowin.net/news/windows-10-quietly-gets-one-more-year-of-support-and-updates/
138•bundie•3h ago•108 comments

The disappearance of Japan's animators

https://economist.com/interactive/1843/2026/06/19/the-strange-disappearance-of-japans-animators
81•andsoitis•4d ago•71 comments

LastPass notifies users of yet another data breach

https://9to5mac.com/2026/06/23/lastpass-notifies-users-of-yet-another-data-breach/
407•mooreds•8h ago•178 comments

Cloudflare launched self-managed OAuth for all

https://blog.cloudflare.com/oauth-for-all/
330•terryds•16h ago•141 comments

Political bias in AI: Where the AI models stand

https://trakkr.ai/bias
50•mektrik•6h ago•116 comments

How to get your first customers [video]

https://www.ycombinator.com/library/SF-how-to-get-your-first-10-customers
52•aurenvale•1d ago•16 comments

Show HN: Persona.js – a vanilla-JS agent UI library with native WebMCP (MIT)

https://github.com/runtypelabs/persona
14•becomevocal•4d ago•16 comments

Show HN: MiniPCs.zip – Charting the Pareto frontier of Mini PCs

https://minipcs.zip
71•yathern•4d ago•28 comments

Mixing Visual and Textual Code

https://arxiv.org/abs/2603.15855
66•doppioandante•18h ago•41 comments

The annotated PyTorch training loop

https://idlemachines.co.uk/essays/pytorch-training-loop
6•smaddrellmander•2d ago•0 comments

Blogging can just be stating the obvious

https://blog.jim-nielsen.com/2026/blogging-stating-the-obvious/
412•Curiositry•19h ago•122 comments