Vintage Large Language Models

https://owainevans.github.io/talk-transcript.html
23•pr337h4m•4h ago

Comments

mountainriver•2h ago
Very cool! I’ve been wanting to do this for a long time!
nxobject•2h ago
I love the ideas about how we might use historical LLMs to inquire into the past!

I imagine (and the author hints at this) that doing this rigorously, with assumptions spelled out, would mean building on the theoretical frameworks that history and the social sciences already use to inductively synthesize and qualify interviews and texts.

abeppu•1h ago
The talk focuses for a bit on having pure data from before the given date. But it doesn't consider that the data available from before that time may be subject to strong selection bias, based on what's interesting to people doing scholarship or archival work after that date. E.g. have we disproportionately digitized the notes/letters/journals of figures whose ideas have gained traction after their death?

The article makes a comparison to financial backtesting. If you form a dataset of historical prices of stocks which are _currently_ in the S&P500, even if you only use price data before time t, models trained against your data will expect that prices go up and companies never die, because they've only seen the price history of successful firms.
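
A minimal sketch of the survivorship effect described above, assuming a toy market that is not taken from the talk or the thread: every firm follows a zero-drift random walk, and "current index membership" is approximated as the top half of firms by final price. The function names, the volatility of 0.05, and the cutoff `t` are all hypothetical choices for illustration.

```python
import random

def simulate_prices(n_firms, n_steps, seed=0):
    """Return price paths, each a zero-drift multiplicative random walk from 1.0."""
    rng = random.Random(seed)
    paths = []
    for _ in range(n_firms):
        price, path = 1.0, [1.0]
        for _ in range(n_steps):
            price *= 1.0 + rng.gauss(0.0, 0.05)  # hypothetical per-step volatility
            path.append(price)
        paths.append(path)
    return paths

def mean_return(paths, t):
    """Average return from step 0 to step t across the given paths."""
    return sum(p[t] / p[0] - 1.0 for p in paths) / len(paths)

paths = simulate_prices(n_firms=1000, n_steps=200)
t = 100  # training cutoff: only prices up to step t are used

# "Current index members": the 500 firms with the highest price at the final step.
survivors = sorted(paths, key=lambda p: p[-1], reverse=True)[:500]

all_mean = mean_return(paths, t)
survivor_mean = mean_return(survivors, t)
print(f"all firms: {all_mean:+.3f}  current members only: {survivor_mean:+.3f}")
```

Both averages use only prices up to step `t`, yet the subset chosen by its later outcome shows a higher average return than the full universe. The leak comes from which series were selected, not from the dates of the prices themselves.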

alalv•10m ago
It mentions that problem in the first section
ideashower•2m ago
I like the idea of using vintage LLMs to study explicit and implicit bias, e.g. comparing text from before the mid-19th century that endorses racial superiority, gender discrimination, imperial authority or slavery with text written since then. I'm sure more ideas emerge once you put temporal constraints on training data.

Heretic: Automatic censorship removal for language models

https://github.com/p-e-w/heretic
135•melded•2h ago•35 comments

FPGA Based IBM-PC-XT

https://bit-hack.net/2025/11/10/fpga-based-ibm-pc-xt/
48•andsoitis•2h ago•7 comments

Only three kinds of AI products work

https://www.seangoedecke.com/ai-products/
21•emschwartz•1h ago•15 comments

Brimstone: ES2025 JavaScript engine written in Rust

https://github.com/Hans-Halverson/brimstone
129•ivankra•6h ago•60 comments

De Bruijn Numerals

https://text.marvinborner.de/2023-08-22-22.html
30•marvinborner•2h ago•3 comments

AirPods libreated from Apple's ecosystem

https://github.com/kavishdevar/librepods
1074•moonleay•17h ago•309 comments

Running the "Reflections on Trusting Trust" Compiler

https://research.swtch.com/nih
79•naves•3h ago•2 comments

Garbage Collection Is Useful

https://dubroy.com/blog/garbage-collection-is-useful/
50•surprisetalk•4h ago•5 comments

Fourier Transforms

https://www.continuummechanics.org/fourierxforms.html
15•o4c•1w ago•2 comments

Anthropic's report smells a lot like bullshit

https://djnn.sh/posts/anthropic-s-paper-smells-like-bullshit/
578•vxvxvx•6h ago•187 comments

Measuring the doppler shift of WWVB during a flight

https://greatscottgadgets.com/2025/10-31-receiving-wwvb-with-hackrf-pro/
65•Jyaif•1w ago•0 comments

PgFirstAid: PostgreSQL function for improving stability and performance

https://github.com/randoneering/pgFirstAid
43•yakshaving_jgt•4h ago•2 comments

The Internet Is No Longer a Safe Haven

https://brainbaking.com/post/2025/10/the-internet-is-no-longer-a-safe-haven/
157•akyuu•4h ago•119 comments

Why use OpenBSD?

https://www.tumfatig.net/2025/why-are-you-still-using-openbsd/
104•akagusu•5h ago•58 comments

Production-Grade Container Deployment with Podman Quadlets – Larvitz Blog

https://blog.hofstede.it/production-grade-container-deployment-with-podman-quadlets/index.html
22•todsacerdoti•3h ago•10 comments

Iran begins cloud seeding operations as drought bites

https://www.arabnews.com/node/2622812/middle-east
92•mhb•4h ago•89 comments

Maybe you’re not trying

https://usefulfictions.substack.com/p/maybe-youre-not-actually-trying
278•eatitraw•7h ago•130 comments

IDEmacs: A Visual Studio Code clone for Emacs

https://codeberg.org/IDEmacs/IDEmacs
273•nogajun•17h ago•110 comments

Dissecting Flock Safety: The Cameras Tracking You Are a Security Nightmare [video]

https://www.youtube.com/watch?v=uB0gr7Fh6lY
36•emsign•2h ago•4 comments

Run Nix Based Environments in Kubernetes

https://flox.dev/kubernetes/
85•kelseyhightower•6d ago•23 comments

Things that aren't doing the thing

https://strangestloop.io/essays/things-that-arent-doing-the-thing
405•downboots•23h ago•189 comments

UK's first small nuclear power station to be built in north Wales

https://www.bbc.com/news/articles/c051y3d7myzo
125•ksec•7h ago•173 comments

Writing a DOS Clone in 2019

https://medium.com/@andrewimm/writing-a-dos-clone-in-2019-70eac97ec3e1
55•shakna•1w ago•18 comments

Alchemy

https://joshcollinsworth.com/blog/alchemy
17•tobr•6d ago•8 comments

Our investigation into the suspicious pressure on Archive.today

https://adguard-dns.io/en/blog/archive-today-adguard-dns-block-demand.html
1689•immibis•1d ago•419 comments

libwifi: an 802.11 frame parsing and generation library written in C (2023)

https://libwifi.so/
141•vitalnodo•19h ago•13 comments

Interactive Spectrum Chart

http://www.potatofi.com/posts/spectrum-viewer/
10•throw0101d•1w ago•4 comments

Owning a Cat Could Double Your Risk of Schizophrenia, Research Suggests

https://www.sciencealert.com/owning-a-cat-could-double-your-risk-of-schizophrenia-research-suggests
5•amichail•37m ago•0 comments

Boa: A standard-conforming embeddable JavaScript engine written in Rust

https://github.com/boa-dev/boa
263•maxloh•1w ago•67 comments