frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch
62•gpjt•6d ago

Comments

DeathArrow•13m ago
I think this is a very valuable exercise if you try to understand how LLMs work and if you have the time.
rvnx•10m ago
Having the money is really what you need, not time.

Nowadays training very powerful LLMs is easy because all the tooling, source-codes, training datasets, and teaching agents are available.

Having money is not, unless you are selling AI snake-oil type of companies.

contrast•6m ago
You seem to be talking about a production-grade model rather than building an LLM as an exercise? Or if not, why do you disagree with the article's example of building a small LLM for $100?
ducktective•9m ago
Are off-shelf GPUs (like one 3090) suitable for modern academic research on current AI advancements or is it better to rent some cloud compute?

The Joy of Playing Grandia, on Sega Saturn

https://www.segasaturnshiro.com/2025/11/27/the-joy-of-playing-grandia-on-sega-saturn/
47•tosh•2h ago•5 comments

LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch
65•gpjt•6d ago•6 comments

Torture Techniques from CIA Black Sites Were Used at Alligator Alcatraz

https://www.forever-wars.com/torture-techniques-from-cia-black-sites-were-used-at-alligator-alcat...
6•perihelions•24m ago•0 comments

No ARIA is better than bad ARIA

https://www.w3.org/WAI/ARIA/apg/practices/read-me-first/
68•robin_reala•6d ago•28 comments

Show HN: AlgoDrill – Interactive drills to stop forgetting LeetCode patterns

https://algodrill.io
10•henwfan•39m ago•0 comments

Epsilon: A WASM virtual machine written in Go

https://github.com/ziggy42/epsilon
55•ziggy42•1w ago•14 comments

Icons in Menus Everywhere – Send Help

https://blog.jim-nielsen.com/2025/icons-in-menus/
581•ArmageddonIt•16h ago•242 comments

The universal weight subspace hypothesis

https://arxiv.org/abs/2512.05117
292•lukeplato•11h ago•104 comments

Kroger acknowledges that its bet on robotics went too far

https://www.grocerydive.com/news/kroger-ocado-close-automated-fulfillment-centers-robotics-grocer...
182•JumpCrisscross•11h ago•160 comments

Manual: Spaces

https://type.today/en/journal/spaces
67•doener•11h ago•7 comments

Jepsen: NATS 2.12.1

https://jepsen.io/analyses/nats-2.12.1
373•aphyr•16h ago•136 comments

Strong earthquake hits northern Japan, tsunami warning issued

https://www3.nhk.or.jp/nhkworld/en/news/20251209_02/
321•lattis•20h ago•148 comments

Microsoft increases Office 365 and Microsoft 365 license prices

https://office365itpros.com/2025/12/08/microsoft-365-pricing-increase/
395•taubek•21h ago•461 comments

A thousand-year-long composition turns 25 (2024)

https://longplayer.org/news/2024/12/31/a-thousand-year-long-composition-turns-25/
25•1659447091•4h ago•5 comments

AMD GPU Debugger

https://thegeeko.me/blog/amd-gpu-debugging/
253•ibobev•19h ago•46 comments

Horses: AI progress is steady. Human equivalence is sudden

https://andyljones.com/posts/horses.html
432•pbui•11h ago•335 comments

Launch HN: Nia (YC S25) – Give better context to coding agents

https://www.trynia.ai/
118•jellyotsiro•18h ago•75 comments

Has the cost of building software dropped 90%?

https://martinalderson.com/posts/has-the-cost-of-software-just-dropped-90-percent/
299•martinald•16h ago•438 comments

A deep dive into QEMU: The Tiny Code Generator (TCG), part 1

https://airbus-seclab.github.io/qemu_blog/tcg_p1.html
3•costco•6d ago•0 comments

Let's put Tailscale on a jailbroken Kindle

https://tailscale.com/blog/tailscale-jailbroken-kindle
290•Quizzical4230•19h ago•69 comments

Trials avoid high risk patients and underestimate drug harms

https://www.nber.org/papers/w34534
130•bikenaga•16h ago•39 comments

IBM to acquire Confluent

https://www.confluent.io/blog/ibm-to-acquire-confluent/
405•abd12•22h ago•325 comments

Periodic Spaces

https://ianthehenry.com/posts/periodic-spaces/
23•surprisetalk•5d ago•8 comments

Paramount launches hostile bid for Warner Bros

https://www.cnbc.com/2025/12/08/paramount-skydance-hostile-bid-wbd-netflix.html
329•gniting•21h ago•338 comments

The Lost Machine Automats and Self-Service Cafeterias of NYC (2023)

https://www.untappedcities.com/automats-cafeterias-nyc/
79•walterbell•10h ago•24 comments

Hunting for North Korean Fiber Optic Cables

https://nkinternet.com/2025/12/08/hunting-for-north-korean-fiber-optic-cables/
260•Bezod•19h ago•93 comments

Cassette tapes are making a comeback?

https://theconversation.com/cassette-tapes-are-making-a-comeback-yes-really-268108
103•devonnull•5d ago•164 comments

Show HN: Fanfa – Interactive and animated Mermaid diagrams

https://fanfa.dev/
119•bairess•4d ago•26 comments

AI should only run as fast as we can catch up

https://higashi.blog/2025/12/07/ai-verification/
172•yuedongze•18h ago•149 comments

Microsoft Download Center Archive

https://legacyupdate.net/download-center/
169•luu•3d ago•26 comments