The Big LLM Architecture Comparison

https://magazine.sebastianraschka.com/p/the-big-llm-architecture-comparison
103•mdp2021•5h ago

Comments

bravesoul2•4h ago
This is a nice catch-up for someone like me who hasn't been keeping up.
dmezzetti•2h ago
While all these architectures are innovative and have helped improve either accuracy or speed, the same fundamental problem of generating factual information still exists.

Retrieval Augmented Generation (RAG), Agents and other similar methods help mitigate this. It will be interesting to see if future architectures eventually replace these techniques.

tormeh•24m ago
To me, the issue seems to be that we're training transformers to predict text, which only forces the model to embed limited amounts of logic. We'd have to find something different to train models on in order for them to stop hallucinating.
bsenftner•2m ago
I'm still wondering: if RAG is conceptually simple and easy to implement, why haven't the foundational models incorporated it into their base functionality? That absence strikes me as a negative point about RAG and its variants, because if any of them worked, it would be in the models directly and would not need to be added afterwards.
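For anyone following along, the "conceptually simple" part of RAG comes down to retrieve-then-prepend. Here is a toy sketch in Python, not how any particular product does it. Assumptions: bag-of-words cosine similarity stands in for a learned embedding model and vector index, the three-sentence `DOCS` corpus is made up for illustration, and `retrieve` and `build_prompt` are hypothetical helper names. No model is called; the sketch stops at building the prompt.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and split on non-alphanumerics; a stand-in for a real embedding model.
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, docs, k=2):
    # Rank documents by similarity to the query and keep the top k.
    q = Counter(tokenize(query))
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True)
    return ranked[:k]


def build_prompt(query, docs, k=2):
    # Prepend the retrieved passages to the question; the result would go to an LLM.
    context = "\n".join(f"- {d}" for d in retrieve(query, docs, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


# Made-up illustrative corpus (hypothetical, not from any real index).
DOCS = [
    "DeepSeek V3 uses a mixture-of-experts architecture.",
    "The squirting cucumber ejects its seeds at high speed.",
    "GPT-2 was released in 2019.",
]

if __name__ == "__main__":
    print(build_prompt("When was GPT-2 released?", DOCS, k=1))
```

The retrieval step sits entirely outside the model, so the corpus can be swapped or updated without retraining anything.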
Chloebaker•1h ago
Honestly, it's crazy to think how far we've come since GPT-2 (2019). Today, comparing LLMs to determine their performance is notoriously challenging, and it feels like every two weeks a new model beats a benchmark. I'm really glad DeepSeek was mentioned here, because the key architectural techniques it introduced in V3, which improved its computational efficiency and distinguish it from many other LLMs, were really transformational when it came out.
strangescript•51m ago
The diagrams in this article are amazing if you are somewhere between a novice and an expert. Seeing all of the new models laid out next to each other is fantastic.
webappguy•2m ago
Would love to see a Part 2 covering even what is rumored about the top closed-source frontier models, e.g. o5, o3 Pro, o4 or 4.5, Gemini 2.5 Pro, Grok 4, and Claude Opus 4.

The bewildering phenomenon of declining quality

https://english.elpais.com/culture/2025-07-20/the-bewildering-phenomenon-of-declining-quality.html
200•geox•4h ago•277 comments

Async I/O on Linux in databases

https://blog.canoozie.net/async-i-o-on-linux-and-durability/
92•jtregunna•6h ago•25 comments

The Big LLM Architecture Comparison

https://magazine.sebastianraschka.com/p/the-big-llm-architecture-comparison
103•mdp2021•5h ago•7 comments

Show HN: ggc – A terminal-based Git CLI written in Go

https://github.com/bmf-san/ggc
25•bmf-san•3d ago•11 comments

Behind the ballistics of the 'explosive' squirting cucumber

https://phys.org/news/2025-07-ballistics-explosive-squirting-cucumber.html
12•PaulHoule•2d ago•1 comment

A Tour of Microsoft's Mac Lab

https://davidweiss.blogspot.com/2006/04/tour-of-microsofts-mac-lab.html
18•ingve•2h ago•4 comments

Hungary's oldest library is fighting to save books from a beetle infestation

https://www.npr.org/2025/07/14/nx-s1-5467062/hungary-library-books-beetles
141•smollett•3d ago•15 comments

Show HN: MCP server for Blender that builds 3D scenes via natural language

https://blender-mcp-psi.vercel.app/
49•prono•6h ago•10 comments

Make Your Own Backup System – Part 1: Strategy Before Scripts

https://it-notes.dragas.net/2025/07/18/make-your-own-backup-system-part-1-strategy-before-scripts/
288•Bogdanp•17h ago•93 comments

How the 'Minecraft' Score Became Big Business for Its Composer

https://www.billboard.com/pro/how-minecraft-score-became-big-business-for-composer/
16•tunapizza•3d ago•3 comments

Death by AI

https://davebarry.substack.com/p/death-by-ai
386•ano-ther•22h ago•156 comments

Nobody knows how to build with AI yet

https://worksonmymachine.substack.com/p/nobody-knows-how-to-build-with-ai
410•Stwerner•20h ago•324 comments

Borg – Deduplicating archiver with compression and encryption

https://www.borgbackup.org/
81•rubyn00bie•10h ago•26 comments

I tried vibe coding in BASIC and it didn't go well

https://www.goto10retro.com/p/vibe-coding-in-basic
118•ibobev•4d ago•128 comments

Beyond Meat fights for survival

https://foodinstitute.com/focus/beyond-meat-fights-for-survival/
111•airstrike•12h ago•242 comments

Local LLMs versus offline Wikipedia

https://evanhahn.com/local-llms-versus-offline-wikipedia/
274•EvanHahn•19h ago•160 comments

How to run an Arduino for years on a battery (2021)

https://makecademy.com/arduino-battery
70•thunderbong•3d ago•19 comments

Roman Roads Research Association (UK)

https://www.romanroads.org/index.html
20•countrymile•6h ago•4 comments

Mushroom learns to crawl after being given robot body (2024)

https://www.the-independent.com/tech/robot-mushroom-biohybrid-robotics-cornell-b2610411.html
136•Anon84•3d ago•36 comments

Robot metabolism: Toward machines that can grow by consuming other machines

https://www.science.org/doi/10.1126/sciadv.adu6897
6•XzetaU8•4h ago•1 comment

Matterport walkthrough of the original Microsoft Building 3

https://my.matterport.com/show/?m=SZSV6vjcf4L
52•uticus•3d ago•29 comments

What were the earliest laws like?

https://worldhistory.substack.com/p/what-were-the-earliest-laws-really
88•crescit_eundo•4d ago•38 comments

Scientists reveal a widespread but unidentified psychological phenomenon

https://www.psypost.org/scientists-reveal-a-widespread-but-previously-unidentified-psychological-phenomenon/
3•thunderbong•29m ago•0 comments

Ring introducing new feature to allow police to live-stream access to cameras

https://www.eff.org/deeplinks/2025/07/amazon-ring-cashes-techno-authoritarianism-and-mass-surveillance
303•xoa•14h ago•138 comments

Open-Source BCI Platform with Mobile SDK for Rapid Neurotech Prototyping

https://www.preprints.org/manuscript/202507.1198/v1
14•GaredFagsss•3d ago•1 comment

Rethinking CLI interfaces for AI

https://www.notcheckmark.com/2025/07/rethinking-cli-interfaces-for-ai/
176•Bogdanp•19h ago•75 comments

The curious case of the Unix workstation layout

https://thejpster.org.uk/blog/blog-2025-07-19/
94•ingve•20h ago•40 comments

“Bypassing” specialization in Rust

https://oakchris1955.eu/posts/bypassing_specialization/
38•todsacerdoti•3d ago•16 comments

Piano Keys

https://www.mathpages.com/home/kmath043.htm
60•gametorch•4d ago•59 comments

How we tracked down a Go 1.24 memory regression

https://www.datadoghq.com/blog/engineering/go-memory-regression/
177•gandem•2d ago•10 comments