news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: 26/02/26 – 5 songs in a day

https://playingwith.variousbits.net/saturday

1•dmje•29s ago•0 comments

Toroidal Logit Bias – Reduce LLM hallucinations 40% with no fine-tuning

https://github.com/Paraxiom/topological-coherence

1•slye514•2m ago•1 comments

Top AI models fail at >96% of tasks

https://www.zdnet.com/article/ai-failed-test-on-remote-freelance-jobs/

3•codexon•2m ago•1 comments

The Science of the Perfect Second (2023)

https://harpers.org/archive/2023/04/the-science-of-the-perfect-second/

1•NaOH•3m ago•0 comments

Bob Beck (OpenBSD) on why vi should stay vi (2006)

https://marc.info/?l=openbsd-misc&m=115820462402673&w=2

2•birdculture•7m ago•0 comments

Show HN: Glimpsh- glimpse into the future of eye tracking for multi-agent use

https://github.com/dchrty/glimpsh

1•dochrty•8m ago•0 comments

The Optima-l Situation: A deep dive into the classic humanist sans-serif

https://micahblachman.beehiiv.com/p/the-optima-l-situation

1•subdomain•8m ago•0 comments

Barn Owls Know When to Wait

https://blog.typeobject.com/posts/2026-barn-owls-know-when-to-wait/

1•fintler•8m ago•0 comments

Implementing TCP Echo Server in Rust [video]

https://www.youtube.com/watch?v=qjOBZ_Xzuio

1•sheerluck•9m ago•0 comments

LicGen – Offline License Generator (CLI and Web UI)

1•tejavvo•12m ago•0 comments

Service Degradation in West US Region

https://azure.status.microsoft/en-gb/status?gsid=5616bb85-f380-4a04-85ed-95674eec3d87&utm_source=...

2•_____k•12m ago•0 comments

The Janitor on Mars

https://www.newyorker.com/magazine/1998/10/26/the-janitor-on-mars

1•evo_9•14m ago•0 comments

Bringing Polars to .NET

https://github.com/ErrorLSC/Polars.NET

3•CurtHagenlocher•16m ago•0 comments

Adventures in Guix Packaging

https://nemin.hu/guix-packaging.html

1•todsacerdoti•17m ago•0 comments

Show HN: We had 20 Claude terminals open, so we built Orcha

1•buildingwdavid•17m ago•0 comments

Your Best Thinking Is Wasted on the Wrong Decisions

https://www.iankduncan.com/engineering/2026-02-07-your-best-thinking-is-wasted-on-the-wrong-decis...

1•iand675•17m ago•0 comments

Warcraftcn/UI – UI component library inspired by classic Warcraft III aesthetics

https://www.warcraftcn.com/

1•vyrotek•18m ago•0 comments

Trump Vodka Becomes Available for Pre-Orders

https://www.forbes.com/sites/kirkogunrinde/2025/12/01/trump-vodka-becomes-available-for-pre-order...

1•stopbulying•20m ago•0 comments

Velocity of Money

https://en.wikipedia.org/wiki/Velocity_of_money

1•gurjeet•22m ago•0 comments

Stop building automations. Start running your business

https://www.fluxtopus.com/automate-your-business

1•valboa•26m ago•1 comments

You can't QA your way to the frontier

https://www.scorecard.io/blog/you-cant-qa-your-way-to-the-frontier

1•gk1•27m ago•0 comments

Show HN: PalettePoint – AI color palette generator from text or images

https://palettepoint.com

1•latentio•28m ago•0 comments

Robust and Interactable World Models in Computer Vision [video]

https://www.youtube.com/watch?v=9B4kkaGOozA

2•Anon84•32m ago•0 comments

Nestlé couldn't crack Japan's coffee market.Then they hired a child psychologist

https://twitter.com/BigBrainMkting/status/2019792335509541220

1•rmason•33m ago•1 comments

Notes for February 2-7

https://taoofmac.com/space/notes/2026/02/07/2000

2•rcarmo•35m ago•0 comments

Study confirms experience beats youthful enthusiasm

https://www.theregister.com/2026/02/07/boomers_vs_zoomers_workplace/

2•Willingham•42m ago•0 comments

The Big Hunger by Walter J Miller, Jr. (1952)

https://lauriepenny.substack.com/p/the-big-hunger

2•shervinafshar•43m ago•0 comments

The Genus Amanita

https://www.mushroomexpert.com/amanita.html

1•rolph•48m ago•0 comments

We have broken SHA-1 in practice

https://shattered.io/

10•mooreds•48m ago•4 comments

Ask HN: Was my first management job bad, or is this what management is like?

1•Buttons840•50m ago•0 comments

Open in hackernews

Grok 4, deep dive comparisons with o3 (benchmarks are broken)

https://www.youtube.com/watch?v=v4JYNhhdruA

1•Sherveen•6mo ago

Comments

Sherveen•6mo ago

I'm kind of bothered by how many folks in the "AI influencer" space just pick up on the latest model hype, "Grok 4 changes EVERYTHING" type of nonsense.

And Grok 4 is a great example where they're just completely lying about the practical results. Elon wants to claim this is the smartest model, but it's like... 3rd or 4th best, at best.

Benchmarks, for a variety of reasons, now seem inadequate to capture models' actual strength, so I decided to run Grok 4 and o3 (and Grok 4 Heavy + o3-pro) through a gauntlet of questions that I think demonstrate real, practical differences between the two.

Hope this is helpful!