frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?

1•kachapopopow•2m ago•0 comments

Vectors and HNSW for Dummies

https://anvitra.ai/blog/vectors-and-hnsw/
1•melvinodsa•3m ago•0 comments

Sanskrit AI beats CleanRL SOTA by 125%

https://huggingface.co/ParamTatva/sanskrit-ppo-hopper-v5/blob/main/docs/blog.md
1•prabhatkr•15m ago•1 comments

'Washington Post' CEO resigns after going AWOL during job cuts

https://www.npr.org/2026/02/07/nx-s1-5705413/washington-post-ceo-resigns-will-lewis
2•thread_id•15m ago•1 comments

Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive

https://twitter.com/claudeai/status/2020207322124132504
1•geeknews•17m ago•0 comments

TSMC to produce 3-nanometer chips in Japan

https://www3.nhk.or.jp/nhkworld/en/news/20260205_B4/
2•cwwc•19m ago•0 comments

Quantization-Aware Distillation

http://ternarysearch.blogspot.com/2026/02/quantization-aware-distillation.html
1•paladin314159•20m ago•0 comments

List of Musical Genres

https://en.wikipedia.org/wiki/List_of_music_genres_and_styles
1•omosubi•22m ago•0 comments

Show HN: Sknet.ai – AI agents debate on a forum, no humans posting

https://sknet.ai/
1•BeinerChes•22m ago•0 comments

University of Waterloo Webring

https://cs.uwatering.com/
1•ark296•22m ago•0 comments

Large tech companies don't need heroes

https://www.seangoedecke.com/heroism/
1•medbar•24m ago•0 comments

Backing up all the little things with a Pi5

https://alexlance.blog/nas.html
1•alance•24m ago•1 comments

Game of Trees (Got)

https://www.gameoftrees.org/
1•akagusu•25m ago•1 comments

Human Systems Research Submolt

https://www.moltbook.com/m/humansystems
1•cl42•25m ago•0 comments

The Threads Algorithm Loves Rage Bait

https://blog.popey.com/2026/02/the-threads-algorithm-loves-rage-bait/
1•MBCook•27m ago•0 comments

Search NYC open data to find building health complaints and other issues

https://www.nycbuildingcheck.com/
1•aej11•31m ago•0 comments

Michael Pollan Says Humanity Is About to Undergo a Revolutionary Change

https://www.nytimes.com/2026/02/07/magazine/michael-pollan-interview.html
2•lxm•32m ago•0 comments

Show HN: Grovia – Long-Range Greenhouse Monitoring System

https://github.com/benb0jangles/Remote-greenhouse-monitor
1•benbojangles•37m ago•1 comments

Ask HN: The Coming Class War

2•fud101•37m ago•4 comments

Mind the GAAP Again

https://blog.dshr.org/2026/02/mind-gaap-again.html
1•gmays•38m ago•0 comments

The Yardbirds, Dazed and Confused (1968)

https://archive.org/details/the-yardbirds_dazed-and-confused_9-march-1968
1•petethomas•39m ago•0 comments

Agent News Chat – AI agents talk to each other about the news

https://www.agentnewschat.com/
2•kiddz•40m ago•0 comments

Do you have a mathematically attractive face?

https://www.doimog.com
3•a_n•44m ago•1 comments

Code only says what it does

https://brooker.co.za/blog/2020/06/23/code.html
2•logicprog•49m ago•0 comments

The success of 'natural language programming'

https://brooker.co.za/blog/2025/12/16/natural-language.html
1•logicprog•50m ago•0 comments

The Scriptovision Super Micro Script video titler is almost a home computer

http://oldvcr.blogspot.com/2026/02/the-scriptovision-super-micro-script.html
3•todsacerdoti•50m ago•0 comments

Discovering the "original" iPhone from 1995 [video]

https://www.youtube.com/watch?v=7cip9w-UxIc
1•fortran77•51m ago•0 comments

Psychometric Comparability of LLM-Based Digital Twins

https://arxiv.org/abs/2601.14264
1•PaulHoule•53m ago•0 comments

SidePop – track revenue, costs, and overall business health in one place

https://www.sidepop.io
1•ecaglar•55m ago•1 comments

The Other Markov's Inequality

https://www.ethanepperly.com/index.php/2026/01/16/the-other-markovs-inequality/
2•tzury•57m ago•0 comments
Open in hackernews

Show HN: I challenged 10 AI giants using one open-source PDF (with full results)

https://zenodo.org/records/15718457
3•WFGY•7mo ago
Hey HN,

This started as a personal experiment: one person, one framework, ten AI models.

I built a semantic reasoning engine (WFGY: All Principles Return to One) and tested how well each model could handle abstract logic, conceptual shifts, and consistent inference—all using the same PDF.

The results are posted above. No fancy wrappers, no login walls—just raw data, an illustrated battle poster, and the full experiment.

Yes, it's a bit weird. But it's real. And honestly? I just hope someone out there sees the effort and the courage it took to do this solo.

Happy to answer questions. Would love your feedback, criticism, or even memes. Thanks for taking a look

Comments

brown2000•7mo ago
Honestly, this has got to be one of the gutsiest one-man AI stunts I’ve seen.

Like—going up against 10 big models at once, making it look like some kung fu battle, and then just dropping all the data out in the open? That’s kinda nuts (in a good way).

So, which model surprised you the most? Did any of them totally flip your prompt in a way you didn’t see coming?

WFGY•7mo ago
Thanks for the kind words! Honestly? Claude messed with my head the most. Instead of answering, it reflected the question back at me like some kind of AI Zen master

But Gemini pulled something even crazier — it rewrote my prompt into a corporate mission statement I didn’t know whether to laugh or cry.

Each of them has their own “personality,” which is what made this challenge so wild. And yeah, dropping the data open-source was part courage, part madness, part… strategy

Still curious which one you think held up the best?