frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: I challenged 10 AI giants using one open-source PDF (with full results)

https://zenodo.org/records/15718457
3•WFGY•4h ago
Hey HN,

This started as a personal experiment: one person, one framework, ten AI models.

I built a semantic reasoning engine (WFGY: All Principles Return to One) and tested how well each model could handle abstract logic, conceptual shifts, and consistent inference—all using the same PDF.

The results are posted above. No fancy wrappers, no login walls—just raw data, an illustrated battle poster, and the full experiment.

Yes, it's a bit weird. But it's real. And honestly? I just hope someone out there sees the effort and the courage it took to do this solo.

Happy to answer questions. Would love your feedback, criticism, or even memes. Thanks for taking a look

Comments

brown2000•3h ago
Honestly, this has got to be one of the gutsiest one-man AI stunts I’ve seen.

Like—going up against 10 big models at once, making it look like some kung fu battle, and then just dropping all the data out in the open? That’s kinda nuts (in a good way).

So, which model surprised you the most? Did any of them totally flip your prompt in a way you didn’t see coming?

WFGY•3h ago
Thanks for the kind words! Honestly? Claude messed with my head the most. Instead of answering, it reflected the question back at me like some kind of AI Zen master

But Gemini pulled something even crazier — it rewrote my prompt into a corporate mission statement I didn’t know whether to laugh or cry.

Each of them has their own “personality,” which is what made this challenge so wild. And yeah, dropping the data open-source was part courage, part madness, part… strategy

Still curious which one you think held up the best?

PicoGUS gets CD-ROM emulation

https://github.com/polpo/picogus/releases/tag/v3.0.0
1•zdw•1m ago•0 comments

Show HN: A place for artists and writers to live their live with love

https://nocative.com/pool
1•penpendian•2m ago•0 comments

Iran attacks U.S. military base in Qatar with missiles

https://www.cnbc.com/2025/06/23/iran-qatar-missiles-us-doha.html
1•WolfOliver•3m ago•0 comments

A comprehensive collection of essential online tools for developers

https://onlinedevtools.io/
1•jonpalanis•3m ago•1 comments

Cook from the Fridge

https://twitter.com/geoffreylitt/status/1937190399484805393
1•tosh•6m ago•0 comments

The Web Will Live Again [video]

https://www.youtube.com/watch?v=BsBPeXCyrO0
1•right2copy•10m ago•0 comments

Criteria-Eval: Evaluating Long-Form Answers to Complex Questions

https://samaya.ai/blog/evaluation-of-agents-at-samaya/
1•AnhTho_FR•10m ago•0 comments

Testing between intervals: a key to retaining information in long-term memory

https://theconversation.com/testing-between-intervals-a-key-to-retaining-information-in-long-term-memory-246511
1•mikhael•10m ago•0 comments

Call by Meaning (2014) [pdf]

https://tinlizzie.org/VPRIPapers/tr2014003_callbymeaning.pdf
1•todsacerdoti•13m ago•0 comments

All the Science at Risk in Trump's Clash with Harvard

https://www.nytimes.com/interactive/2025/06/22/upshot/harvard-funding-cuts.html
1•aatish•13m ago•0 comments

Show HN: CivicEcho, a tool to help you write emails to Congress (AGPL)

https://civicecho.org
1•abkhur•18m ago•0 comments

Rich Americans flock to apply for New Zealand's golden visas after rules relaxed

https://www.theguardian.com/world/2025/jun/23/americans-new-zealand-golden-visas-trump
1•miles•20m ago•0 comments

Zohran Mamdani Is Proposing Green Abundance for the Many

https://jacobin.com/2025/06/mamdani-nyc-election-climate-policy
1•Traces•20m ago•0 comments

A multivalued language with a dependent type system. (A precursor to Epic Verse [pdf]

https://www.leafpetersen.com/leaf/publications/dtp2013/lambda-aleph-overview.pdf
1•fanf2•22m ago•0 comments

Learning to Learn in the Age of LLMs

https://www.carette.xyz/posts/learning_to_learn/
3•weird_trousers•25m ago•0 comments

Sam Altman says he is most excited about AI for Science

https://www.youtube.com/watch?v=V979Wd1gmTU
2•charlesxjyang•25m ago•1 comments

AI Model Calls Therapist

https://twitter.com/whitecircle_ai/status/1937197915770167485
1•ovyan•25m ago•0 comments

Ambient Garden

https://ambient.garden
1•fipar•26m ago•0 comments

I figured out how to build AGI and built one

https://playwithagi.com/
1•mrxhacker99•26m ago•1 comments

Connections with James Burke – Official Trailer [video]

https://www.youtube.com/watch?v=o-aAFz0ala0
1•diggan•28m ago•0 comments

Billions to Trillions: Stablecoin Use-Cases Poised to Expand the Market

https://www.theblock.co/post/354675/billions-to-trillions-stablecoin-use-cases-poised-to-expand-the-market
2•wslh•31m ago•0 comments

Show HN: M7 Stock Diversifier – AI-Powered Portfolio Diversification for Techies

https://equitycopilot.app/m7-stock-diversifier
1•haichuan•32m ago•0 comments

A.I. Might Take Your Job. Here Are 22 New Ones It Could Give You

https://www.nytimes.com/2025/06/17/magazine/ai-new-jobs.html
4•twalichiewicz•32m ago•0 comments

The Twom Database Format

https://www.fastmail.com/blog/introducing-twom/
1•PaulHoule•34m ago•0 comments

3D Printing

1•monicaaa•35m ago•2 comments

Subreply vs. Mastodon vs. Bluesky vs. Threads vs. Nostr

https://subreply.com/reply/29912
4•lovamova•35m ago•1 comments

You are what you launch: how software became a lifestyle brand

https://omeru.bearblog.dev/lifestyle/
1•almost-exactly•36m ago•0 comments

How Startups Beat Incumbents

https://longform.asmartbear.com/startup-beats-incumbent/
1•tosh•37m ago•0 comments

Recipes – A Pattern for Common Code Transformations – Arcturus Labs

http://arcturus-labs.com/blog/2025/06/17/recipes--a-pattern-for-common-code-transformations/
1•trikdat•37m ago•0 comments

Show HN: Pshunt – Go terminal app for finding and killing processes

https://github.com/jamesma100/pshunt
1•battle-racket•37m ago•0 comments