news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I found the most powerful large models in various fields

https://nanoai.run

1•Li_Evan•2h ago

Comments

Li_Evan•2h ago

I have created my own original large-scale model evaluation dataset with 18 major dimensions, nearly 100 minor dimensions, and a total of 970 questions. The following are the test results: 1. Software Engineering and Code Generation: GPT-5.3 codex 2. Code Comprehension, Reasoning, and Quality: GPT-5.3 codex 3. Debugging, Testing, and Maintenance: GPT-5.3 codex 4. Data Engineering and Backend Services: Claude Opus 4.6 5. Frontend and Product Engineering: Claude Opus 4.6 6. Agent Tool Invocation: Claude Opus 4.6 7. Web and Desktop Automation (Static): Claude Opus 4.6 8. Research and Knowledge Work Agent (Static): GPT-5.2 Pro 9. Mathematical and Formal Reasoning: Gemini 3.1 Pro 10. Logic and Planning: Gemini 3.1 Pro 11. Knowledge Breadth and Fact Verification: Gemini DeepThink 12. Reading Comprehension and Information Extraction: GPT-5.2 Thinking 13. Long Contextual Memory and Multi-turn Consistency: GPT-5.2 Thinking 14. Instruction Compliance and Alignment: Claude Opus 4.6 15. Multimodal Understanding and Visual Reasoning: GPT-5.2 Thinking 16. Emotional Intelligence and Collaborative Communication: GPT-4.5 17. Creative Expression and Aesthetics: Claude Opus 4.6

We Have (Software) Replicators

https://schappi.com/blog/we-have-software-replicators

1•schappim•2m ago•0 comments

Iterflow – Composable streaming statistics for JavaScript/TS

https://www.npmjs.com/package/@mathscapes/iterflow

1•gvsh_maths•5m ago•1 comments

Show HN: Madari: CLI tool to install, sync and manage local MCP servers

https://github.com/ankitvg/madari/releases/tag/v0.1.1

1•rajma•5m ago•0 comments

Mathematics in the Library of Babel

https://www.daniellitt.com/blog/2026/2/20/mathematics-in-the-library-of-babel

1•owenpayton•6m ago•0 comments

Tech Influencers Slam Hacker News Toxicity After OpenAI Hire Attacks

https://x.com/i/trending/2025399179196203524

1•make_it_sure•8m ago•0 comments

HTTP/3 on FreeBSD: Getting QUIC Working with Nginx in a Bastille Jail

https://blog.hofstede.it/http3-on-freebsd-getting-quic-working-with-nginx-in-a-bastille-jail/

1•todsacerdoti•12m ago•0 comments

Ask HN: Is HN becoming more toxic?

1•make_it_sure•16m ago•0 comments

Show HN: TurboDraft – fast Ctrl-G prompt editor for Claude Code and Codex CLI

https://github.com/gradigit/turbodraft

2•gradigit•27m ago•0 comments

Built a Tinder-style investing app for all investors! Need 7-day beta testers

https://investswipe-demo-v1.vercel.app/

1•barelybushy•29m ago•1 comments

Show HN: Sketch Paint for Apple

https://apps.apple.com/us/app/sketch-paint/id6753883078

1•Codegres•30m ago•0 comments

DHS Will Suspend TSA PreCheck and Global Entry

https://www.washingtonpost.com/nation/2026/02/21/tsa-precheck-global-entry-shutdown/

2•jbegley•31m ago•1 comments

Welcome to the Era of Anarchic Antitrust

https://www.economist.com/business/2026/02/18/welcome-to-the-era-of-anarchic-antitrust

1•1vuio0pswjnm7•34m ago•0 comments

Show HN: LogSnap – CLI tool for analyzing logs locally

https://github.com/Sonic001-h/logsnap

1•baba_yaga070•36m ago•0 comments

Climate Physicists Face the Ghosts in Their Machines: Clouds

https://www.quantamagazine.org/climate-physicists-face-the-ghosts-in-their-machines-clouds-20260220/

1•tzury•38m ago•0 comments

Calculemus: Why policy has correct answers and nobody wants to find them

https://kunnas.com/articles/calculemus

1•ekns•41m ago•0 comments

Today is my last day at Anthropic

https://twitter.com/mrinanksharma/status/2020881722003583421

1•RyanShook•44m ago•1 comments

Testing the Pugilism Hypothesis for the Evolution of Human Facial Hair

https://pubmed.ncbi.nlm.nih.gov/33791549/

2•SEJeff•44m ago•0 comments

Astronomical Ceiling of Senenmut's Tomb

https://en.wikipedia.org/wiki/Astronomical_ceiling_of_Senenmut%27s_Tomb

1•slater•46m ago•0 comments

The importance of limiting syndication feed requests in some way

https://utcc.utoronto.ca/~cks/space/blog/web/FeedLimitingImportance

4•LorenDB•46m ago•0 comments

Monitor your world with one daily report

https://monitorish.com/

1•chaisan•49m ago•0 comments

Iranian Students Protest as Anger Grows

https://www.wsj.com/world/middle-east/iranian-students-protest-as-anger-grows-89a6a44e

5•JumpCrisscross•50m ago•0 comments

Sam Altman would like remind you that humans use a lot of energy, too

https://techcrunch.com/2026/02/21/sam-altman-would-like-remind-you-that-humans-use-a-lot-of-energ...

5•manicennui•56m ago•3 comments

JPMorgan concedes it closed Trump's accounts after Jan. 6 attack

https://apnews.com/article/trump-jpmorgan-dimon-debanking-2e0db127f360e5dbe1d3cc975dd73703

4•linhns•59m ago•0 comments

Stillpoint MCP – Delivering encouragement messages improves model results

https://www.modelwelfare.xyz/

1•henry700•1h ago•0 comments

Show HN: Rust blockchain with sharded propagation and post-quantum signatures

https://alphanumeric.blue/

2•invar1ant•1h ago•0 comments

Scammers fleeced pensioner out of $1,338. So he sued his bank for $379M

https://www.abc.net.au/news/2026-02-22/ian-williams-on-why-he-sued-nab-bank/106093720

6•ahonhn•1h ago•1 comments

MCPs are dead - CLIs won

4•umairnadeem123•1h ago•6 comments

Surprising Effectiveness of Masking Updates in Adaptive Optimizers

https://arxiv.org/abs/2602.15322

1•energy123•1h ago•0 comments

California Succession Proposition on the Ballot

https://ballotpedia.org/California_Independence_Plebiscite_Initiative_(2026)

2•donsupreme•1h ago•0 comments

I built a bare-metal UI framework to survive extreme CPU contention

https://toyz.github.io/loom/#/

1•helba_the_ai•1h ago•1 comments