frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: What are most up-to-date LLM Benchmarks for Agentic Coding

2•vladgur•2h ago
There are a lot of SOTA and smaller models coming out every month and many of them claim great coding output, tool execution, etc at a better cost than their competitor, but i havent been able to find any up-to-date benchmark that would actually confirm and compare these models in terms of speed, quality and price.

For instance: https://gso-bench.github.io/leaderboard.html seems to be a few months behind and is missing few key models like Grok and some others.

How do you decide which model to use for your day-to-day and are there good metrics that help with that decision

DeepWiki to MdBook Converter

https://docs.deepwiki-to-mdbook.zenosmosis.com/
1•rustic-indian•1m ago•1 comments

A Book Tracking Package for Emacs

https://lars.ingebrigtsen.no/2025/04/15/a-book-tracking-package-for-emacs/
1•smartmic•3m ago•0 comments

Biological sex is binary: the new heresy

1•Pocomon•5m ago•0 comments

Heron's Formula for Spherical Triangle

https://www.johndcook.com/blog/2025/11/08/heron-on-a-sphere/
1•ibobev•7m ago•0 comments

Rolling Correlation

https://www.johndcook.com/blog/2025/11/09/rolling-correlation/
1•ibobev•7m ago•0 comments

Show HN: Valid8r, Functional validation for Python CLIs using Maybe monads

https://github.com/mikelane/valid8r
1•lanemik•8m ago•0 comments

Google ADK-Go

https://github.com/google/adk-go
1•chisleu•13m ago•1 comments

AMD doubles rack size for 2027 with Verano CPUs and 144 MI500 GPUs

https://www.techradar.com/pro/here-is-a-glimpse-of-the-absurdly-powerful-ai-rack-amd-will-launch-...
2•ibobev•15m ago•0 comments

The Roman Empire's Road Network

https://gizmodo.com/the-roman-empires-entire-road-network-just-got-mapped-and-its-mind-blowing-20...
3•gmays•15m ago•0 comments

ExtraDocAI – Transform your documents into structured data

https://extradoc.ai/
2•fernando_gb•17m ago•0 comments

German Food Banks Feed 37,000 US Soldiers

https://hanschristensen.substack.com/p/german-food-banks-feed-37000-us-soldiers
14•DyslexicAtheist•18m ago•2 comments

The devastating memo that plunged the BBC into crisis

https://www.telegraph.co.uk/news/2025/11/06/read-devastating-internal-bbc-memo-in-full/
6•thesmtsolver•20m ago•0 comments

'Up to 15 or 20′ air traffic controllers are retiring daily

https://www.pennlive.com/nation-world/2025/11/up-to-15-or-20-air-traffic-controllers-are-retiring...
3•geox•20m ago•0 comments

Offer HN: Free Executive Assistant Onboarding Template (Sop)

https://north-pressure-8f8.notion.site/SOP-Executive-Preferences-2a650a3082a481fe8034e27e84f8b15b...
2•LillyTam•21m ago•1 comments

Show HN: I built SyncForge to connect indie artists with sync deals

https://trysyncforge.xyz
2•artskyinc•22m ago•0 comments

A collection of outlandish HCI papers

https://floe.butterbrot.org/matrix/rants/weird/
2•luu•24m ago•0 comments

Industrial facilities owned by profitable companies release more toxic waste

https://theconversation.com/industrial-facilities-owned-by-profitable-companies-release-more-of-t...
2•PaulHoule•30m ago•0 comments

It's Not Me [pdf]

https://www.berkshirehathaway.com/news/nov0625.pdf
5•kamaraju•30m ago•0 comments

The evolution of laziness: Why humans resist the gym [video]

https://www.youtube.com/watch?v=TLY0TNm67hY
2•paulpauper•31m ago•0 comments

The Algorithmic Turn: The Emerging Evidence on AI Tutoring That's Hard to Ignore

https://carlhendrick.substack.com/p/the-algorithmic-turn-the-emerging
3•paulpauper•33m ago•1 comments

Farmers' Almanac will cease publication

https://www.washingtonpost.com/nation/2025/11/08/farmers-almanac-ends-publication/
4•paulpauper•33m ago•1 comments

"Good engineering management" is a fad

https://lethain.com/good-eng-mgmt-is-a-fad/
3•coderintherye•35m ago•1 comments

Password to Louvre video surveillance system was 'Louvre', according to employee

https://abcnews.go.com/International/password-louvres-video-surveillance-system-louvre-employee/s...
7•scruple•37m ago•1 comments

Resolving the Scourge of Java's Checked Exceptions on Its Streams and Lambdas

https://javajanitorjim.substack.com/p/java-janitor-jim-resolving-the-scourge
3•jimofl•39m ago•0 comments

If You're Not Active, You're Sick – You Just Don't Know It Yet

https://howardluksmd.substack.com/p/if-youre-not-active-youre-sick-you
3•rzk•40m ago•0 comments

"erase startup-config" isn't enough

https://alyx.sh/posts/erase-startup-config/
3•todsacerdoti•44m ago•0 comments

Custom doorbell app with Home Assistant and WebRTC

https://www.naps62.com/posts/custom-doorbell-app-with-homeassistant
3•naps62•45m ago•0 comments

Role of Inactivity in Chronic Diseases

https://pmc.ncbi.nlm.nih.gov/articles/PMC6347102/
5•rzk•49m ago•0 comments

Kaist Team Pioneers Core Technology for C-to-Rust Conversion, and More

https://m.dongascience.com/en/news/74991
3•kuil009•53m ago•1 comments

Conversion Rate Optimization

https://en.wikipedia.org/wiki/Conversion_rate_optimization
2•ugur2nd•54m ago•0 comments