frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Ask HN: What is "response-level error rate" and how is it measured?

2•myyke•2h ago
There's this chart around gpt-5's hallu and error rates:

https://api.wandb.ai/files/byyoung3/images/projects/37269171/0da61431.png

from:

https://wandb.ai/byyoung3/ml-news/reports/GPT-5-Benchmark-Scores---VmlldzoxMzkwMTYyMg

I'm wondering what "response-level error rate" is exactly and it is measured?

gpt 4.1 says it's sampled production prompts, rated by humans. Is that it?

Ask HN: Anyone else using open-source platforms to avoid SaaS lock-in?

1•sharan_gohar_•4m ago•0 comments

Linux KDE Plasma Features That Changed How I Use My PC

https://www.howtogeek.com/linux-kde-plasma-features-that-completely-changed-how-i-use-my-pc/
1•jrepinc•6m ago•0 comments

Ask HN: Has Cloudflare blocked your domain without notice?

4•turowicz•7m ago•1 comments

Salestarget.ai – AI-powered B2B lead generation, cold email outreach, and CRM

1•Salestargetai•9m ago•0 comments

Sqlite3 will also read and write ZIP archives

https://mastodon.social/@jpmens/114991866577330548
1•mtmail•11m ago•0 comments

Think you can solve 6 Wordles at once?

https://bythomas.co.uk/stackronym/
1•tedavis•14m ago•1 comments

US to rewrite its past national climate reports

https://www.france24.com/en/live-news/20250807-us-to-rewrite-its-past-national-climate-reports
5•mdhb•14m ago•0 comments

Moby Dick Big Read (2012-2013)

https://www.mobydickbigread.com/
1•robin_reala•15m ago•0 comments

Show HN: Moocup – open-source offline-first tool to create preety screenshots.

https://github.com/jellydeck/moocup
1•jdsane•16m ago•0 comments

Why Is History Crucial to Politics?

https://www.historyforpeace.pw/post/why-is-history-crucial-to-politics-romila-thapar
2•fluffybeing•17m ago•0 comments

Redefining enterprise data with agents and AI-native foundations

https://cloud.google.com/blog/products/data-analytics/new-agents-and-ai-foundations-for-data-teams
1•mariuz•18m ago•0 comments

How to Use Antlr Pattern Matching

https://tomassetti.me/how-to-use-antlr-pattern-matching/
2•ingve•20m ago•0 comments

GPT-5 vs. GPT-4 for AI medical diagnosis examples

https://github.com/joelparkerhenderson/ai-medical-diagnosis-examples
1•jph•21m ago•0 comments

Show HN: A tempmail service made with Rust

https://vortex.skyfall.dev/
2•JustSkyfall•24m ago•0 comments

Tell HN: Thing I learned this year was keeping a work journal

4•Muromec•26m ago•0 comments

Ask HN: What teckstack should I use as a noob?

1•Qualitywolf2•27m ago•1 comments

I Built OmniAgent: The Missing Bridge Between MCP and Custom Business Logic

1•abiorh001•29m ago•0 comments

HTTP Is Not Simply

https://daniel.haxx.se/blog/2025/08/08/http-is-not-simple/
1•bigblind•30m ago•0 comments

Show HN: MBCompass – FOSS Android and Compass App

https://github.com/CompassMB/MBCompass
1•nativeforks•32m ago•1 comments

One Event at a Time: Funding Your Community the Realistic Way

https://georgiker.com/blog/one-event-at-a-time/
1•cmaureir•33m ago•0 comments

Show HN: A fast and detailed guide for the game Grounded

https://grounded2.net
1•airobus•36m ago•1 comments

Show HN: I built a service to run Claude Code in the Cloud

https://agentwrap.dev/
2•dvolkhonskiy•40m ago•1 comments

Abusing Ubuntu 24.04 features for root privilege escalation

https://labs.snyk.io/resources/abusing-ubuntu-root-privilege-escalation/
1•todsacerdoti•41m ago•0 comments

Booting 5000 Erlangs on Ampere One 192-core

https://underjord.io/booting-5000-erlangs-on-ampere-one.html
2•ingve•42m ago•0 comments

AI pilots can be a nightmare

https://blog.paid.ai/p/ai-pilots-can-be-a-nightmare
1•arnon•43m ago•0 comments

Ask HN: Which processor to pick for learning assembly?

3•shivajikobardan•44m ago•2 comments

Checkpointing CUDA Applications with CRIU (2024)

https://developer.nvidia.com/blog/checkpointing-cuda-applications-with-criu/
1•tanelpoder•44m ago•0 comments

Tesla shuts down Dojo, the AI training computer that is key to full self-driving

https://techcrunch.com/2025/08/07/tesla-shuts-down-dojo-the-ai-training-supercomputer-that-musk-said-would-be-key-to-full-self-driving/
4•jnord•44m ago•0 comments

New 3D Golf Simulation (video game series)

https://blog.gingerbeardman.com/2024/11/09/new-3d-golf-simulation-video-game-series/
1•msephton•47m ago•0 comments

Results of 2nd International Olympiad in AI (IOAI)

https://ioai-official.org/china-2025/results-2025/
1•pfilo•49m ago•0 comments