frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Free evals API for AI startups (ship 10x faster with evals you can trust)

2•sfox100•19h ago
Hey HN,

We built Composo because AI apps fail unpredictably and teams have no idea if their changes helped.

LLM-as-judge doesn't work - it gives random scores, doesn't work well for agents, and doesn't tell you what to fix.

We've built purpose-built evaluation models that give you: - Deterministic scores (same input = same score, always) - Instant identification of where prompts, retrievals, agents & tool calls fail - Exact failure analysis ("tool calls are looping due to poorly specified schema")

We're 92% accurate vs 72% for SOTA LLM-as-judge.

Giving 10 startups free access: - 10k eval credits - Just launched our evals API for agents & tool calling - 5 min setup

Already helping teams at Palantir, Accenture, and Tesla ship reliable AI.

Apply: composo.short.gy/startups

Happy to answer questions about evaluation, reward models, or why LLMs are bad at judging themselves. startups@composo.ai

Proton Authenticator – secure 2FA, your way

https://proton.me/blog/authenticator-app
1•tessierashpool9•3m ago•0 comments

Show HN: GistFans–First developer platform with 100% community-driven governance

https://www.gistfans.com/
1•ff12wq111•5m ago•1 comments

70% of IT budgets are being drained just to keep legacy systems afloat

https://www.techolution.com/blog/5-reasons-postponing-legacy-modernization-2026-could-spell-catastrophic-risk/
1•WoodenDist2857•6m ago•0 comments

Peter Thiel backing first private US uranium enrichment facility in Paducah

https://www.wkms.org/energy/2025-07-25/billionaire-peter-thiel-backing-first-privately-developed-us-uranium-enrichment-facility-in-paducah
1•mrtksn•10m ago•0 comments

Seeing with Your Ear: A Humble Experiment in AI, Depth, and Spatial Sound

https://medium.com/@jan.mittelman/seeing-with-your-ear-a-humble-experiment-in-ai-depth-and-spatial-sound-08271701f336
1•vedmakk•11m ago•0 comments

CRISPR-GPT for agentic automation of gene-editing experiments

https://www.nature.com/articles/s41551-025-01463-z#author-information
1•instagraham•12m ago•0 comments

The upstart company that wants to build the largest aircraft

https://www.bbc.com/future/article/20250729-windrunner-the-company-that-wants-to-build-the-worlds-largest-aircraft
1•disqard•12m ago•0 comments

New Google AI model maps world in 10-meter squares for machines to read

https://www.theregister.com/2025/07/31/google_ai_maps_world/
1•beardyw•17m ago•2 comments

MLCommons Releases MLPerf Client v1.0

https://mlcommons.org/2025/07/mlperf-client-v1-0/
1•akshayt•19m ago•0 comments

Russian Government-Linked Social Engineering Targets App-Specific Passwords

https://citizenlab.ca/2025/06/russian-government-linked-social-engineering-targets-app-specific-passwords/
2•villaaston1•19m ago•0 comments

Node LTS now supports TypeScript

https://nodejs.org/en/blog/release/v22.18.0
1•robpalmer•22m ago•0 comments

Giorgia Meloni's government makes a bet on unproven nuclear technology

https://www.politico.eu/article/giorgia-meloni-government-plan-nuclear-technology/
1•mdp2021•24m ago•1 comments

Stack traces for Postgres errors with backtrace_functions

https://www.enterprisedb.com/blog/stack-traces-postgres-errors-backtracefunctions
1•ibobev•25m ago•0 comments

Break the selective silence on the genocide in Gaza

https://www.thelancet.com/journals/lancet/article/PIIS0140-6736%2825%2901541-7/fulltext
2•mhga•27m ago•0 comments

A delightfully silly database that lives in CPU cache

https://blog.canoozie.net/when-your-database-lives-in-cpu-cache-because-why-not/
1•jtregunna•30m ago•0 comments

First Australian-made rocket crashes shortly after lift-off

https://www.bbc.com/news/videos/cz93xzv3njjo
1•taubek•32m ago•0 comments

Relic of the space race hidden in Everglades: secret moon rocket (2014)

https://www.dailymail.co.uk/news/article-2564844/Relic-space-race-hidden-Everglades-The-secret-10-storey-moon-rocket-abandoned-40-years.html
2•taubek•36m ago•0 comments

UK Civil service interns must be working class, government says

https://www.bbc.co.uk/news/articles/c3ez3v9v8jqo
1•mellosouls•39m ago•0 comments

Computer Networking a Top-Down Approach, 9th Edition

https://gaia.cs.umass.edu/kurose_ross/ninth.php
1•SiqingYu•39m ago•0 comments

Wan AI – Wan 2.2: Leading AI Video Generation Model

https://www.wan-ai.co
1•laiwuchiyuan•39m ago•0 comments

Kindle Jailbreak Update

https://kindlemodding.org/jailbreaking/5.18/
1•dumbumdum•40m ago•1 comments

Show HN: Go-Pubsub – A Lightweight, Real-Time Pub-Sub Library for Golang

https://github.com/F2077/go-pubsub
1•F2077•51m ago•0 comments

Proximity Audio/Video Technology

https://www.thegamer.com/best-games-with-proximity-chat/
1•JamesPark1982•53m ago•0 comments

The vibe coding okay, the real game changer will be the design of our projects

1•cesargstn•54m ago•0 comments

Switzerland Slammed with 39% Tariff Rate in US Trade Blitz

https://www.swissinfo.ch/eng/switzerland-slammed-with-39%25-tariff-rate-in-us-trade-blitz/89768316
2•sschueller•58m ago•0 comments

The Comprehensive Guide to Knowledge Graphs

https://www.agilelab.it/blog/the-comprehensive-guide-to-knowledge-graphs
1•ninocan•58m ago•0 comments

Show HN: New VSCode extension Function Explorer

https://marketplace.visualstudio.com/items?itemName=eridien.vscode-function-explorer
1•mchahn•1h ago•0 comments

The Beman Project: Tomorrow's C++ Standard Libraries Today

https://bemanproject.org/
1•ingve•1h ago•0 comments

Show HN: Dotfiles Management Tool

https://github.com/crhuber/dot
1•cr_huber•1h ago•0 comments

Greentea OS non-NT/non-Unix system from scratch runs .exe files

https://github.com/GreenteaOS/Greentea/releases/tag/2025.7.29
2•PeyTy•1h ago•4 comments