frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Deterministic evals API – the alt for LLMasJudge (free credits)

1•sfox100•21h ago
Hey HN,

We built Composo because AI apps fail unpredictably and teams have no idea if their changes helped.

LLM-as-judge doesn't work - it gives random scores, doesn't work well for agents, and doesn't tell you what to fix.

We've built purpose-built evaluation models that give you: - Deterministic scores (same input = same score, always) - Instant identification of where prompts, retrievals, agents & tool calls fail - Exact failure analysis ("tool calls are looping due to poorly specified schema")

We're 92% accurate vs 72% for SOTA LLM-as-judge.

Giving 10 startups free access: - 10k eval credits - Just launched our evals API for agents & tool calling - 5 min setup

Already helping teams at Palantir, Accenture, and Tesla ship reliable AI.

Apply: composo.short.gy/startups

Happy to answer questions about evaluation, reward models, or why LLMs are bad at judging themselves. startups@composo.ai

OpenCQRS – an open-source CQRS framework for the JVM

https://github.com/open-cqrs/opencqrs
1•goloroden•22s ago•0 comments

Show HN: I made a website to find relevant conversations about your brand

https://socialbrandmonitoring.com
1•tech_nurgaliyev•1m ago•0 comments

My first browser extension – AEOadvice.com

https://aeoadvice.com/
1•scencan•2m ago•1 comments

Amazon DocumentDB Serverless is now available

https://aws.amazon.com/blogs/aws/amazon-documentdb-serverless-is-now-available/
1•mariuz•2m ago•0 comments

Why Won't Anyone Use the Beautiful Corporate Spaces

https://loganmarek.com/why-wont-anyone-use-the-beautiful-corporate-spaces/
1•xvok•2m ago•0 comments

Google ADK and AMD Instinct GPUs: The Dynamic Duo for AI Agents

https://www.amd.com/en/developer/resources/technical-articles/2025/google-adk-amd-instinct-gpus-the-dynamic-duo-for-ai-agents.html
1•mariuz•2m ago•0 comments

How to Build a Satellite?

https://www.youtube.com/watch?v=5voQfQOTem8
1•kehiy•4m ago•0 comments

'This wasn't obvious': the potato evolved from a tomato ancestor

https://www.theguardian.com/science/2025/jul/31/potato-evolved-from-tomato-ancestor-researchers-find
2•defrost•5m ago•0 comments

Onshape – Product Development Platform

https://www.onshape.com/en/
1•kehiy•5m ago•0 comments

Quadratic Voting

https://www.radicalxchange.org/wiki/quadratic-voting/
1•xucian•5m ago•1 comments

Brightest explosion ever seen is still baffling astronomers

https://www.popsci.com/science/biggest-gamma-ray-burst-boat/
1•Bluestein•10m ago•0 comments

Subagents.sh – Share and discover Claude Code sub-agents

https://subagents.sh/
1•augmnt•12m ago•1 comments

Bbor62 – A compact binary-to-text compressor

https://github.com/goudvuur/bbor62
1•beligum•14m ago•1 comments

Top Anonymous Email Services for Privacy Lovers

https://cyble.com/knowledge-hub/anonymous-email-services-for-privacy/
1•cybleinc•19m ago•0 comments

Fujitsu starts development of 10000 plus superconducting quantum computer

https://global.fujitsu/en-global/newsroom/gl/2025/08/01-01
2•donutloop•19m ago•0 comments

I built a free, open-source security scanner with shareable dashboards

https://github.com/Huluti/Secrover
1•hugoposnic•20m ago•1 comments

US Energy Department misrepresents climate science in new report

https://phys.org/news/2025-08-energy-department-misrepresents-climate-science.html
1•OutOfHere•23m ago•0 comments

The Art of Parsing and Comparing Version Strings

https://secalerts.co/news/the-art-of-parsing-and-comparing-version-strings/7bVWMEyNBrMIbBmixgGVsI
2•louisstow•25m ago•0 comments

One diet soft drink daily may increase diabetes risk by more than a third

https://www.monash.edu/news/articles/one-can-of-artificially-sweetened-soft-drink-daily-may-increase-diabetes-risk-by-more-than-a-third
2•t0lo•25m ago•1 comments

Isle FPGA Computer

https://projectf.io/isle/fpga-computer.html
1•z303•27m ago•0 comments

Ask HN: How do I sandbox Gemini Code Assist on Mac from accessing other files?

1•nuker•33m ago•0 comments

China struggles to break its addiction to manufacturing [Financial Times]

https://www.ft.com/content/f7979a8f-874a-4b47-8304-d93d30171980
2•wuschel•39m ago•2 comments

Why Japanese Developers Write Code Differently – Why It Works Better

https://medium.com/@sohail_saifi/why-japanese-developers-write-code-completely-differently-and-why-it-works-better-de84d6244fab
1•zdkaster•41m ago•0 comments

Ubiquiti users report having access to others' UniFi routers, cameras (2023)

https://www.bleepingcomputer.com/news/security/ubiquiti-users-report-having-access-to-others-unifi-routers-cameras/
2•janandonly•43m ago•0 comments

How to Grow Human Bones

https://nautil.us/how-to-grow-human-bones-1227312/
1•dnetesn•50m ago•0 comments

Windows 10 at 10: How Microsoft led developers round in circles

https://www.theregister.com/2025/08/01/windows_10_dev_comment/
4•rntn•50m ago•0 comments

The First Lunar Road Trip

https://nautil.us/the-first-lunar-road-trip-1227738/
1•dnetesn•51m ago•0 comments

Show HN: Built sth that makes social media suck less

https://mc-web-feedme.framer.website/feedme
1•cbpark•53m ago•0 comments

OpenIPC is an alternative open firmware for your IP camera

https://openipc.org/
1•janandonly•54m ago•0 comments

Asdfg

https://skeptics.stackexchange.com/
1•vihavo•54m ago•0 comments