Ask HN: What metrics do you track when building agents?

2•rstagi•3h ago
The main metric I look at is inference cost, which is a pretty good indicator when something's off.

However, a couple of days ago I discovered we were loading ~300 tools due to a bug, which affected both cost and accuracy. It had been going on for some time, and I didn't catch it by looking at the costs because it was masked by other changes I made in the same period.

Now I've started tracking that and some other metrics, but it made me wonder what else I might be missing. So: what are you tracking? Any tips? Curious to hear.
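
To make the question concrete, here is a minimal sketch of the kind of check that could have caught the tool-count bug even while cost looked normal. Everything in it is an assumption for illustration: the `RunMetrics` fields, the `flag_anomalies` helper, the sample numbers, and the 2x-median threshold are hypothetical, not something from my actual setup.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class RunMetrics:
    """Per-request numbers recorded for one agent run (hypothetical fields)."""
    tools_loaded: int    # tool definitions sent to the model
    prompt_tokens: int   # input tokens for the request
    cost_usd: float      # inference cost of the request


METRIC_NAMES = ("tools_loaded", "prompt_tokens", "cost_usd")


def flag_anomalies(history, current, ratio=2.0):
    """Return the metrics in `current` that exceed `ratio` times the
    median of the same metric over `history`."""
    flagged = []
    for name in METRIC_NAMES:
        baseline = median(getattr(run, name) for run in history)
        if baseline > 0 and getattr(current, name) > ratio * baseline:
            flagged.append(name)
    return flagged


# Made-up baseline runs.
history = [
    RunMetrics(tools_loaded=12, prompt_tokens=3_000, cost_usd=0.020),
    RunMetrics(tools_loaded=14, prompt_tokens=3_200, cost_usd=0.022),
    RunMetrics(tools_loaded=12, prompt_tokens=2_900, cost_usd=0.019),
]

# Tool count explodes, but an unrelated change (say, a cheaper model)
# keeps the cost close to the baseline.
current = RunMetrics(tools_loaded=300, prompt_tokens=40_000, cost_usd=0.024)

print(flag_anomalies(history, current))
```

The point of checking each metric against its own baseline is that a cost-only view can stay flat while tool count and prompt size drift far from normal.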

Tell HN: I don't trust Bigco AI agents with AI research IP

5•botencat•30m ago•1 comment

Ask HN: What do you do with the user guide of the universe?

2•KenographerPrim•48m ago•4 comments

Ask HN: How to get a non-technical friend into tech?

4•yesitcan•1h ago•3 comments

Ask HN: Would you trust encryption at the mobile keyboard layer?

3•dkatsura•2h ago•1 comment

Ask HN: Why are the upvote arrows so small?

3•guilhermeasper•3h ago•4 comments

Ask HN: Is anyone experimenting with different ways of using LLMs for coding?

194•yehiaabdelm•2d ago•194 comments

Ask HN: What metrics do you track when building agents?

2•rstagi•3h ago•0 comments

Ask HN: How Do You "Not Write Any Code by Hand" with a Token Budget?

3•mc-0•4h ago•0 comments

Blog with AI Auto Poster

3•david3289•8h ago•0 comments

Ask HN: Good fast IDE for reading and navigating code in multiple languages

5•akkad33•13h ago•7 comments

LongCat-2.0

5•rika321•13h ago•0 comments

Tell HN: FileVault does not protect Wi-Fi passwords on macOS 26

3•turbidimeter•9h ago•1 comment

Ask HN: Who is hiring? (July 2026)

243•whoishiring•4d ago•326 comments

Ask HN: Who wants to be hired? (July 2026)

150•whoishiring•4d ago•461 comments

Happy Independence Day

33•GauntletWizard•1d ago•8 comments

Ask HN: Since when does Craigslist's front page have emojis?

39•argee•5d ago•33 comments

Ask HN: America turns 250 today. What does it mean to you?

12•abixb•1d ago•6 comments

I can build anything, but only the void sees it

9•urbanogt5•1d ago•22 comments

Tell HN: Megalodon.jp is faster than archive.today and doesn't require reCAPTCHA

5•Cider9986•23h ago•2 comments

Ask HN: How Do You Connect OpenAI Secure MCP Tunnel with Claude Desktop

3•mcpzero•17h ago•0 comments

Ask HN: Why are so many "AI evangelists" posting such insufferable content?

66•seattle_spring•3d ago•37 comments

Tell HN: Installing Cursor on iOS irreversibly changes your privacy settings

248•zkldi•5d ago•34 comments

How many failed startups have you launched?

18•steelebillings•2d ago•12 comments

Tell HN: Old Reddit now requires login

87•jay_kyburz•4d ago•18 comments

Where can I find or get in contact with farmers specifically in the US?

6•strapchay•1d ago•6 comments

I built an environment reloader for Windows Shells

3•byjonas•1d ago•0 comments

Retrieval is not the future of AI – if it was, Google would have won already

4•lamprouge•1d ago•2 comments

Ask HN: Where are the good search engines for mathematical formulas?

3•lo0dot0•1d ago•1 comment

Fable 5. Safety Taken to an Extreme

10•sergeysmirnov•1d ago•8 comments

Tell HN: Fewer PRs done with proper prompting, review, and refinement wins

7•tomerbd•2d ago•4 comments