Show HN: Trainly – Free 72-hour audit of your AI agent's production traces

https://www.trainlyai.com/audit

5•kavin_key•2h ago

Comments

kavin_key•2h ago

Hey HN, Kavin here, co-founder of Trainly. We built observability for AI agents, and the hardest part of selling it has been getting people to believe they have a problem. "My agent works fine" is the universal answer, right up until you actually look at the traces. So we're giving the diagnostic part away. Drop in our SDK with a one-line @observe decorator (or prompt Claude Code to add it), let it run for 72 hours, and we'll send back a report on what we found: silent tool failures, retry loops, latency and cost outliers, error patterns by input shape, and any obvious behavioral weirdness in the data.
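For readers who haven't used decorator-based tracing, here is a rough sketch of the general pattern an @observe-style decorator follows. This is not Trainly's SDK: the `observe` implementation, the in-memory `TRACES` list, and the `lookup_weather` tool are all hypothetical, and a real SDK would ship traces to a backend instead of appending to a list.

```python
import functools
import time

# Hypothetical in-memory sink, standing in for a network exporter.
TRACES = []


def observe(fn):
    """Record one trace per call: function name, latency, and error (if any)."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        error = None
        try:
            return fn(*args, **kwargs)
        except Exception as exc:
            error = repr(exc)
            raise  # the caller still sees the original exception
        finally:
            try:
                TRACES.append({
                    "name": fn.__name__,
                    "latency_s": time.perf_counter() - start,
                    "error": error,
                })
            except Exception:
                pass  # a tracing failure must never break the app
    return wrapper


# Hypothetical agent tool, used only to show the one-line opt-in.
@observe
def lookup_weather(city):
    if not city:
        raise ValueError("empty city")
    return {"city": city, "temp_c": 21}
```

The point of the design is that tracing lives in the `finally` branch: a failed tool call is recorded with its error, and the exception still propagates unchanged to the caller.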

A few technical notes:

The audit itself is 100% heuristic: no LLM calls on our end, just queries over the traces you send us. Your prompts aren't ending up in anyone's context window by accident. The trace cap is 10k per audit. After that, the API key auto-disables at the auth layer and the SDK silently stops tracing, so it won't break your app.
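To make "heuristic, just queries over the traces" concrete, here is a minimal sketch of two such checks: retry loops and latency outliers. The trace schema (`run_id`, `tool`, `args`, `latency_s`), the thresholds, and both function names are assumptions made for illustration, not Trainly's actual rules or data model. Traces are assumed to arrive in call order.

```python
from collections import defaultdict
from statistics import median


def find_retry_loops(traces, min_repeats=3):
    """Flag (run_id, tool) pairs where the same tool is called with
    identical args at least min_repeats times in a row within one run."""
    by_run = defaultdict(list)
    for t in traces:
        by_run[t["run_id"]].append(t)

    flagged = []
    for run_id, calls in by_run.items():
        streak = 1
        for prev, cur in zip(calls, calls[1:]):
            same = (prev["tool"], prev["args"]) == (cur["tool"], cur["args"])
            streak = streak + 1 if same else 1
            key = (run_id, cur["tool"])
            if streak >= min_repeats and key not in flagged:
                flagged.append(key)
    return flagged


def find_latency_outliers(traces, factor=5.0):
    """Flag traces slower than factor times the median latency of their tool."""
    by_tool = defaultdict(list)
    for t in traces:
        by_tool[t["tool"]].append(t["latency_s"])
    medians = {tool: median(values) for tool, values in by_tool.items()}
    return [t for t in traces if t["latency_s"] > factor * medians[t["tool"]]]
```

Neither check reads prompt content; both only group and compare fields of the trace records, which is why no model call is needed for this kind of pass.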

The paid product layers unsupervised semantic anomaly detection on top: HDBSCAN over joint embeddings and behavioral features, UMAP for visualization, and LLM-generated cluster summaries. The audit is the heuristic pass; it'll surface the obvious stuff, not the subtle drift.

First 50 audits are free. Email + SDK install, no call required.

Context: we're pre-first-customer. Part of why I'm doing this is that I want real agent traces to stress-test the anomaly detection against, and part is that cold outreach is slow and I'd rather have 20 of you stress-test the product than spend another month in LinkedIn DMs. If the report is useless, tell me why.

Link: https://trainlyai.com/audit

Happy to answer anything: infra, pricing, why not just use Braintrust, whatever.

Show HN: Broccoli, one shot coding agent on the cloud

https://github.com/besimple-oss/broccoli

32•yzhong94•4h ago•28 comments

Show HN: I built a map of the GeminiNet

https://rbtms.github.io/gemini_map/

2•rbtms•30m ago•0 comments

Show HN: One ESLint rule to kill the "ChatGPT em dash" in your codebase

https://github.com/oleg-koval/drop-em-dash-eslint-rule

2•orthodoz•1h ago•1 comments

Show HN: Everest Drive – a multiplayer spaceship crew simulator in the browser

https://everestdrive.io/

4•jakej256•3h ago•2 comments

Show HN: GoModel – an open-source AI gateway in Go

https://github.com/ENTERPILOT/GOModel/

193•santiago-pl•1d ago•71 comments

Show HN: Netlify for Agents

https://netlify.ai

8•bobfunk•4h ago•3 comments

Show HN: A free tool for non-technical folks to easily publish a website

https://weejur.com

2•npilk•4h ago•4 comments

Show HN: Ctx – a /resume that works across Claude Code and Codex

https://github.com/dchu917/ctx

71•dchu17•2d ago•27 comments

Show HN: VidStudio, a browser based video editor that doesn't upload your files

https://vidstudio.app/video-editor

291•kolx•1d ago•104 comments

Show HN: Backlit Keyboard API for Python

https://github.com/itsmeadarsh2008/backlit-kbd

27•itsmeadarsh•3d ago•5 comments

Show HN: Daemons – we pivoted from building agents to cleaning up after them

https://charlielabs.ai/

64•rileyt•1d ago•31 comments

Show HN: Mediator.ai – Using Nash bargaining and LLMs to systematize fairness

https://mediator.ai/

154•sanity•2d ago•74 comments

Show HN: Ohita – a tool to simplify API key management for AI agents

https://ohita.tech/

2•jusasiiv•6h ago•0 comments

Show HN: Aide – A customizable Android assistant (voice, choose your provider)

https://aideassistant.com/

6•yincrash•17h ago•4 comments

Show HN: Almanac MCP, turn Claude Code into a Deep Research agent

https://www.openalmanac.org/

13•rohans0509•22h ago•1 comments

Show HN: Prompt-to-Excalidraw demo with Gemma 4 E2B in the browser (3.1GB)

https://teamchong.github.io/turboquant-wasm/draw.html

159•teamchong•3d ago•62 comments

Show HN: Holos – QEMU/KVM with a compose-style YAML, GPUs and health checks

https://github.com/zeroecco/holos

55•zeroecco•1d ago•23 comments

Show HN: FMQL – graph query and bulk-edit CLI for Markdown and YAML frontmatter

https://github.com/buyuk-dev/fmql

5•buyukdev•1d ago•1 comments

Show HN: MDV – a Markdown superset for docs, dashboards, and slides with data

https://github.com/drasimwagan/mdv

150•drasim•4d ago•53 comments

Show HN: Irregular German Verbs – a simple app, no ads or tracking

https://bacist.com/german-irregular-verbs-app/

5•baCist•15h ago•3 comments

Show HN: No JavaScript Club

https://nojs.club/

6•basilikum•22h ago•3 comments

Show HN: Run TRELLIS.2 Image-to-3D generation natively on Apple Silicon

https://github.com/shivampkumar/trellis-mac

201•shivampkumar•2d ago•35 comments

Show HN: Gemini Plugin for Claude Code

https://github.com/m-ghalib/gemini-plugin-cc

10•morawr•16h ago•3 comments

Show HN: WeTransfer Alternative for Developers

https://dlvr.sh/

22•mariusbolik•1d ago•8 comments

Show HN: Faceoff – A terminal UI for following NHL games

https://www.vincentgregoire.com/faceoff/

133•vcf•3d ago•45 comments

Show HN: Open Chronicle – Local Screen Memory for Claude Code and Codex CLI

https://github.com/Screenata/open-chronicle

5•taoh•17h ago•1 comments

Show HN: GBrain, an AI tool for diagnosis and therapy for neurodivergents

https://www.neuroplusgbrain.net/

3•FDX2018•17h ago•2 comments

Show HN: MemFactory: Unified Inference and Training Framework for Agent Memory

https://arxiv.org/abs/2603.29493

8•MemTensor•18h ago•0 comments

Show HN: Hydra – Never stop coding when your AI CLI hits a rate limit

https://github.com/saadnvd1/hydra

7•saadn92•1d ago•2 comments