Show HN: Relai-SDK – simulate → evaluate → optimize AI agents

https://github.com/relai-ai/relai-sdk
4•sfeizi•18h ago
What: relai-sdk is an open-source toolkit for making AI agents reliable through a complete learning loop (simulate → evaluate → optimize).

Why: Agent runs are stochastic and tool calls fail, which makes problems hard to reproduce, measure, and fix at scale. It's also hard to align behavior with goals across output quality/format, cost, and latency. We need a loop that integrates user feedback and LLM evaluators directly into the agent code (prompts, configs, models, graphs) without overfitting.

How:
- Simulation: LLM personas, mocked MCP servers/tools, synthetic data; can condition on real traces
- Evaluation: code-based + LLM-based evaluators; turn human reviews into optimization-ready benchmarks
- Optimization with Maestro: tune prompts, configs, and even the agent graph for improved quality, cost, and latency
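
As a rough illustration of the simulate → evaluate → optimize loop described above, here is a toy sketch in plain Python. It does not use the relai API: every name in it (`AgentConfig`, `toy_agent`, `simulate`, `evaluate`, `optimize`) is hypothetical, the "personas" are fixed strings rather than LLM-generated, the evaluator is a simple code-based score, and the optimizer is a plain search over a few candidate configs.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class AgentConfig:
    greeting: str   # stand-in for a tunable prompt fragment (hypothetical)
    max_words: int  # stand-in for a tunable setting (hypothetical)


def toy_agent(config: AgentConfig, user_message: str) -> str:
    """Hypothetical agent: echoes the request, truncated to max_words."""
    words = f"{config.greeting} You asked: {user_message}".split()
    return " ".join(words[: config.max_words])


def simulate() -> list[str]:
    """Simulation step: fixed messages stand in for simulated personas."""
    return [
        "reset my password",
        "cancel my subscription and refund last month",
        "where is my order",
    ]


def evaluate(user_message: str, reply: str) -> float:
    """Code-based evaluator: reward covering the request, penalize length."""
    asked = set(user_message.split())
    covered = len(asked & set(reply.split())) / len(asked)
    length_penalty = 0.01 * len(reply.split())
    return covered - length_penalty


def optimize(candidates: list[AgentConfig]) -> tuple[AgentConfig, float]:
    """Optimization step: keep the config with the best mean score."""
    messages = simulate()
    scored = [
        (cfg, mean(evaluate(m, toy_agent(cfg, m)) for m in messages))
        for cfg in candidates
    ]
    return max(scored, key=lambda pair: pair[1])


candidates = [
    AgentConfig("Hi!", 5),
    AgentConfig("Hi!", 12),
    AgentConfig("Hello there, thanks for reaching out!", 12),
]
best, score = optimize(candidates)
print(best, round(score, 3))
```

The point of the sketch is only the shape of the loop: the evaluator trades off quality (coverage of the request) against a cost proxy (reply length), and the optimizer selects the configuration that does best across all simulated inputs rather than on a single one.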

Try it: pip install relai

GitHub: https://github.com/relai-ai/relai-sdk

Docs: https://docs.relai.ai/ (2-min overview: https://youtu.be/qKsJUD_KP40)

Looking for feedback on:
- Where graph-level suggestions help (beyond prompt tuning)
- Evaluator signals you rely on for reliability (and what we're missing)
- Simulation setups/environments you'd want out of the box

Notes: Founder here. Happy to share internals, tradeoffs, and limitations.

Works with LangGraph, OpenAI Agents, Google ADK, and others. The SDK is licensed under Apache-2.0.

Show HN: Bash Screensavers

https://github.com/attogram/bash-screensavers
4•attogram•23m ago•1 comment

Show HN: Ordered – A sorted collection library for Zig

12•habedi0•6h ago•5 comments

Show HN: JSON Query

https://jsonquerylang.org/
137•wofo•19h ago•64 comments

Show HN: I was tired of people dmming me just "hi", so I made this - NoGreeting

https://nogreeting.kuber.studio
3•kuberwastaken•2h ago•0 comments

Show HN: Dlog – Journaling and AI coach that learns what drives wellbeing (Mac)

https://dlog.pro/
39•dr-j•18h ago•19 comments

Show HN: Erdos – open-source, AI data science IDE

https://www.lotas.ai/erdos
77•jorgeoguerra•19h ago•31 comments

Show HN: Git Auto Commit (GAC) – LLM-powered Git commit command line tool

https://github.com/cellwebb/gac
50•merge-conflict•18h ago•30 comments

Show HN: Write Go code in JavaScript files

https://www.npmjs.com/package/vite-plugin-use-golang
148•yar-kravtsov•1d ago•44 comments

Show HN: MyraOS – My 32-bit operating system in C and ASM (Hack Club project)

https://github.com/dvir-biton/MyraOS
237•dvirbt•1d ago•54 comments

Show HN: Interactive RISC-V CPU Visualizer (Sequential and Pipelined)

https://mostlykiguess.github.io/RISC-V-Processor-Implementation/
2•mostlyk•5h ago•0 comments

Show HN: Linux Smart Directories Navigation

https://github.com/abdulbadii/smart-directories-navigation
3•dogol•8h ago•0 comments

Show HN: Helium Browser for Android with extensions support, based on Vanadium

https://github.com/jqssun/android-helium-browser
61•jqssun•1d ago•29 comments

Show HN: Shadcn/UI theme editor – Design and share Shadcn themes

https://shadcnthemer.com
132•miketromba•2d ago•40 comments

Show HN: nblm - Rust CLI/Python SDK for NotebookLM Enterprise automation

https://github.com/K-dash/nblm-rs
9•K-dash•20h ago•0 comments

Show HN: Diagram as code tool with draggable customizations

https://github.com/RohanAdwankar/oxdraw
252•RohanAdwankar•2d ago•59 comments

Show HN: LLM Rescuer – Fixing the billion dollar mistake in Ruby

https://github.com/barodeur/llm_rescuer
94•barodeur•3d ago•14 comments

Show HN: Easily visualize torch, Jax, tf, NumPy, etc. tensors

https://github.com/hardik-vala/tensordiagram
2•HardikVala•14h ago•0 comments

Show HN: Whatdidido – CLI to summarize your work from Jira/Linear

https://github.com/oliviersm199/whatdidido
6•olism•20h ago•0 comments

Show HN: Action Engine — An API/Agent Buildkit Putting Flexibility First

https://actionengine.dev/
2•helenapankov•15h ago•3 comments

Show HN: Learn Basic Chess Movements

https://www.minichessgames.com/?hn
11•patrickdavey•1d ago•1 comment

Show HN: TrueType Rasterizer

https://github.com/StanislavPetrov15/TT_Rasterizer
3•s_petrov•16h ago•0 comments

Show HN: Vetr.is – Privacy-First Cloud in Iceland

https://vetr.is
4•falkensmaze66•16h ago•1 comment

Show HN: LinkPatrol – Free merchant-agnostic tool to find broken affiliate links

https://www.linkpatrol.in
2•deepu_bansal87•18h ago•1 comment

Show HN: Omnia OS, the Most Efficient Email Client Without AI

https://omniaos.co/
5•jmchugh9•19h ago•0 comments

Show HN: OpenSkills - Run Claude Skills Locally Using Any LLM

https://github.com/BandarLabs/open-skills
6•mkagenius•19h ago•0 comments

Show HN: ChatHawk – Stop Copy-Pasting the Same Question Across Every AI Model

https://chathawk.co
4•chadlad101•20h ago•0 comments

Show HN: Chonky – a neural text semantic chunking goes multilingual

https://huggingface.co/mirth/chonky_mmbert_small_multilingual_1
42•hessdalenlight•2d ago•5 comments

Show HN: Ubik - A new way to use AI in citation-based work and research

https://www.ubik.studio
3•ieuanking•20h ago•0 comments

Show HN: Pinpam, TPM2-backed pin authentication for Linux

https://github.com/RazeLighter777/pinpam
3•razighter777•21h ago•0 comments