frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Realistic Synthetic Conversations for Testing LLMs

https://github.com/Channel-Labs/synthetic-conversation-generation
2•otterk10•7mo ago
Testing multi-turn conversational AI is tough, especially when you lack large volumes of real user data. Existing synthetic data tools often generate conversations that lack diversity and are not statistically representative, leading models to overfit synthetic patterns.

To help with this problem, I'm open-sourcing a synthetic conversation generation library. This library generates more realistic multi-conversations than other synthetic data libraries by using the following techniques:

  1. Decoupling Persona & Conversation Generation: This library first create diverse user personas, ensuring each new persona differs from the last. This builds a wide range of user types before generating conversations, tackling bias and improving coverage.

  2. Modeling Realistic Stopping Points: Instead of arbitrary turn limits, the library dynamically assesses if the user's goal is met or if they're frustrated, ending conversations naturally like real users would.
You can generate user personas tailored to your AI's specs and then simulate user messages using those personas. The library calls your AI endpoint (via a configurable HTTP definition) for responses during the simulation.

I built this because I needed a better way to test conversational agents for my clients, and found existing tools lacking in generating high-fidelity dialogues. Would love to hear your feedback and any suggestions!

Comments

badmonster•7mo ago
How does the library handle hallucination or off-topic drift during user simulation, especially when simulating frustration or goal completion? Are there mechanisms to detect and constrain unrealistic turns during generation?

Arc Prize 2025 Results and Analysis: Year of the Refinement Loop

https://arcprize.org/blog/arc-prize-2025-results-analysis
2•frozenseven•3m ago•0 comments

Agent Harnesses Are Just Shells

https://pavpanchekha.com/blog/agents-are-shells.html
2•yurivish•4m ago•0 comments

Wall Street Races to Cut Its Risk from AI's Borrowing Binge

https://www.bloomberg.com/news/articles/2025-12-05/wall-street-races-to-cut-its-risk-from-ai-s-bo...
1•zerosizedweasle•5m ago•0 comments

I got Claude and ChatGPT to stop being sycophantic cheerleaders

https://medium.com/@scott_waddell/how-i-got-claude-and-chatgpt-to-stop-being-sycophantic-cheerlea...
1•scott_waddell•8m ago•0 comments

A Golden Land? Questioning Frontiers, Fantasies, Fulfillment in the Northwest

https://lithub.com/a-golden-land-questioning-frontiers-fantasies-and-fulfillment-in-the-pacific-n...
1•felixbraun•10m ago•0 comments

Show HN: FlowCoder – Flowcharts for "Programming" Claude Code and Codex

https://github.com/px-pride/flowcoder
1•px_pride•12m ago•0 comments

Evaluating TCP BBRv2 on the Dropbox edge network

https://arxiv.org/abs/2008.07699
1•fanf2•14m ago•0 comments

New 'physics shortcut' lets laptops tackle quantum problems

https://www.livescience.com/technology/computing/new-physics-shortcut-lets-laptops-tackle-quantum...
1•jhncls•15m ago•0 comments

National Security Strategy of the United States of America [pdf]

https://www.whitehouse.gov/wp-content/uploads/2025/12/2025-National-Security-Strategy.pdf
2•TechTechTech•16m ago•2 comments

Ask HN: "Freelancer? Seeking freelancer?" threads gone?

1•AlexITC•16m ago•0 comments

Show HN: Sloppylint – A linter for AI-generated Python code

https://github.com/rsionnach/sloppylint
1•kyub•20m ago•0 comments

Show HN: A new AI driven task management tool

https://thebraindump.azurewebsites.net
1•vijaym1979•21m ago•0 comments

Show HN: BinaryStorage – High-performance PHP binary key/value store

https://github.com/olivier-ls/binary-storage-php
1•asmodios•22m ago•0 comments

Brendan Gregg Leaving Intel

https://www.brendangregg.com/blog/2025-12-05/leaving-intel.html
5•blankx32•24m ago•0 comments

Frank Gehry Died

https://www.bbc.co.uk/news/articles/c5y2p22z9gno
2•ksajadi•25m ago•1 comments

Mystery triggers earthquake sensors causing false alert (M5.9) in Bay Area

https://www.sfgate.com/bayarea/article/mysterious-trigger-earthquake-sensors-calif-21225679.php
1•boringg•25m ago•0 comments

React2Shell free hands-on lab: learn to exploit and detect

https://tryhackme.com/room/react2shellcve202555182
1•realtryhackme•26m ago•0 comments

A $20 drug in Europe requires a prescription and $800 in the U.S.

https://www.statnews.com/2025/10/31/why-miebo-costs-40-times-more-than-its-european-version/
22•geox•29m ago•3 comments

Leaving Intel

https://www.brendangregg.com/blog//2025-12-05/leaving-intel.html
4•speckx•29m ago•1 comments

Perpetual Futures

https://www.bitsaboutmoney.com/archive/perpetual-futures-explained/
2•sirodoht•33m ago•0 comments

Uncertainty Under Ignorance

https://github.com/eb4890/IntegrityUnderIgnorance/blob/main/README.md
1•rando77•41m ago•0 comments

Show HN: A Call of Duty event clipper and compilation maker using Python and AI

https://github.com/karimm-ai/NiceShot_AI
1•niceshot-ai•42m ago•0 comments

Predicting the Past: AI for Ancient Texts

https://predictingthepast.com/
1•tesserato•42m ago•0 comments

Ask HN: Bullied by Indian compliance automation platform

2•bitlad•44m ago•0 comments

Ask HN: How is you and your team are using AI?

1•IdontKnowRust•44m ago•0 comments

The Anatomy of a Triton Attention Kernel

https://arxiv.org/abs/2511.11581
2•PaulHoule•46m ago•0 comments

The missing standard library for multithreading in JavaScript

https://github.com/W4G1/multithreading
5•W4G1•47m ago•0 comments

Unity 6.3 LTS is now available

https://unity.com/blog/unity-6-3-lts-is-now-available
1•binarynate•47m ago•0 comments

Show HN: Vibe Code WP Plugins

https://steem.dev/
1•fasthightimess•48m ago•0 comments

GLP-1 Drugs, Psilocybin Mushrooms, and the Case for Sublingual Psilocin

https://psychedelicstoday.com/2025/08/05/glp-1-drugs-psilocybin-mushrooms-and-the-case-for-sublin...
1•toomuchtodo•49m ago•0 comments