frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Realistic Synthetic Conversations for Testing LLMs

https://github.com/Channel-Labs/synthetic-conversation-generation
2•otterk10•7mo ago
Testing multi-turn conversational AI is tough, especially when you lack large volumes of real user data. Existing synthetic data tools often generate conversations that lack diversity and are not statistically representative, leading models to overfit synthetic patterns.

To help with this problem, I'm open-sourcing a synthetic conversation generation library. This library generates more realistic multi-conversations than other synthetic data libraries by using the following techniques:

  1. Decoupling Persona & Conversation Generation: This library first create diverse user personas, ensuring each new persona differs from the last. This builds a wide range of user types before generating conversations, tackling bias and improving coverage.

  2. Modeling Realistic Stopping Points: Instead of arbitrary turn limits, the library dynamically assesses if the user's goal is met or if they're frustrated, ending conversations naturally like real users would.
You can generate user personas tailored to your AI's specs and then simulate user messages using those personas. The library calls your AI endpoint (via a configurable HTTP definition) for responses during the simulation.

I built this because I needed a better way to test conversational agents for my clients, and found existing tools lacking in generating high-fidelity dialogues. Would love to hear your feedback and any suggestions!

Comments

badmonster•7mo ago
How does the library handle hallucination or off-topic drift during user simulation, especially when simulating frustration or goal completion? Are there mechanisms to detect and constrain unrealistic turns during generation?

Grok: 'It's basically the curlable Résumé.'

https://www.cssdesignawards.com/sites/kiarash-adl-portfolio/48521/
1•GPTVisionGod•3m ago•0 comments

Study: Effects of LLMs versus Web Search on Depth of Learning

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5104064
1•mustaphah•4m ago•0 comments

Managing Postgres Extensions with ImageVolume

https://www.gabrielebartolini.it/articles/2025/12/cnpg-recipe-23-managing-extensions-with-imagevo...
1•bo0tzz•5m ago•0 comments

DiskBBQ – Elasticsearch's vector storage format

https://www.elastic.co/search-labs/blog/diskbbq-elasticsearch-introduction
1•emschwartz•5m ago•0 comments

Docker Compose Models Explained

https://oneuptime.com/blog/post/2025-12-03-docker-compose-models-explained/view
2•ndhandala•8m ago•0 comments

Yoloe: Real-Time Seeing Anything

https://docs.ultralytics.com/models/yoloe/
1•ensocode•10m ago•0 comments

Putting email in its place with Emacs and Mu4e

https://eamonnsullivan.co.uk/posts-output/email-setup/2025-12-3-putting-email-in-its-place/
1•eamonnsullivan•11m ago•0 comments

Postgres CDC in ClickHouse, A year in review

https://clickhouse.com/blog/postgres-cdc-year-in-review-2025
1•saisrirampur•12m ago•0 comments

Constructing a JPEG XL MD5 hash quine

https://stackchk.fail/blog/jxl_hashquine_writeup
1•fanf2•12m ago•0 comments

NASA Rover Detects Electric Sparks in Mars Dust Devils, Storms

https://www.jpl.nasa.gov/news/nasa-rover-detects-electric-sparks-in-mars-dust-devils-storms/
1•mzs•12m ago•1 comments

3D-Printed Carotid Artery-on-Chips for Personalized Thrombosis Investigation

https://advanced.onlinelibrary.wiley.com/doi/10.1002/adma.202508890
1•PaulHoule•13m ago•0 comments

IoT and AI-driven solutions for human-wildlife conflict

https://www.sciencedirect.com/science/article/pii/S2772375525000620
1•ensocode•14m ago•0 comments

Support Techdirt's Uncompromising Coverage, Get Our First Commemorative Coin

https://www.techdirt.com/2025/12/03/support-techdirts-uncompromising-coverage-get-our-first-comme...
1•doener•14m ago•1 comments

Trump Returns to Gasoline as Fuel of Choice for Cars

https://www.nytimes.com/2025/12/03/climate/trump-fuel-economy-car-rules.html
3•thelastgallon•15m ago•1 comments

Using AI to generate alt text for 27000 images

https://www.ianlurie.com/digital-marketing/use-ai-to-generate-alt-text/
1•AznHisoka•16m ago•0 comments

City Of Winter: A roleplaying game about a fantastical family saga

https://sphaerenmeisters-spiele.de/City-of-Winter/en
1•doener•16m ago•0 comments

Ragas: Automated Evaluation of Retrieval Augmented Generation

https://arxiv.org/abs/2309.15217
1•Anon84•19m ago•0 comments

Show HN: Cross-platform network traffic monitoring system

https://github.com/mohyware/packet-meter
1•mohyware•20m ago•0 comments

Show HN: Rust Client Library for Gradium.ai TTS/STT API

https://github.com/cydanix/rust-gradium
2•irqlevel•21m ago•0 comments

RISC-V Oral History Panel [video]

https://www.youtube.com/watch?v=lbMIuX28d18
3•camel-cdr•22m ago•0 comments

Cloudflare Timeouts in Europe

https://www.cloudflarestatus.com/incidents/s35102g9syk0
4•decide1000•23m ago•4 comments

Trump admin funds geothermal network expansion

https://insideclimatenews.org/news/03122025/rare-win-for-renewable-energy-trump-administration-fu...
2•fghorow•24m ago•0 comments

Making Friends with Foreigners

https://shxlpa.substack.com/p/making-friends-with-foreigners
3•itarmonkey•29m ago•1 comments

Developing a New Electric Vehicle Sound

https://acoustics.org/developing-a-new-electric-vehicle-sound/
2•geox•30m ago•0 comments

The framework isn't for you. You're just along for the voyage

https://doingsoftwarewrong.com/blog/javascript-frameworks/
3•ChunkyAu•33m ago•0 comments

"Netflix killed casting from phones"

https://www.androidauthority.com/netflix-casting-chromecast-google-tv-streamer-3620784/
1•valgaze•34m ago•1 comments

I ran out of money, spent my savings on a Hong Kong prostitute,& became a commie

https://docs.google.com/document/d/1Am8bYA1aoXuSGFg7w7NjlHXFZiSAEt_oAVITPYdNRGo/edit?tab=t.0
3•jxmorris12•35m ago•1 comments

Viewable with Any Browser: Campaign

https://www.anybrowser.org/campaign/
1•4dm1r4lg3n3r4l•35m ago•0 comments

Finally; a genuinely good GUI for NetworkManager on Wayland

https://github.com/cachebag/nmrs
1•cachebag•36m ago•1 comments

Dynamic Custom Fields in Laravel Without Migrations: A Deep Dive

https://github.com/Relaticle/relaticle
2•birdculture•36m ago•0 comments