Show HN: Realistic Synthetic Conversations for Testing LLMs

https://github.com/Channel-Labs/synthetic-conversation-generation
2•otterk10•5mo ago
Testing multi-turn conversational AI is tough, especially when you lack large volumes of real user data. Existing synthetic data tools often generate conversations that lack diversity and are not statistically representative, leading models to overfit synthetic patterns.

To help with this problem, I'm open-sourcing a synthetic conversation generation library. It generates more realistic multi-turn conversations than other synthetic data libraries by using the following techniques:

  1. Decoupling Persona & Conversation Generation: The library first creates diverse user personas, ensuring each new persona differs from the last. This builds a wide range of user types before generating conversations, tackling bias and improving coverage.

  2. Modeling Realistic Stopping Points: Instead of arbitrary turn limits, the library dynamically assesses if the user's goal is met or if they're frustrated, ending conversations naturally like real users would.

You can generate user personas tailored to your AI's specs and then simulate user messages using those personas. The library calls your AI endpoint (via a configurable HTTP definition) for responses during the simulation.
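
To make the two ideas above concrete, here is a minimal sketch in plain Python. It is not the library's actual API: `Persona`, `generate_personas`, `simulate`, and the `patience` field are hypothetical names made up for illustration. A simple callable stands in for the HTTP call to your AI endpoint, and string checks stand in for the LLM-based judgments of persona diversity, goal completion, and frustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Persona:
    # Hypothetical persona shape, for illustration only.
    name: str
    goal: str
    patience: int  # unhelpful replies tolerated before giving up


def generate_personas(candidates: List[Persona], n: int) -> List[Persona]:
    """Step 1: build the persona set first, keeping a candidate only if
    its goal differs from every persona already chosen."""
    chosen: List[Persona] = []
    for candidate in candidates:
        if all(candidate.goal != kept.goal for kept in chosen):
            chosen.append(candidate)
        if len(chosen) == n:
            break
    return chosen


def simulate(
    persona: Persona,
    assistant: Callable[[str], str],
    max_turns: int = 20,
) -> Tuple[List[Tuple[str, str]], str]:
    """Step 2: run one conversation and stop when the goal is met or the
    user runs out of patience. max_turns is only a safety cap."""
    transcript: List[Tuple[str, str]] = []
    unhelpful = 0
    for _ in range(max_turns):
        user_msg = f"I need help with: {persona.goal}"
        reply = assistant(user_msg)  # stand-in for the HTTP call
        transcript.append((user_msg, reply))
        if persona.goal in reply:  # stand-in for a "goal met" judgment
            return transcript, "goal_met"
        unhelpful += 1
        if unhelpful >= persona.patience:  # stand-in for "frustrated"
            return transcript, "frustrated"
    return transcript, "max_turns"


if __name__ == "__main__":
    pool = [
        Persona("a", "reset password", 2),
        Persona("b", "reset password", 3),  # duplicate goal, skipped
        Persona("c", "cancel order", 1),
    ]
    personas = generate_personas(pool, 2)

    def stub_assistant(message: str) -> str:
        return "Sorry, I can't help with that."

    for persona in personas:
        turns, reason = simulate(persona, stub_assistant)
        print(persona.name, len(turns), reason)
```

The point of the sketch is the shape of the loop: personas are fixed before any conversation starts, and the exit condition depends on the state of the dialogue rather than on a fixed turn count.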

I built this because I needed a better way to test conversational agents for my clients, and found existing tools lacking in generating high-fidelity dialogues. Would love to hear your feedback and any suggestions!

Comments

badmonster•5mo ago
How does the library handle hallucination or off-topic drift during user simulation, especially when simulating frustration or goal completion? Are there mechanisms to detect and constrain unrealistic turns during generation?
