Show HN: Realistic Synthetic Conversations for Testing LLMs

https://github.com/Channel-Labs/synthetic-conversation-generation
2•otterk10•6mo ago
Testing multi-turn conversational AI is tough, especially when you lack large volumes of real user data. Existing synthetic data tools often generate conversations that lack diversity and are not statistically representative, leading models to overfit synthetic patterns.

To help with this problem, I'm open-sourcing a synthetic conversation generation library. It generates more realistic multi-turn conversations than other synthetic data libraries by using the following techniques:

  1. Decoupling Persona & Conversation Generation: The library first creates diverse user personas, ensuring each new persona differs from the last. This builds a wide range of user types before any conversations are generated, tackling bias and improving coverage.

  2. Modeling Realistic Stopping Points: Instead of imposing arbitrary turn limits, the library dynamically assesses whether the user's goal has been met or whether they're frustrated, ending conversations naturally, as real users would.

You can generate user personas tailored to your AI's specs and then simulate user messages using those personas. The library calls your AI endpoint (via a configurable HTTP definition) for responses during the simulation.
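
As a rough illustration of the two ideas above, here is a minimal Python sketch. It is not the library's actual API: `Persona`, `generate_personas`, `simulate`, and `demo_assistant` are hypothetical names made up for this example. Persona diversity is approximated by skipping duplicate goals, the goal and frustration checks are simple string and counter tests rather than model-based assessments, and a plain function stands in for the HTTP call to your AI endpoint.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Persona:
    # All fields are hypothetical; a real persona would be far richer.
    name: str
    goal: str            # what the simulated user wants
    success_phrase: str  # toy signal that the goal was met
    patience: int        # unhelpful replies tolerated before giving up


@dataclass
class Conversation:
    persona: Persona
    turns: List[Tuple[str, str]] = field(default_factory=list)
    outcome: str = "open"


def generate_personas(candidates: List[Persona], n: int) -> List[Persona]:
    """Step 1: pick up to n personas before any conversation is generated.

    Assumption: "differs from the last" is reduced here to "has a goal
    that no previously chosen persona has".
    """
    chosen: List[Persona] = []
    seen_goals = set()
    for persona in candidates:
        if persona.goal in seen_goals:
            continue
        chosen.append(persona)
        seen_goals.add(persona.goal)
        if len(chosen) == n:
            break
    return chosen


def simulate(persona: Persona,
             assistant: Callable[[str], str],
             max_turns: int = 20) -> Conversation:
    """Step 2: run one conversation and stop when the user would stop.

    max_turns is only a safety cap; the normal exits are goal completion
    or frustration.
    """
    convo = Conversation(persona)
    frustration = 0
    user_msg = f"Hi, I need help: {persona.goal}"
    for _ in range(max_turns):
        reply = assistant(user_msg)  # stand-in for the HTTP call
        convo.turns.append((user_msg, reply))
        if persona.success_phrase in reply.lower():
            convo.outcome = "goal_met"
            break
        frustration += 1
        if frustration >= persona.patience:
            convo.outcome = "frustrated"
            break
        user_msg = f"That did not help. I still need: {persona.goal}"
    else:
        # Loop ended without a natural stopping point.
        convo.outcome = "turn_cap"
    return convo


def demo_assistant(message: str) -> str:
    """Toy assistant that only knows how to handle password requests."""
    if "password" in message.lower():
        return "I've emailed you a reset link."
    return "Sorry, I can't help with that."


if __name__ == "__main__":
    pool = [
        Persona("Ana", "reset my password", "reset link", 3),
        Persona("Bo", "reset my password", "reset link", 1),
        Persona("Cy", "cancel my subscription", "cancelled", 2),
    ]
    for p in generate_personas(pool, 2):
        result = simulate(p, demo_assistant)
        print(p.name, result.outcome, len(result.turns))
```

In this sketch the persona pool is fixed up front and conversations are generated afterwards, which mirrors the decoupling described in point 1. Swapping `demo_assistant` for a function that posts to your own endpoint would exercise a real system under the same stopping rules.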

I built this because I needed a better way to test conversational agents for my clients, and found existing tools lacking in generating high-fidelity dialogues. Would love to hear your feedback and any suggestions!

Comments

badmonster•6mo ago
How does the library handle hallucination or off-topic drift during user simulation, especially when simulating frustration or goal completion? Are there mechanisms to detect and constrain unrealistic turns during generation?
