frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Realistic Synthetic Conversations for Testing LLMs

https://github.com/Channel-Labs/synthetic-conversation-generation
2•otterk10•7mo ago
Testing multi-turn conversational AI is tough, especially when you lack large volumes of real user data. Existing synthetic data tools often generate conversations that lack diversity and are not statistically representative, leading models to overfit synthetic patterns.

To help with this problem, I'm open-sourcing a synthetic conversation generation library. This library generates more realistic multi-conversations than other synthetic data libraries by using the following techniques:

  1. Decoupling Persona & Conversation Generation: This library first create diverse user personas, ensuring each new persona differs from the last. This builds a wide range of user types before generating conversations, tackling bias and improving coverage.

  2. Modeling Realistic Stopping Points: Instead of arbitrary turn limits, the library dynamically assesses if the user's goal is met or if they're frustrated, ending conversations naturally like real users would.
You can generate user personas tailored to your AI's specs and then simulate user messages using those personas. The library calls your AI endpoint (via a configurable HTTP definition) for responses during the simulation.

I built this because I needed a better way to test conversational agents for my clients, and found existing tools lacking in generating high-fidelity dialogues. Would love to hear your feedback and any suggestions!

Comments

badmonster•7mo ago
How does the library handle hallucination or off-topic drift during user simulation, especially when simulating frustration or goal completion? Are there mechanisms to detect and constrain unrealistic turns during generation?

Judicial Malfeasance and Palestine Action

https://www.craigmurray.org.uk/archives/2025/11/judicial-malfeasance-and-palestine-action/
1•jjgreen•32s ago•0 comments

Security Flaws in DeepSeek-Generated Code Linked to Political Triggers

https://www.crowdstrike.com/en-us/blog/crowdstrike-researchers-identify-hidden-vulnerabilities-ai...
2•shalmanese•4m ago•0 comments

$9 author sues over 384M Indian Jones movie

https://www.google.com/search?q=+author+sues+over+384M+Indian+Jones+movie
1•asdefghyk•6m ago•1 comments

Model Context Protocol (MCP) Specification 2025-11-25

https://mcp.mintlify.app/specification/2025-11-25
1•somesnm•8m ago•0 comments

AI Legal system disruption with contract engineering

https://kyc.co/articles/vertical-markets-vanishing-lawyers-and-the-new-operating-system-of-commerce
1•steven555•12m ago•1 comments

I Reverse-Engineered Exa.ai Infrastructure Cost with Napkin Math

https://www.kshivendu.dev/blog/exa-napkin-math
1•kshivendu•14m ago•1 comments

Hurl 7.1.0, the Pretty Edition

https://hurl.dev/blog/2025/11/26/hurl-7.1.0-the-pretty-edition.html
1•jicea•15m ago•0 comments

Show HN: How to Quickly and Free Remove TikTok Video Watermarks

https://vdraw.ai/tiktok-watermark-remover
1•passioner•20m ago•0 comments

ArcOS v1.1 – A Natural-Language Cognitive Operating System

https://github.com/Takeshi-Sakamoto5/ArcOS-v1.1
1•takeshi_sakamo•24m ago•1 comments

Which Browser Should I Use In 2025

https://hackaday.com/2025/04/07/which-browser-should-i-use-in-2025/
2•heatherleelove•27m ago•0 comments

Health Care Systems

https://rodgercuddington.substack.com/p/healthcare-systems
2•freespirt•27m ago•1 comments

A New Blueprint: House of Leaves and AI

https://oxonianreview.com/articles/a-new-blueprint-house-of-leaves-and-ai
1•bryanrasmussen•28m ago•0 comments

Building Self-Hosting Rails Applications: Design Decisions and Why

https://sendbroadcast.net/blog/self-hosting-rails
1•amalinovic•30m ago•0 comments

Foreign tourists to pay extra fee to visit US national parks

https://www.bbc.com/news/articles/c1kpnxvpgy2o
1•mikhael•33m ago•0 comments

Benchmarking GPT-5.1 vs. Gemini 3.0 vs. Opus 4.5 across 3 Coding Tasks

https://blog.kilo.ai/p/benchmarking-gpt-51-vs-gemini-30-vs-opus-45
2•heymax054•38m ago•0 comments

Recent Performance and Administration Features in Firebird

https://www.ibphoenix.com/articles/art-00000602
1•mariuz•39m ago•0 comments

Show HN: Lifeline – Visual memory journal with emotion auras and AI companion

https://mylifelineapp.com/
1•Remi_Etien•40m ago•0 comments

Why Pricing Power Is the Most Important Economic Signal No One Tracks

https://capitalfolly.com/
2•d_e_solomon•48m ago•2 comments

Should R ecosystem be a choice for longer-term projects?

1•northlondoner•51m ago•0 comments

If you're building an AI product, interface is your primry competitive advantage

https://eleganthack.com/ux-is-your-moat-and-youre-ignoring-it/
2•kaizenb•52m ago•0 comments

Kastor – Build data pipelines visually

https://kastor-242087227970.us-west1.run.app/
1•Snidow•57m ago•1 comments

Statistical Process Control in Python

https://timothyfraser.com/sigma/statistical-process-control-in-python.html
10•lifeisstillgood•58m ago•0 comments

Show HN: SpacePigeon – Save and Restore macOS Workspaces

https://github.com/louivers/spacepigeon
1•kakmuis•1h ago•0 comments

It's Not Just You – The iOS Keyboard Is Broken

https://youtu.be/hksVvXONrIo
3•jmaker•1h ago•1 comments

Show HN: Hanzi Stroke – An interactive tool to learn Chinese character writing

https://www.hanzistroke.com/en
1•YarkYao•1h ago•0 comments

After nearly 100 years, scientists may have detected dark matter

https://phys.org/news/2025-11-years-scientists-dark.html
3•alex-moon•1h ago•0 comments

Stanford AI Club: Jeff Dean on Important AI Trends [video]

https://www.youtube.com/watch?v=AnTw_t21ayE
1•pss314•1h ago•0 comments

Show HN: SkimIt – An extension to highlight Green/Red flags on LinkedIn profiles

https://chromewebstore.google.com/detail/skimit-linkedin-recruiter/ipaajbgmiinahmfbmmjpikmfjkccpocj
1•ngninja•1h ago•0 comments

AWS is 10x slower than a dedicated server for the same price [video]

https://www.youtube.com/watch?v=Ps3AI1kTIR4
80•wolfgangbabad•1h ago•98 comments

Show HN: I built directory of fashion brands because I didn't know how to dress

https://brandlist.it.com
2•EthanSeo•1h ago•2 comments