Based on recent work, here are a few approaches that seem to help (and their limits); a rough sketch of each follows the list:
Semantic-first models: treating intent, entities, and relationships as first-class objects rather than forcing everything through star schemas
Hybrid structured + retrieval layers: combining strict schemas for facts with embeddings for discovery, at the cost of more complex orchestration
Query mediation layers: translating natural language into constrained query plans instead of free-form SQL or retrieval
Explicit conversational state: modeling context and history as data, not just prompt text
Evaluation beyond accuracy: measuring conversational drift, ambiguity resolution, and recovery paths
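For the semantic-first idea, a minimal sketch of what "intent, entities, and relationships as first-class objects" can look like; all class and field names are my own placeholders, not anything from the article:

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    entity_id: str
    entity_type: str                 # e.g. "customer", "order"
    attributes: dict = field(default_factory=dict)


@dataclass
class Relationship:
    subject_id: str                  # an Entity.entity_id
    predicate: str                   # e.g. "placed", "refers_to"
    object_id: str


@dataclass
class Intent:
    name: str                        # e.g. "lookup_order_status"
    entities: list = field(default_factory=list)        # list of Entity
    relationships: list = field(default_factory=list)   # list of Relationship


# A resolved user turn becomes a small graph, not a row bent into a star schema:
acme = Entity("c1", "customer", {"name": "acme"})
order = Entity("o7", "order", {"status": "open"})
turn_intent = Intent("lookup_order_status", [acme, order],
                     [Relationship("c1", "placed", "o7")])
```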
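For the hybrid structured + retrieval layer, a toy sketch assuming a tiny SQLite schema and a placeholder embedding function (a real system would call an actual embedding model): exact facts come from the schema, fuzzy context comes from vector similarity, and the orchestration cost is that both paths have to be kept consistent.

```python
import sqlite3
import numpy as np


def embed(text: str) -> np.ndarray:
    """Placeholder embedding, deterministic within a process so the sketch runs offline."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(16)
    return v / np.linalg.norm(v)


# Strict layer: facts live in a normal schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, customer TEXT, status TEXT)")
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)",
                 [(1, "acme", "open"), (2, "globex", "shipped")])

# Retrieval layer: free-text notes indexed by embedding.
notes = ["acme asked about a refund", "globex shipment delayed at customs"]
note_vecs = np.stack([embed(n) for n in notes])


def answer(question: str, customer: str) -> dict:
    facts = conn.execute(
        "SELECT id, status FROM orders WHERE customer = ?", (customer,)
    ).fetchall()                                   # exact, schema-backed
    scores = note_vecs @ embed(question)           # fuzzy, for discovery
    context = [notes[i] for i in np.argsort(-scores)[:2]]
    return {"facts": facts, "context": context}


print(answer("what is going on with acme's order?", "acme"))
```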
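For query mediation, the point is that the model emits a constrained plan and the mediator refuses anything outside a whitelist before compiling it to SQL; the plan shape, table names, and allowed operators below are illustrative assumptions.

```python
from dataclasses import dataclass

ALLOWED = {"orders": {"id", "customer", "status", "created_at"}}
OPERATORS = {"=", ">", "<"}


@dataclass
class Filter:
    column: str
    op: str
    value: str


@dataclass
class QueryPlan:
    table: str
    columns: list
    filters: list


def compile_plan(plan: QueryPlan):
    """Turn a validated plan into parameterized SQL; reject anything off the whitelist."""
    if plan.table not in ALLOWED:
        raise ValueError(f"table not allowed: {plan.table}")
    cols = ALLOWED[plan.table]
    if not set(plan.columns) <= cols:
        raise ValueError("unknown column in select list")
    clauses, params = [], []
    for f in plan.filters:
        if f.column not in cols or f.op not in OPERATORS:
            raise ValueError("disallowed filter")
        clauses.append(f"{f.column} {f.op} ?")
        params.append(f.value)
    where = f" WHERE {' AND '.join(clauses)}" if clauses else ""
    return f"SELECT {', '.join(plan.columns)} FROM {plan.table}{where}", params


# e.g. "show open orders for acme" -> a plan the mediator can verify:
plan = QueryPlan("orders", ["id", "status"],
                 [Filter("customer", "=", "acme"), Filter("status", "=", "open")])
print(compile_plan(plan))
```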
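For explicit conversational state, a sketch of "context as data" with purely illustrative field names: resolved slots, pending clarifications, and turn history live in a structure the application owns, and the prompt is rendered from it rather than being the state.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Turn:
    role: str                       # "user" or "assistant"
    text: str
    intent: Optional[str] = None    # filled in by whatever does intent detection


@dataclass
class ConversationState:
    turns: list = field(default_factory=list)
    resolved_entities: dict = field(default_factory=dict)   # slot name -> value
    pending_clarification: Optional[str] = None

    def add_turn(self, turn: Turn) -> None:
        self.turns.append(turn)


state = ConversationState()
state.add_turn(Turn("user", "what about acme?", intent="lookup_order_status"))
state.resolved_entities["customer"] = "acme"   # carried into later turns as data, not prompt text
print(state.resolved_entities, state.pending_clarification)
```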
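And for evaluation beyond accuracy, a toy harness over labeled conversation logs; the log format and the drift proxy are assumptions, but they show the kind of conversation-level metrics I have in mind (recovery after a failed turn, drift from the opening topic).

```python
def recovery_rate(conversations) -> float:
    """Fraction of failed turns that are eventually followed by a successful turn."""
    failures, recovered = 0, 0
    for turns in conversations:          # turns: list of {"ok": bool, "topic": str}
        for i, t in enumerate(turns):
            if not t["ok"]:
                failures += 1
                if any(later["ok"] for later in turns[i + 1:]):
                    recovered += 1
    return recovered / failures if failures else 1.0


def topic_drift(conversations) -> float:
    """Fraction of turns whose topic differs from the conversation's opening topic."""
    drifted, total = 0, 0
    for turns in conversations:
        if not turns:
            continue
        first = turns[0]["topic"]
        for t in turns[1:]:
            total += 1
            drifted += t["topic"] != first
    return drifted / total if total else 0.0


logs = [[{"ok": True, "topic": "orders"},
         {"ok": False, "topic": "orders"},
         {"ok": True, "topic": "refunds"}]]
print(recovery_rate(logs), topic_drift(logs))
```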
I’ve written these ideas up, with trade-offs and examples, here (this is a Medium friend link, so it should open fully without a paywall):
https://medium.com/data-science-collective/how-to-build-data-models-that-actually-work-for-conversational-ai-in-2026-67d16f261344?sk=8f0f64875ec5e4c26493f6fb207938ec
What I’m hoping to learn from this community:
Which of these approaches hold up in production, and which fall apart?
Are there modeling patterns you’ve found simpler or more robust?
What failure modes show up only at scale or with real users?
Anything here that feels over-engineered or missing entirely?
Looking for concrete experiences, counter-examples, and corrections.