frontpage.

Show HN: We built a narrative analysis engine for fiction writers

4•homeonthemtn•1h ago

Our app (LaoTzu Writer Studio) has a feature called The Guardian which catches continuity errors and contradictions in manuscripts. So if you say your character has blue eyes in one chapter, but someone stares longingly into their green eyes in a later chapter, it'll flag that as a discontinuity. On a single thread, that's easy to track, but as a body of related attributes it gets very complicated, especially without discreet input from the user.

We originally tried a named entity recognition-based approach with the goal of tracking entities, attributes, and relationships across the manuscript. We benchmarked on 96 novels from Project Gutenberg with various inconsistencies injected into each one, then ran the "The Guardian" layer across them to ferret them out. Unfortunately this presented 2,500 false positives across 96 novels, so ~26 false positives a novel. It's not technically bad but it's enough to become an unreliable nuisance of a feature

For our next approach, we instead opted to build our own model, which we call "Confucius". This is a purpose-built narrative world model that sits underneath the entire analysis layer.

It consists of five structures which I'm just lazily copying and pasting from our docs here: PropertyGraph — entities as nodes, relationships as weighted edges, co-occurrence counts CausalDAG — setup/payoff chains, unresolved narrative threads IntervalTree — precise word-position intervals for every entity (where is each character in the manuscript at every point) FenwickTree — entity density over word position, O(log n) range queries Trie — fuzzy entity lookup, name variants, partial matches

Confucius is passive in that it only knows what you tell it via an event emission system. We then slot in an LLM for the extraction layer. We tested three approaches for said LLM

1. NER Only

2. Local GGUF Model only

3. Anthropic Haiku Only

NER, in any combination, made things worse, it was low detection and generated the same high number of false positives. GGUF resulted in 100% detection, with zero false positives, and likewise for Anthropic

So based on this, we opted to ship with 3 tiers - heuristic only (no AI required, but basic surface metrics), local GGUC (Qwen3, ~500mb one-time download which enables full Guardian features), or a managed API subscription (Haiku on our key)

We're certainly proud of the result, but unto itself its been a fascinating journey as we surface additional features with each model refinement (e.g. "voice fingerprint" is our newest - essentially the consistency of the characters voice over the span of the book)

We've got a kickstarter going to help fund refinements and model expenses[1], and a roadmap for additional apps down the line which we'll have on the main site[2]. We'd love for folks to try out the app so we can get some real user feedback for UI/UX refinements so please do check out the demo, or just ping us on the side

1. https://www.kickstarter.com/projects/laotzustudio/laotzu-wri...

2. https://www.redwoodrhetorica.com/

Unlocking Asynchronicity in Continuous Batching

Recovering the State of Xorshift128

The Supreme Court just told every freight broker that they can be sued

Software Sandboxing: The Basics

Show HN: Vouch, I scanned 50 AI-coded repos with my own scanner

Cursing the government does not fix potholes. Spray-painting them does

Ukrainian drone strike on fuel depot prompts Latvian prime minister resignation

Microsoft to automatically roll back faulty Windows drivers

Simon the Sorcerer Origins

AST-outline: AST-based code-navigation CLI

Six Joints, Twenty-One Fingers, and the Math of Reach

Too dangerous or just too expensive? The real reason Anthropic is hiding Mythos

Getting Secret Management Right in Kubernetes

The AI-Native Developer

AP News: Dirtnado Sweeps Through Minnesota Farm

Maldives holds first underwater Cabinet meeting in a bid for climate

The language debate is back!

Cerebras CEO: AI chip demand is 'not speculative', IPO price doubles

Ask HN: Hacker News is suffocating me

PauseHer – hold a yoga pose to unlock Instagram or TikTok

Truth, Power, and Honest Journalism

Spreadsheet Errors: Manual Data Mistakes Are Costing Thousands

Trump poised to drop IRS suit, launch $1.7B 'weaponization' fund for allies

Omnisearch – A lightweight metasearch engine written in C

AI Did Not

'I didn't want to be the guinea pig': inside tech's AI-fueled manager purge

Browser Run: now running on Cloudflare Containers, it's faster and more scalable

The old world of tech is dying and the new cannot be born

DeepSeek V4 Pro and Flash vs. Claude Opus 4.7 and Kimi K2.6

Show HN: Bit-exact Elixir port of UltraLogLog (Ertl, VLDB 2024)