frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Evaluating LLMs on creative writing via reader usage, not benchmarks

https://www.narrator.sh/
1•Jetwu•12h ago
Hey HN! I'd love to get some people to mess around with a little side project I built to teach myself DSPy! I've been a big fan of reading fiction + webnovels for a while now, and have always been curious about two things: how can LLMs iteratively learn to write better based on reader feedback, and which LLMs are actually best at creative writing (research benchmarks are cool, but don't necessarily translate to real-world usage).

That's exactly why I built narrator.sh! The platform takes in a user input for a novel idea, then generates serialized fiction chapter-by-chapter by using DSPy to optimize the writing based on real reader feedback. I'm using CoT and parallel modules to break down the writing task, refine modules + LLM-as-a-judge for reward functions, and the SIMBA optimizer to recompile user ratings from previous chapters to improve subsequent ones.

Instead of synthetic benchmarks, I track real reader metrics: time spent reading, ratings, bookmarks, comments, and return visits. This creates a leaderboard of which models actually write engaging fiction that people want to finish.

Right now the closest evals for creative writing LLMs come from the author perspective (OpenRouter's usage data for tools like Novelcrafter). But ultimately readers decide what's good, not authors.

You can try it at https://narrator.sh. Here's the current leaderboard: https://narrator.sh/llm-leaderboard (it's a bit bare right now b/c there's not that many users haha)

(Fair warning: there's some adult content since I posted on Reddit for beta testers and people got creative with prompts. I'm working on diversifying the content!)

Progress towards universal Copy/Paste shortcuts on Linux

https://mark.stosberg.com/universal-copy-paste/
46•uncircle•2d ago•34 comments

Blurry rendering of games on Mac

https://www.colincornaby.me/2025/08/your-mac-game-is-probably-rendering-blurry/
272•bangonkeyboard•8h ago•142 comments

Gemma 3 270M: Compact model for hyper-efficient AI

https://developers.googleblog.com/en/introducing-gemma-3-270m/
618•meetpateltech•14h ago•245 comments

We rewrote the Ghostty GTK application

https://mitchellh.com/writing/ghostty-gtk-rewrite
263•tosh•9h ago•84 comments

Why LLMs can't really build software

https://zed.dev/blog/why-llms-cant-build-software
482•srid•17h ago•289 comments

South Park and the greatest TV contract clause

https://www.readtrung.com/p/south-park-and-the-greatest-tv-contract
61•JustExAWS•1h ago•10 comments

I used to know how to write in Japanese

https://aethermug.com/posts/i-used-to-know-how-to-write-in-japanese
64•mrcgnc•5h ago•50 comments

Making reliable distributed systems in the presence of software errors (2003) [pdf]

http://erlang.org/download/armstrong_thesis_2003.pdf
37•vismit2000•4d ago•4 comments

Blood oxygen monitoring returning to Apple Watch in the US

https://www.apple.com/newsroom/2025/08/an-update-on-blood-oxygen-for-apple-watch-in-the-us/
379•thm•17h ago•268 comments

Time to End Roundtripping by Big Pharma

https://www.cfr.org/blog/time-end-roundtripping-big-pharma
56•luu•5h ago•23 comments

Citybound: City building game, microscopic models to vividly simulate organism

https://aeplay.org/citybound
67•modinfo•7h ago•23 comments

I made a real-time C/C++/Rust build visualizer

https://danielchasehooper.com/posts/syscall-build-snooping/
269•dhooper•14h ago•57 comments

The secret code behind the CIA's Kryptos puzzle is up for sale

https://news.artnet.com/art-world/cia-kryptos-sculpture-code-auction-2677451
68•elahieh•5h ago•34 comments

Show HN: Evaluating LLMs on creative writing via reader usage, not benchmarks

https://www.narrator.sh/
18•Jetwu•12h ago•7 comments

Show HN: OWhisper – Ollama for realtime speech-to-text

https://docs.hyprnote.com/owhisper/what-is-this
165•yujonglee•14h ago•46 comments

Steve Wozniak: Life to me was never about accomplishment, but about happiness

https://yro.slashdot.org/comments.pl?sid=23765914&cid=65583466
638•MilnerRoute•12h ago•399 comments

Teenage Engineering's free computer case

https://teenage.engineering/store/computer-2
43•textadventure•2h ago•22 comments

Streaming services are driving viewers back to piracy

https://www.theguardian.com/film/2025/aug/14/cant-pay-wont-pay-impoverished-streaming-services-are-driving-viewers-back-to-piracy
753•nemoniac•13h ago•590 comments

Org-social is a decentralized social network that runs on Org Mode

https://github.com/tanrax/org-social
139•tanrax•1d ago•57 comments

Show HN: I built a free alternative to Adobe Acrobat PDF viewer

https://github.com/embedpdf/embed-pdf-viewer
225•bobsingor•14h ago•45 comments

The Kuzma Self-Playing Guitar System

https://www.core77.com/posts/137962/The-Kuzma-Self-Playing-Guitar-System
16•surprisetalk•3d ago•8 comments

Launch HN: Cyberdesk (YC S25) – Automate Windows legacy desktop apps

57•mahmoud-almadi•15h ago•44 comments

What does Palantir actually do?

https://www.wired.com/story/palantir-what-the-company-does/
280•mudil•1d ago•232 comments

OneSignal (YC S11) Is Hiring Engineers

https://onesignal.com/careers
1•gdeglin•9h ago

Airbrush art of the 80s was Chrome-tastic (2015)

https://www.coolandcollected.com/airbrush-art-of-the-80s-was-chrome-tastic/
79•Michelangelo11•10h ago•34 comments

DINOv3

https://github.com/facebookresearch/dinov3
106•reqo•10h ago•19 comments

Architecting large software projects [video]

https://www.youtube.com/watch?v=sSpULGNHyoI
98•jackdoe•2d ago•43 comments

Map of Wales and Pronunciation from Wikipedia

https://www.mapiau.cymru/mapiau/MapLlais/index.html
8•gregsadetsky•3d ago•1 comments

Show HN: Understanding the Spatial Web Browser Engine

https://m-creativelab.github.io/jsar-runtime/blogs/spatial-browser-engine.html
9•yorkie•2d ago•1 comments

Is chain-of-thought AI reasoning a mirage?

https://www.seangoedecke.com/real-reasoning/
149•ingve•16h ago•131 comments