Hi HN, I built an open-source pipeline that turns AI model metadata into dark fantasy RPG character portraits, and here's the twist: each model generates its own.
How it works (4 stages):
1. Creator → Monster. GPT-5.2 plays a "Dark Fantasy Loremaster": it analyzes the AI company's name and assigns a monster archetype + color palette (first sketch below the list). "Black Forest Labs" → Wyrdwood Witch-Hart. "ByteDance Seed" → Spriggan Code-Dancer. "xAI" → Rune-Eyed Homunculus.
2. Model → Item + Material. A second GPT-5.2 role ("Creative Art Director") maps each model's size and modality to a holdable item. Image models get weapons (small=dagger, large=greatsword). Video models get time/vision tools (hourglasses, mirrors). Audio gets resonance tools. 3D gets construction tools. So Qwen Image Max (large, image) → Colossal Calligrapher's Greatsword made of Starforged Brass.
3. Self-portrait. The model's own fal.ai endpoint generates a portrait of itself as the monster holding the item (second sketch below the list). FLUX.2 paints its own Witch-Hart. Seedream paints its own Spriggan. The prompt is literally just: {material} {monster} holding {item}.
4. Style unification. Every raw portrait gets restyled through Seedream v4.5's edit endpoint with the creator's color palette + a global style reference, so the whole set looks cohesive.
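Concretely, stages 1–2 are just structured-output calls. A rough sketch of the Loremaster step (schema fields, prompt text, and the provider import are simplified and illustrative, not the exact code in the repo):

    import { generateObject } from "ai";
    import { openai } from "@ai-sdk/openai"; // illustrative provider; any AI SDK text model works
    import { z } from "zod";

    // Zod schema the Loremaster must fill in
    const CreatorIdentity = z.object({
      monster: z.string().describe("dark fantasy monster archetype"),
      palette: z.array(z.string()).describe("hex colors for the creator's signature palette"),
    });

    const { object: identity } = await generateObject({
      model: openai("gpt-5.2"),
      system:
        "You are a Dark Fantasy Loremaster. Given an AI company's name, assign it a monster archetype and a color palette.",
      prompt: 'Creator: "Black Forest Labs"',
      schema: CreatorIdentity,
    });
    // => { monster: "Wyrdwood Witch-Hart", palette: [...] }
    // Stage 2 ("Creative Art Director") is the same pattern with an item + material schema,
    // keyed on the model's size and modality.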
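Stage 3 is then a single image call against whatever fal endpoint belongs to the model being portrayed. Roughly (the endpoint id and values here are placeholders):

    import { experimental_generateImage as generateImage } from "ai";
    import { fal } from "@ai-sdk/fal";

    // Placeholder values standing in for the stage 1–2 outputs
    const material = "Starforged Brass";
    const monster = "Wyrdwood Witch-Hart";
    const item = "Colossal Calligrapher's Greatsword";

    const { image } = await generateImage({
      // In the real pipeline this is the portrayed model's own endpoint (FLUX, Seedream, ...)
      model: fal.image("fal-ai/flux/dev"),
      prompt: `${material} ${monster} holding ${item}`,
    });
    // Stage 4 runs the same kind of call against Seedream's edit endpoint, feeding in the raw
    // portrait, the creator's palette, and a global style reference.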
Everything is cached as markdown files (human-readable, git-diffable), uploaded to S3 via Bun's native S3 API, and served on the site: https://modeldrop.fyi
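The upload side is just Bun's built-in S3 client; a minimal sketch (bucket name and key layout are made up, credentials come from the usual S3_* env vars):

    import { S3Client } from "bun";

    const s3 = new S3Client({
      bucket: "modeldrop-cache", // placeholder bucket name
      region: "us-east-1",       // placeholder region
      // accessKeyId / secretAccessKey fall back to S3_ACCESS_KEY_ID / S3_SECRET_ACCESS_KEY
    });

    // Write one cached identity file (contents abbreviated)
    const identityMarkdown = "# Black Forest Labs\n\n- monster: Wyrdwood Witch-Hart\n";
    await s3.file("identities/black-forest-labs.md").write(identityMarkdown, {
      type: "text/markdown",
    });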
The whole thing runs on the Vercel AI SDK (@ai-sdk/fal for images, ai package for text generation) with Zod schemas for validation. All prompts, identity caches, and generation metadata are version-controlled.
Repo: https://github.com/okandship/MODELDROP (CC0, public domain)
What I think is interesting: the same pipeline produces wildly different results because each image model has its own "style fingerprint." FLUX portraits look different from Seedream portraits even with identical prompt structure. The restyle pass smooths this out but you can still feel the model's personality.
Would love feedback. What creators or models am I missing?