Show HN: We fingerprinted 178 AI models' writing styles and similarity clusters

https://rival.tips/research/model-similarity
55•nuancedev•1h ago
We have a dataset of 3,095 standardized AI responses across 43 prompts. From each response, we extract a 32-dimension stylometric fingerprint (lexical richness, sentence structure, punctuation habits, formatting patterns, discourse markers).
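
To give a rough idea of what one such fingerprint could contain, here is a minimal Python sketch (not the Node.js extractor itself). The function name `fingerprint` and the four features are assumptions chosen for illustration; the real 32 dimensions are not listed in this post.

```python
import re
import string


def fingerprint(text: str) -> dict[str, float]:
    """Return a tiny, illustrative stylometric feature vector for one response.

    Hypothetical features only; the real fingerprint has 32 dimensions.
    """
    words = re.findall(r"[A-Za-z']+", text.lower())
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = max(len(words), 1)          # avoid division by zero on empty text
    n_sentences = max(len(sentences), 1)
    punctuation = sum(ch in string.punctuation for ch in text)
    bullet_lines = sum(
        line.lstrip().startswith(("-", "*")) for line in text.splitlines()
    )
    return {
        # lexical richness: distinct words / total words
        "type_token_ratio": len(set(words)) / n_words,
        # sentence structure: average words per sentence
        "avg_sentence_length": len(words) / n_sentences,
        # punctuation habits: punctuation marks per word
        "punctuation_per_word": punctuation / n_words,
        # formatting patterns: number of lines that start like a bullet
        "bullet_lines": float(bullet_lines),
    }


fp = fingerprint("The cat sat. The cat ran!")
```

Each response yields one such vector, and the vectors can then be averaged per model before any comparison.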

Some findings:

- 9 clone clusters (>90% cosine similarity on z-normalized feature vectors)
- Mistral Large 2 and Large 3 2512 score 84.8% on a composite metric combining 5 independent signals
- Gemini 2.5 Flash Lite writes 78% like Claude 3 Opus. Costs 185x less
- Meta has the strongest provider "house style" (37.5x distinctiveness ratio)
- "Satirical fake news" is the prompt that causes the most writing convergence across all models
- "Count letters" causes the most divergence
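
The clone-cluster criterion in the first finding (>90% cosine similarity on z-normalized feature vectors) can be sketched in Python as follows. This is an illustration under assumptions, not the analysis script: the helper names are hypothetical, z-scoring is assumed to run per feature across all models, and the sketch only lists pairs above the threshold rather than building clusters.

```python
import math


def z_normalize(vectors):
    """Z-score each feature (column) across all models (assumed scope)."""
    n = len(vectors)
    dims = len(vectors[0])
    means = [sum(v[d] for v in vectors) / n for d in range(dims)]
    stds = [
        math.sqrt(sum((v[d] - means[d]) ** 2 for v in vectors) / n)
        for d in range(dims)
    ]
    # A constant feature carries no signal, so it maps to 0.0.
    return [
        [(v[d] - means[d]) / stds[d] if stds[d] > 0 else 0.0 for d in range(dims)]
        for v in vectors
    ]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def clone_pairs(names, vectors, threshold=0.90):
    """Return (name_i, name_j, similarity) for every pair above the threshold."""
    z = z_normalize(vectors)
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            sim = cosine_similarity(z[i], z[j])
            if sim > threshold:
                pairs.append((names[i], names[j], sim))
    return pairs


# Made-up two-feature vectors for three hypothetical models.
pairs = clone_pairs(["a", "b", "c"], [[1.0, 10.0], [1.1, 10.5], [5.0, 2.0]])
```

Z-scoring first keeps features with large raw ranges (such as response length) from dominating the cosine.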

The composite clone score combines: prompt-controlled head-to-head similarity, per-feature Pearson correlation across challenges, response length correlation, cross-prompt consistency, and aggregate cosine similarity.

Tech: stylometric extraction in Node.js, z-score normalization, cosine similarity for aggregate, Pearson correlation for per-feature tracking. Analysis script is ~1400 lines.
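
For the per-feature tracking, a Pearson correlation over prompt-aligned feature values might look like the Python sketch below. The data layout (one list of values per feature, in the same prompt order for both models) and the averaging step in `mean_feature_correlation` are assumptions for illustration; how the real script aggregates the per-feature correlations is not stated here.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series has no defined correlation; treat as 0
    return cov / math.sqrt(var_x * var_y)


def mean_feature_correlation(model_a, model_b):
    """Average the per-feature Pearson correlation across shared features.

    model_a[feature] is a list of that feature's value on each prompt,
    in the same prompt order as model_b[feature] (assumed layout).
    """
    shared = [f for f in model_a if f in model_b]
    correlations = [pearson(model_a[f], model_b[f]) for f in shared]
    return sum(correlations) / len(correlations)
```

A high value means the two models move the same features in the same direction from prompt to prompt, which is a different signal from having similar averages.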

Comments

jefftk•1h ago
> "Models with >75% writing similarity but massive price gaps. The cheap model writes the same way. You are paying for the brand."

> ...

> Gemini 2.5 Flash Lite Preview 06-17 and Claude 3 Opus: 78.2%

As someone who has tried to use many of these models for writing assistance, you're very wrong here. It really matters whether the model can get what I'm trying to communicate well enough to be helpful, or else I'll just write it myself. If you actually play with them a bit it's very clear these models are not substitutes. This goes for many on your list!

lubujackson•1h ago
It makes sense. The cheaper models are often distilled versions, so they may ape language but miss the connective tissue that makes the entire output coherent.
rogerrogerr•1h ago
I'd bet this whole thing is vibe'd out of nothingness and no human actually thought about whether saying "you are paying for the brand" makes any sense at all.

How the hell are companies and individuals not taking reputational hits for saying blatantly wrong things in AI-voice, under their name?

anonzzzies•23m ago
Also, are Gemini and Opus not both big brands? If it was some small AI shop vs. Opus then sure. So it does indeed seem to make little sense.
leonidasv•1h ago
I've always wondered if the "typical" AI writing style is just an unavoidable RL artifact or a deliberate fingerprint to prevent model collapse as low-effort AI-generated text floods the training data pool (the web).
redox99•1h ago
Besides the claim that Opus and Gemini Flash share 99% of their style being suspicious, the point that you are wasting money on the expensive model is nonsensical. You pay primarily for the intelligence, not the writing style.

Is this article AI slop?

kurthr•57m ago
It would be shocking to me if the large model trainers didn't have tools like this to analyze their outputs, but this is interesting work!

You can see who likely (post)trained/distilled their models or borrowed parameters from each other. I do wonder if the 32 dimensions were chosen/named from principal components or pre-selected and designed, but the tool seems like an effective discriminator in any case.

Were the prompts similarly selected for orthogonality? I've wondered how the different LLMs would respond from iterative zero-shot prompt_n generation by summary from a previous response_n to generate zero-shot response_n+1. Would it statistically converge to a more distinguishable prompt for that LLM?

qaid•45m ago
Ugh. The subheadings were a major turn-off.

I expected it to be an analysis of AI-generated writing styles. Not full of them.

;)

add-sub-mul-div•34m ago
It's a spam account. As with nearly all submissions about AI, the account's history is just self-promotion. Together with the subject matter, I'd expect nothing but lazy bullshit.
Imustaskforhelp•6m ago
One thing I don't understand then is why Hacker News upvotes these posts when there are other Show HN posts that are so much nicer and more thoughtful, like https://news.ycombinator.com/item?id=47589735.

Our community is shooting itself in the foot if it continues to upvote pure self-promotion. I try to upvote as many cool projects as I find here, but the numbers game is definitely frustrating.

Are we entirely sure that this person hasn't used an AI bot to upvote his post to the front page? I would much rather believe that than believe people are upvoting it, especially when most if not all comments are about how much it feels like AI slop.

Maybe such forms of rage/click bait truly sell in that regard, and HN isn't as invulnerable as I (we) think it is.

apercu•28m ago
Has anyone else used LLMs to fact-check other LLMs?

I hate to say it, but Gemini lies less frequently than paid models from OpenAI and Anthropic (OpenAI is worst in my use cases).

My guess is that Google has better training data (and uses less synthetic data, which might be creating training feedback loops in other models) and has more of a "be calibrated" model than a "be helpful" model, but it could just be that they lean more on RAG than on the weights.

But I really shouldn't speculate about the "why" as I'm out of my domain. Just curious if others use all the models they can and compare outputs as much as I do.

docheinestages•19m ago
The muted colors on a dark background make everything hard to read.
groby_b•12m ago
Without showing the prompts and responses, it's yet another meaningless AI benchmark.

Many of those numbers do not really match what I've seen in the wild, and without a clear illustration of how you arrived at a number, it's not a helpful number.

glaslong•5m ago
I'm curious about the sorts of users who care about style but will either one-shot with default style, not providing samples or direction, or who even choose models on that style rather than, you know, substance.

Show HN: Explore the Silk Roads through an interactive map

https://www.intofarlands.com/silk-roads-map
15•intofarlands•57m ago•1 comment

Show HN: Go-Bt: Minimalist Behavior Trees for Go

https://github.com/rvitorper/go-bt
12•rvitorper•1h ago•0 comments

Show HN: We built a camera only robot vacuum for less than 300$ (Well almost)

https://indraneelpatil.github.io/blog/2026/robot-vacuum/
76•indraneelpatil•2d ago•33 comments

Show HN: I pipe free sports streams into Jellyfin – no ads, just HLS

https://github.com/pcruz1905/hls-restream-proxy
39•pruz•3h ago•6 comments

Show HN: An interactive map of Tolkien's Middle-earth

https://middle-earth-interactive-map.web.app/
262•frasermarlow•19h ago•54 comments

Show HN: PII-hound – A fast, dependency-free PII scanner in Go

https://github.com/saddledata/pii-hound
2•dbuckman•46m ago•2 comments

Show HN: We fingerprinted 178 AI models' writing styles and similarity clusters

https://rival.tips/research/model-similarity
55•nuancedev•1h ago•14 comments

Show HN: Gemma 4 Multimodal Fine-Tuner for Apple Silicon

https://github.com/mattmireles/gemma-tuner-multimodal
211•MediaSquirrel•20h ago•27 comments

Show HN: BAREmail ʕ·ᴥ·ʔ – minimalist Gmail client for bad WiFi

https://github.com/matt-virgo/baremail
5•Virgo_matt•1h ago•1 comment

Show HN: Brutalist Concrete Laptop Stand (2024)

https://sam-burns.com/posts/concrete-laptop-stand/
763•sam-bee•1d ago•232 comments

Show HN: A cartographer's attempt to realistically map Tolkien's world

https://www.intofarlands.com/atlasofarda
160•intofarlands•1d ago•31 comments

Show HN: Unicode Steganography

https://steganography.patrickvuscan.com
25•PatrickVuscan•1d ago•4 comments

Show HN: Voxcode: local speech to text and ripgrep = transcript and code context

https://github.com/jensneuse/voxcode
4•jensneuse•4h ago•1 comment

Show HN: Android SSH client with full Terminal, server monitoring and runbooks

https://saltserv.com/posts/cura-sysadmin-server-monitoring-android/
3•0dayman•5h ago•0 comments

Show HN: Pion/handoff – Move WebRTC out of browser and into Go

https://github.com/pion/handoff
96•Sean-Der•1d ago•17 comments

Show HN: Ghost Pepper – Local hold-to-talk speech-to-text for macOS

https://github.com/matthartman/ghost-pepper
461•MattHart88•1d ago•195 comments

Show HN: Finalrun – Spec-driven testing using English and vision for mobile apps

https://github.com/final-run/finalrun-agent
26•ashish004•1d ago•12 comments

Show HN: Stop paying for Dropbox/Google Drive, use your own S3 bucket instead

https://locker.dev
242•Zm44•1d ago•198 comments

Show HN: Anos – a hand-written ~100KiB microkernel for x86-64 and RISC-V

https://github.com/roscopeco/anos
112•noone_youknow•4d ago•31 comments

Show HN: Hippo, biologically inspired memory for AI agents

https://github.com/kitfunso/hippo-memory
123•kitfunso•1d ago•24 comments

Show HN: Tusk for macOS and Gnome

https://shapemachine.xyz/tusk/
118•factorialboy•4d ago•45 comments

Show HN: I built a tiny LLM to demystify how language models work

https://github.com/arman-bd/guppylm
896•armanified•2d ago•134 comments

Show HN: Real-time AI (audio/video in, voice out) on an M3 Pro with Gemma E2B

https://github.com/fikrikarim/parlor
288•karimf•2d ago•36 comments

Show HN: GovAuctions lets you browse government auctions at once

https://www.govauctions.app/
313•player_piano•1d ago•89 comments

Show HN: Gemma Gem – AI model embedded in a browser – no API keys, no cloud

https://github.com/kessler/gemma-gem
154•ikessler•2d ago•21 comments

Show HN: TTF-DOOM – A raycaster running inside TrueType font hinting

https://github.com/4RH1T3CT0R7/ttf-doom
65•4RH1T3CT0R•1d ago•12 comments

Show HN: A game where you build a GPU

https://jaso1024.com/mvidia/
954•Jaso1024•3d ago•186 comments

Show HN: Mo – checks GitHub PRs against decisions approved in Slack

https://motionode.com/index
9•oscarcaldera•18h ago•2 comments

Show HN: I made a YouTube search form with advanced filters

https://playlists.at/youtube/search/
319•nevernothing•2d ago•201 comments

Show HN: Can an AI model fit on a single pixel?

https://github.com/dvelton/ai-pixel
8•deevelton•15h ago•7 comments