We're meant to assume that correct sentences were written by humans and that AI adds glaring factual errors. I don't think it's possible at this point to tell a single human-written sentence from an AI-written sentence with no other context, and it's dangerous to pretend it is this easy.
Several of the AI images included obvious mistakes a human wouldn't have made, but some of them also just seemed like entirely plausible digital illustrations.
Oversimplifying generative AI identification risks overconfidence that makes you even easier to fool.
Loosely related anecdote: A few months ago I showed an illustration of an extinct (bizarre-looking) fish to a group of children (ages 10-13ish). They immediately started yelling that it was AI. I'm glad they are learning that images can be fake, but I actually had to explain that "Yes, I know this is not a photo. This animal is long extinct and this is what we think it looked like, so a person drew it. No one is trying to fool you."
(minor spoiler)
The text accompanying an image of a painting:
> This image shows authentic human photography with natural imperfections, consistent lighting, and realistic proportions that indicate genuine capture rather than artificial generation. Meindert Hobbema. The Avenue at Middelharnis (1689, National Gallery, London)
I don't mind that you're selling an AI product if it's good but at least put some humanity on the marketing side.
I don't think this is accurate. AI has a flavour or tone we all know, but it could just as easily have generated factually plausible statements, or simply plausible text, that you could not have diagnosed in this test.
I could not tell the real from fake music at all.
I support (and pay for) Kagi, but wasn't overly impressed here. At worst I think it might give people too much confidence. Wikipedia has a great guideline on spotting AI text and I think the game here should integrate and reflect its contents: https://en.wikipedia.org/wiki/Wikipedia:Signs_of_AI_writing
- AI slop is trivially factually wrong and frequently overconfident.
- AI slop is verbose.
But, as you note, IRL this is not usually the case. It might have been true in the GPT-3.5 or early GPT-4 days, but things have moved on. GPT-5.1 Pro can be laconic and is rarely factually wrong.
The best way to identify AI slop text is by its use of special and nonstandard characters. A human would usually write "Gd2O3" for gadolinium oxide, whereas an AI would default to "Gd₂O₃". ChatGPT also loves to use the non-breaking hyphen (U+2011), whereas humans typically use the standard hyphen-minus (U+002D). There's more along these lines. The issue is that the bots are too scrupulously correct in the characters they use.
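For what it's worth, this heuristic is easy to mechanise. Below is a minimal Python sketch (mine, not from any existing tool) that flags the two tells named above, subscript digits and the non-breaking hyphen. The character list is an illustrative assumption; treat it as a starting point, not a detector.

```python
# A sketch of the character heuristic described above (my own illustration,
# not the commenter's tooling). The suspect list covers only the examples
# named in the comment; a real detector would need a much longer list and
# would still false-positive on carefully typeset human text.
import unicodedata

SUSPECT = {
    "\u2011": "NON-BREAKING HYPHEN (humans usually type '-', U+002D)",
}

def flag_unusual_chars(text: str) -> list[tuple[int, str, str]]:
    """Return (index, char, reason) for characters humans rarely type."""
    hits = []
    for i, ch in enumerate(text):
        if ch in SUSPECT:
            hits.append((i, ch, SUSPECT[ch]))
        elif unicodedata.category(ch) == "No":
            # "Number, other": subscript/superscript digits, fractions, etc.,
            # e.g. the U+2082 and U+2083 in "Gd₂O₃".
            hits.append((i, ch, unicodedata.name(ch, "UNKNOWN")))
    return hits

# "Gd₂O₃" flags two subscript digits; "Gd2O3" flags nothing.
print(flag_unusual_chars("Gd\u2082O\u2083 vs Gd2O3, a non\u2011breaking hyphen"))
```

Of course, this only catches lazy copy-paste; any pipeline that normalises Unicode before publishing defeats it.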
As for music, it can be very tough to distinguish. Interestingly, there are some genres of music that are entirely beyond the ability of AI to replicate.
Sounds interesting, what are some of those genres?
I started on "Level 1" and got 2 things wrong (both false positives if it matters) and instead of feeling like I learned anything, I felt as though I was set up to fail because the image prompt was missing sufficient context or the text prompt was too simple to be human. Either I was dumb or the game was dumb.
Maybe I'm just too old and 8-11 year-old kids wouldn't be so easily discouraged, but I'd recommend:
1. Pick on one member of the "slop syndicate" at a time.
2. Show some examples (evidence) before beginning the evaluation.
>This was actually AI-generated slop! Repeats 'water is wet' multiple times.
I didn't know writing "water is wet" repeatedly was enough to dehumanize you.
>In many situations, it could be argued that grass may sometimes appear to have a greenish quality, though this might not always be the case.
>This was actually AI-generated slop! Won't commit to 'grass is green' and uses uncertain words.
What? Not all grass is green.
Fun times ahead.
https://arxiv.org/abs/2510.15061
Also somewhat tangentially relevant video: https://www.youtube.com/watch?v=Tsp2bC0Db8o