GenAI Image Editing Showdown

https://genai-showdown.specr.net/

40•rzk•3h ago

Comments

isoprophlex•34m ago

The "editing" showdown is very good. Introduced me to the Seedream model which i didn't know about until now.

I don't fully understand the iterative methodology tho - they allow multiple attempts, which are judged by another multimodal llm? Won't they have limited accuracy in itself?

sans_souse•32m ago

I had to upvote immediately once I got to Alexander the Great on a Hippity Hop

halflife•31m ago

The horse chimera is much better

croes•27m ago

What about the classic: A analog watch that shows the time 08:15?

Did current models overcome the 10:10 bias?

konart•21m ago

>Cephalopodic Puppet Show

I'm pretty sure that only Gemini made it. Other models did not meet the 'each tentacle covered' criteria.

jedbrooke•18m ago

for the OpenAI 4o model on the octopus sock puppet prompt, the prompt clearly states that each tentacle should have a sock puppet, whereas the OpenAI 4o image only has 6 puppets with 2 tentacles being puppetless. I’m not sure if we can call that a pass

snowfield•12m ago

I'd assume that behind the scenes the models generate several passes and only show the user the best one, that would be smart, as to to make it seem their model is better than others

Is also pretty obvious that the models have some built in prompt system rules that makes the final output a certain style. They seem very consistent

It also looks like 40 has the temperature turned way down, to ensure max adherence, while midjourney etc seem to have higher temperature.more interesting end results, flourishing, complex Materials and backgrounds

Also what's with 4o's sepia tones. Post editing in the gen workflows?

I don't believe any of these just generate the image though, there's likely several steps in each workflows to present the final images outputted to the user in the absolute best light.

thorum•11m ago

Actual link seems to be: https://genai-showdown.specr.net/image-editing

typpilol•8m ago

This is the editing link yes. I just got done looking at it from the other link.

The other stuff is text to image (not editing)

neilv•10m ago

> "A dolphin is using its fluke to discipline a mermaid by paddling it across the backside."

If this one were shown in a US work environment, I might say a collegial something privately to the person, about it not seeming the most work-appropriate.

echelon•6m ago

Please fix the title.

The title of this article is "image editing showdown", but the subject is actually prompt adherence in image generation from prompting.

Midjourney and Flux Dev aren't image editing models. (Midjourney is an aesthetically pleasing image generation model with low prompt adherence.)

Image editing is a task distinct from image generation. Image editing models include Nano Banana (Gemini Flash), Flux Kontext, and a handful of others. gpt-image-1 sort of counts, though it changes the global image pixels such that it isn't 1:1 with the input.

Edit: Dang, can you please fix this? Someone else posted the actual link, and it's far more interesting than the linked article:

https://genai-showdown.specr.net/image-editing

This article is great.

Washington lawyer on furlough lives out dream of running a hot dog cart

GenAI Image Editing Showdown

Show HN: Project Journal – Give AI coding assistants persistent memory

Sandbox Your Program Using FreeBSD's Capsicum [video]

TIL: Figma provides a helper function for gradient transforms

Scientists are racing to grow human teeth in the lab

We want to move Ruby forward

The Magic of Precision Engineering

Gluing and framing a 9000-piece jigsaw

AI Pullback Has Officially Started

Lampedusa's 1958 Novel The Leopard skewered the super-rich

Practical Defenses Against Technofascism

The Magna Anima Genius Project

Raster Master v5.4 Sprite/Tile/Map Editor 88 Stars on GitHub

Salesforce Enterprise Deep Research

Operating Systems Written in Free Pascal

Sustained western growth and Artificial Intelligence

Tell HN: Don't Vibe Your Design

Hey LLM, write production-ready code

Student Handcuffed After School's AI System Mistakes a Bag of Chips for a Gun

Show HN: I analyzed 3,465 remote job listings – 72% hide salary information

Why bosses need to wake up to dark patterns

The Layer 1 Blockchain Built for AI Agent

Success Always Spawns Haters

DHS Posts Video Featuring Song Popular with Nazi Creators

Language Modeling with Hierarchical Reasoning Models: Lessons from 1M Parameters

GameStop Declares Console Wars Over

Quick Dungeon Crawler Update 3.5.0: New Passives, CRIT DMG Nerf

Jan van Eijk's wise lessons and advice

How I Used Lies About a Cartoon to Prove History Is Meaningless on the Internet (2016)

Washington lawyer on furlough lives out dream of running a hot dog cart

GenAI Image Editing Showdown

Show HN: Project Journal – Give AI coding assistants persistent memory

Sandbox Your Program Using FreeBSD's Capsicum [video]

TIL: Figma provides a helper function for gradient transforms

Scientists are racing to grow human teeth in the lab

We want to move Ruby forward

The Magic of Precision Engineering

Gluing and framing a 9000-piece jigsaw

AI Pullback Has Officially Started

Lampedusa's 1958 Novel The Leopard skewered the super-rich

Practical Defenses Against Technofascism

The Magna Anima Genius Project

Raster Master v5.4 Sprite/Tile/Map Editor 88 Stars on GitHub

Salesforce Enterprise Deep Research

Operating Systems Written in Free Pascal

Sustained western growth and Artificial Intelligence

Tell HN: Don't Vibe Your Design

Hey LLM, write production-ready code

Student Handcuffed After School's AI System Mistakes a Bag of Chips for a Gun

Show HN: I analyzed 3,465 remote job listings – 72% hide salary information

Why bosses need to wake up to dark patterns

The Layer 1 Blockchain Built for AI Agent

Success Always Spawns Haters

DHS Posts Video Featuring Song Popular with Nazi Creators

Language Modeling with Hierarchical Reasoning Models: Lessons from 1M Parameters

GameStop Declares Console Wars Over

Quick Dungeon Crawler Update 3.5.0: New Passives, CRIT DMG Nerf

Jan van Eijk's wise lessons and advice

How I Used Lies About a Cartoon to Prove History Is Meaningless on the Internet (2016)

GenAI Image Editing Showdown

Comments