frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

We ran over 600 image generations to compare AI image models

https://latenitesoft.com/blog/evaluating-frontier-ai-image-generation-models/
41•kalleboo•2h ago

Comments

Dwedit•1h ago
You can always identify the OpenAI result because it's yellow.
Bombthecat•16m ago
And mid journey because it's cell shading:)
Hoasi•10m ago
Also because it’s mid :)
jstummbillig•1h ago
> If you made it all the way down here you probably don’t need a summary

Love the optimism

LogicFailsMe•55m ago
I skipped to the end to see if they did any local models. spoilers: they didn't.
sema4hacker•1h ago
Are artists and illustrators going the way of the horse and buggy?
LogicFailsMe•48m ago
No, but this is the beginning of a new generation of tools to accelerate productivity. What surprises me is that the AI companies are not market savvy enough to build those tools yet. Adobe seems to have gotten the memo though.
somenameforme•36m ago
In testing some local image gen software, it takes about 10 seconds to generate a high quality image on my relatively old computer. I have no idea the latency on a current high end computer, but I expect it's probably near instantaneous.

Right now though the software for local generation is horrible. It's a mish-mash of open source stuff with varying compatibility loaded with casually excessive use of vernacular and acronyms. To say nothing of the awkwardness of it mostly being done in python scripts.

But once it gets inevitably cleaned up, I expect people in the future are going to take being able to generate unlimited, near instantaneous images, locally, for free, for granted.

bnj•16m ago
I've been waiting for solutions that integrate into the artistic process instead of replacing it. Right now a lot of the focus is on generating a complete image, but if I was in photoshop (or another editor) and could use AI tooling to create layers and other modifications that fit into a workflow, that would help with consistency and productivity.

I haven't seen the latest from adobe over the last three months, but last I saw the firefly engine was still focused on "magically" creating complete elements.

Hoasi•13m ago
> Adobe seems to have gotten the memo though.

So far Adobe AI tools are pretty useless, according to many professional illustrators. With Firefly you can use other (non-Adobe) image generators. The output is usually barely usable at this point in time.

jonathanstrange•24m ago
Artists no, illustrators and graphic designers yes. They'll mostly become redundant within the next 50 years. With these kind of technologies, people tend to overestimate the short-term effects and severely underestimate the long-term effects.
Bombthecat•17m ago
Yes and now. IKEA and co didn't replace custom made tables, just reduced the number of people needing a custom table.

Same will happen to music, artists etc. They won't vanish. But only a few per city will be left

kevin009•1h ago
Everyday I generate more than 600 image and also compare them, it takes me 5 hours
alienbaby•1h ago
Interesting experiment, though I'm not certain quite how the models are usefully compared.
th0ma5•1h ago
This seems to imply that the capabilities being tested are like the descriptive words used in the prompts, but, as a category using random words would be just as valid for exercising the extents of the underlying math. And when I think of that reality I wonder why a list of tests like this should be interesting and to what ends. The repeated nature of the iteration implies that some control or better quality is being sought but the mechanism of exploration is just trial and error and not informative of what would be repeatable success for anyone else in any other circumstance given these discoveries.
XYZ12334•58m ago
Waiting for the SimonW futa conversion benchmark
fsniper•49m ago
Is it me or ChatGPT change subtle or sometimes more prominent things? Like ball holding position of the hand, face features like for head, background trees and alike?
yapyap•46m ago
Using gen. ai for filters is stupid, a filter guarantees the same object but filtered, a gen. AI version of this guarantees nothing and an expensive AI bill.

It’s like using gen. ai to do math instead of extracting the numbers from a story and just doing the math with +, -, / and *

whoaoweird•37m ago
It was interesting to see how often the OpenAI model changed the face of the child. Often the other two models wouldn't, but OpenAI would alter the structure of their head (making it rounder), eyes (making them rounder), or altering the position and facing of the children in the background.

It's like OpenAI is reducing to some sort of median face a little on all of these, whereas the other two models seemed to reproduce the face.

For some things, exactly reproducing the face is a problem -- for example in making them a glass etching, Gemini seemed unwilling to give up the specific details of the child's face, even though that would make sense in that context.

frotaur•18m ago
It's crazy that the 'piss filter' of openAI image generation hasn't been fixed yet. I wonder if it's on purpose for some reason ?
gs17•17m ago
It's interesting to me that the models often have their "quirks". GPT has the orange tint, but it also is much worse at being consistent with details. Gemini has a problem where it often returns the image unchanged or almost unchanged, to the point where I gave up on using it for editing anything. Not sure if Seedream has a similar defining "feature".

They noted the Gemini issue too:

> Especially with photos of people, Gemini seems to refuse to apply any edits at all

Terminal Latency on Windows (2024)

https://chadaustin.me/2024/02/windows-terminal-latency/
55•bariumbitmap•1h ago•22 comments

A Catalog of Side Effects

https://bernsteinbear.com/blog/compiler-effects/
7•speckx•19m ago•1 comments

Scaling HNSWs

https://antirez.com/news/156
62•cyndunlop•5h ago•3 comments

The history of Casio watches

https://www.casio.com/us/watches/50th/Heritage/1970s/
60•qainsights•2d ago•29 comments

Cache-friendly, low-memory Lanczos algorithm in Rust

https://lukefleed.xyz/posts/cache-friendly-low-memory-lanczos/
66•lukefleed•2h ago•7 comments

FFmpeg to Google: Fund Us or Stop Sending Bugs

https://thenewstack.io/ffmpeg-to-google-fund-us-or-stop-sending-bugs/
172•CrankyBear•1h ago•97 comments

We ran over 600 image generations to compare AI image models

https://latenitesoft.com/blog/evaluating-frontier-ai-image-generation-models/
43•kalleboo•2h ago•21 comments

Show HN: Cactoide – Federated RSVP Platform

https://cactoide.org/
35•orbanlevi•3h ago•14 comments

Pikaday: A friendly guide to front-end date pickers

https://pikaday.dbushell.com
36•mnemonet•5h ago•15 comments

Creating minimal music with code in any programming language

https://zserge.com/posts/etude-in-c/
13•etrvic•6d ago•0 comments

iPhone Pocket

https://www.apple.com/newsroom/2025/11/introducing-iphone-pocket-a-beautiful-way-to-wear-and-carr...
338•soheilpro•9h ago•867 comments

Weave (YC W25) is hiring a founding ML engineer

https://www.ycombinator.com/companies/weave-3/jobs/ZPyeXzM-founding-ml-engineer
1•adchurch•3h ago

Show HN: Data Formulator – interactive AI agents for data analysis (Microsoft)

https://data-formulator.ai/
11•chenglong-hn•2h ago•6 comments

How I fell in love with Erlang

https://boragonul.com/post/falling-in-love-with-erlang
324•asabil•1w ago•190 comments

Firefox expands fingerprint protections

https://blog.mozilla.org/en/firefox/fingerprinting-protections/
164•ptrhvns•4h ago•82 comments

The R47: A new physical RPN calculator

https://www.swissmicros.com/product/model-r47
121•dm319•4d ago•69 comments

Drawing Text Isn't Simple: Benchmarking Console vs. Graphical Rendering

https://cv.co.hu/csabi/drawing-text-performance-graphical-vs-console.html
39•PaulHoule•5h ago•28 comments

Array Programming the Mandelbrot Set

https://jcmorrow.com/mandelbrot/
28•jcmorrow•4d ago•3 comments

Grebedoc – static site hosting for Git forges

https://grebedoc.dev
27•todsacerdoti•4h ago•4 comments

Advent of Code on the Z-Machine

https://entropicthoughts.com/advent-of-code-on-z-machine
85•todsacerdoti•8h ago•17 comments

Show HN: Creavi Macropad – Built a wireless macropad with a display

https://creavi.tech/blog/creavi-macropad-build-log/
7•cmpx•1h ago•2 comments

Show HN: Gametje – A casual online gaming platform

https://gametje.com
81•jmpavlec•5h ago•30 comments

Widespread distribution of bacteria containing PETases across global oceans

https://academic.oup.com/ismej/article/19/1/wraf121/8159680?login=false
96•PaulHoule•7h ago•59 comments

Why effort scales superlinearly with the perceived quality of creative work

https://markusstrasser.org/creative-work-landscapes.html
113•eatitraw•11h ago•93 comments

The Perplexing Appeal of the Telepathy Tapes

https://asteriskmag.com/issues/12-books/paradigm-shifted-the-perplexing-appeal-of-the-telepathy-t...
46•surprisetalk•6h ago•40 comments

The 'Toy Story' You Remember

https://animationobsessive.substack.com/p/the-toy-story-you-remember
1053•ani_obsessive•16h ago•293 comments

Welcome, the entire land - "Hello, world!" in hieroglyphics (2009)

https://optional.is/required/2009/12/03/welcome-the-entire-land/
77•andrelaszlo•9h ago•29 comments

High speed X-ray video: jumping beans, wind-up toys and more

https://www.youtube.com/watch?v=xdpDd7dyU00
49•surprisetalk•4d ago•17 comments

DARPA and Texas Bet $1.4B on Unique Foundry -3D heterogeneous integration

https://spectrum.ieee.org/3d-heterogeneous-integration
57•pseudolus•8h ago•14 comments

SoftBank sells its entire stake in Nvidia

https://www.cnbc.com/2025/11/11/softbank-sells-its-entire-stake-in-nvidia-for-5point83-billion.html
271•mfiguiere•12h ago•167 comments