frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
1•vladeta•5m ago•1 comments

Show HN: SVGV – A Real-Time Vector Video Format for Budget Hardware

https://github.com/thealidev/VectorVision-SVGV
1•thealidev•6m ago•0 comments

Study of 150 developers shows AI generated code no harder to maintain long term

https://www.youtube.com/watch?v=b9EbCb5A408
1•lifeisstillgood•7m ago•0 comments

Spotify now requires premium accounts for developer mode API access

https://www.neowin.net/news/spotify-now-requires-premium-accounts-for-developer-mode-api-access/
1•bundie•9m ago•0 comments

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•11m ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
1•birdculture•12m ago•0 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•14m ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
1•ramenbytes•17m ago•0 comments

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•18m ago•0 comments

Ed Zitron: The Hater's Guide to Microsoft

https://bsky.app/profile/edzitron.com/post/3me7ibeym2c2n
2•vintagedave•21m ago•1 comments

UK infants ill after drinking contaminated baby formula of Nestle and Danone

https://www.bbc.com/news/articles/c931rxnwn3lo
1•__natty__•22m ago•0 comments

Show HN: Android-based audio player for seniors – Homer Audio Player

https://homeraudioplayer.app
2•cinusek•22m ago•0 comments

Starter Template for Ory Kratos

https://github.com/Samuelk0nrad/docker-ory
1•samuel_0xK•24m ago•0 comments

LLMs are powerful, but enterprises are deterministic by nature

2•prateekdalal•27m ago•0 comments

Make your iPad 3 a touchscreen for your computer

https://github.com/lemonjesus/ipad-touch-screen
2•0y•32m ago•1 comments

Internationalization and Localization in the Age of Agents

https://myblog.ru/internationalization-and-localization-in-the-age-of-agents
1•xenator•32m ago•0 comments

Building a Custom Clawdbot Workflow to Automate Website Creation

https://seedance2api.org/
1•pekingzcc•35m ago•1 comments

Why the "Taiwan Dome" won't survive a Chinese attack

https://www.lowyinstitute.org/the-interpreter/why-taiwan-dome-won-t-survive-chinese-attack
2•ryan_j_naughton•35m ago•0 comments

Xkcd: Game AIs

https://xkcd.com/1002/
1•ravenical•37m ago•0 comments

Windows 11 is finally killing off legacy printer drivers in 2026

https://www.windowscentral.com/microsoft/windows-11/windows-11-finally-pulls-the-plug-on-legacy-p...
1•ValdikSS•37m ago•0 comments

From Offloading to Engagement (Study on Generative AI)

https://www.mdpi.com/2306-5729/10/11/172
1•boshomi•39m ago•1 comments

AI for People

https://justsitandgrin.im/posts/ai-for-people/
1•dive•40m ago•0 comments

Rome is studded with cannon balls (2022)

https://essenceofrome.com/rome-is-studded-with-cannon-balls
1•thomassmith65•46m ago•0 comments

8-piece tablebase development on Lichess (op1 partial)

https://lichess.org/@/Lichess/blog/op1-partial-8-piece-tablebase-available/1ptPBDpC
2•somethingp•47m ago•0 comments

US to bankroll far-right think tanks in Europe against digital laws

https://www.brusselstimes.com/1957195/us-to-fund-far-right-forces-in-europe-tbtb
4•saubeidl•48m ago•0 comments

Ask HN: Have AI companies replaced their own SaaS usage with agents?

1•tuxpenguine•51m ago•0 comments

pi-nes

https://twitter.com/thomasmustier/status/2018362041506132205
1•tosh•53m ago•0 comments

Show HN: Crew – Multi-agent orchestration tool for AI-assisted development

https://github.com/garnetliu/crew
1•gl2334•53m ago•0 comments

New hire fixed a problem so fast, their boss left to become a yoga instructor

https://www.theregister.com/2026/02/06/on_call/
1•Brajeshwar•55m ago•0 comments

Four horsemen of the AI-pocalypse line up capex bigger than Israel's GDP

https://www.theregister.com/2026/02/06/ai_capex_plans/
1•Brajeshwar•55m ago•0 comments
Open in hackernews

Imagen 4 is now generally available

https://developers.googleblog.com/en/announcing-imagen-4-fast-and-imagen-4-family-generally-available-in-the-gemini-api/
213•meetpateltech•5mo ago

Comments

qoez•5mo ago
Looks so much better than the yellow tinted chatgpt output in my eyes
tripplyons•5mo ago
After manually white balancing to remove the tint, I find GPT-Image-1 (the model used in ChatGPT) to be better.
nkzd•5mo ago
I am currently building an AI product which relies on Imagen 3 to generate a lot of photorealistic, cinematic or HDR images. I tried Imagen 4 during preview, but results were too "cartoonish". Did anyone else have the same experience?
LeoPanthera•5mo ago
Yes, it seems very reluctant to generate anything that could be mistaken for a photo.
joegibbs•5mo ago
Yeah me too, I think 3 does a much better job for photos or even just images that look like realistic renders. I use 3 for generating grids of age-progressed portraits for a game and it does a better job at sticking to the prompt. 4 also seems to spit out ones that have that really smooth look that makes it really obvious it’s AI.
ctippett•5mo ago
Clicking on "Read the documentation" leads to a page that documents nothing about the latest Imagen models and only provides examples using Gemini 2.0 Flash.
typpilol•5mo ago
Classic Google
mattxxx•5mo ago
I guess it's kinda nicely genuine that the "four panel comic strip" has some errors in it (misunderstanding caption + cat high-fiving itself in the bonus fifth panel)
jug•5mo ago
I was just thinking that. It has many, many errors.

1. Not seen browsing ”ai.dev”.

2. The text ”Imagen 4 is now generally available!” is spoken, not a comic caption.

3. Invalid second panel.

4. Hallucinates ”Meet Imagen 4 fast!”

5. Hallucinates ”It offers low..” etc. (this is the second part of a single sentence said by the cat)

6. Hallucinates ”You can export images in 2K!” (this sentence is not asked for)

7. Doesn’t have the cat and the dog in the fourth panel.

—

Here’s the gpt-image-1 counterpart with the issues I could find:

https://chatgpt.com/share/689f7e4b-01e4-8011-8997-0f37edf8c2...

1. The text ”Imagen 4 is now generally available!” is still spoken, not a caption.

2. ”low latency” -> ”low-laten”

(3. Has that ugly gpt-image-1 trademark yellow filter requiring work in post to avoid.)

I didn’t bring up the ”retro comic look” thing. I certainly think it’s an issue with Imagen 4’s version. It doesn’t look very old school at all. But I can’t judge the OpenAI one either on that, I’m no comic book expert, so I just skipped that one.

latexr•5mo ago
> I didn’t bring up the ”retro comic look” thing. (…) I’m no comic book expert, so I just skipped that one.

I’m no Scott McCloud, but the OpenAI version definitely does a better job with the retro style. The yellow filter you criticised actually helps to sell the illusion. The Imagen version utterly fails in the retro area, that style is very much modern.

But there are other important flaws in the OpenAI version. The fourth panel has a different cat (the head shape and stripes are wrong) and it bleeds into the previous panel. Technically that could be a stylistic choice, except that the floor/table is inconsistent, making it clear it was a mistake.

edaemon•5mo ago
The cat also has more fingers on one hand than the other. It's a small, inconsequential thing but it always draws my eye in generated images.
typpilol•5mo ago
I got this result with the basic copilot app

https://i.imgur.com/kSuqCYg.jpeg

mattxxx•5mo ago
honestly, that's pretty good
vunderba•5mo ago
The pervasive yellow tinge indicates that that is almost assuredly `gpt-image-1` - OpenAI's flagship model and (aesthetics aside) the highest scoring model in terms of strict prompt adherence that I've seen.

https://genai-showdown.specr.net

pogue•5mo ago
What do you have to do to remove the watermark? Is Google's SynthID watermark on top of the image as well or is it embedded in EXIF data?
postalcoder•5mo ago
Google's SynthID is embedded into the content itself. Google open sourced their SynthID for text.

Repo: https://github.com/google-deepmind/synthid-text

Paper: https://www.nature.com/articles/s41586-024-08025-4

With images and video, it's less clear exactly what they're doing, but it's watermarking on the pixel leve. From one of their blog posts:

  Videos are composed of individual frames or still images. So we developed a watermarking technique inspired by our SynthID for image tool. This technique embeds a watermark directly into the pixels of every video frame, making it imperceptible to the human eye, but detectable for identification.
https://deepmind.google/discover/blog/watermarking-ai-genera...

Elevenlab's audio watermarking is trivial to shake off with compression, but google claims that synthid is resilient to such manipulation.

pogue•5mo ago
Has anyone identified the SynthID in an image or is there a tool that will determine images are AI generated by checking if it's there?
postalcoder•5mo ago
synthid used to be a waitlist-only tool but you can now check to see if images are made by imagen in google’s cloud console. You have to have a Vertex billing account to use it.

https://console.cloud.google.com/vertex-ai/studio/media/gene...

razster•5mo ago
Ran your same prompt, copypasta, got this. https://i.imgur.com/wOocci9.png Cat on panel 3 seems a bit off. I like the first panel.
Revisional_Sin•5mo ago
Wasn't Imagen 4 released months ago?
nevir•5mo ago
Yes, but usage was very limited / restricted. Now it's widely available
cubefox•5mo ago
I hate that they always announce their image models months before they make them available. They should just announce them later. OpenAI does this much better, with a few days delay at most.
SweetSoftPillow•5mo ago
They were available, just rate limited.
gawa•5mo ago
The webcomics is awful. It feels off, the characters look very fake, unsettling in the way they communicate. The prompt is shown bellow the image, but for me the result looks closer to a prompt "Create lifeless characters reciting marketing slop. They must fake an over exaggerated excitement but it should be clear they don't believe in what they're saying and have no souls".

Also, the prompt specifically ask "Panel 4 should show the cat and dog high-fiving" but the cat is high-fiving ... the cat. Personally I find this hallucinated plot twist good, it makes the ending a bit better. Although technically this is demonstrating a failure of the tool to follow the instructions from the prompt. Interesting choice of example for an official announcement.

typpilol•5mo ago
It's weird because I just asked the basic copilot app the same and got a much better result.

https://i.imgur.com/kSuqCYg.jpeg

SweetSoftPillow•5mo ago
It's definitely just a matter of personal preference. To me, your image looks much worse and has the very distinctive look of the GPT-image-1 model.
cobbzilla•5mo ago
It’s more than visual preferences — his image actually adheres to the specified requirements. it hasn’t been shown that Imagen can do that, which might be a showstopper for many people, regardless of aesthetics.
typpilol•5mo ago
And this is literally just the free tier copilot app from the android store lol. Something I would never use in professional life unlike Claude
tmvphil•5mo ago
The way it totally disregards the many explicit instructions given in the "four panel" comic strip.
ajd555•5mo ago
Same for the poster. Asks for the ship to be going towards the right, and it's clearly doing the opposite
math_dandy•5mo ago
To the left of the "detailed spaceship" I think I see a distortion pattern reminiscent of a cloaked Klingon bird of prey moving to the right. Or I'm just hallucinating patterns in nebular noise.
smokel•5mo ago
As seen from the AI's perspective.
Jare•5mo ago
The ship is reminiscent of Galactica's oldschool vipers. Different, but very similar overall structure.
thanhhaimai•5mo ago
> Imagen 4 Ultra: When your creative vision demands the highest level of detail and strict adherence to your prompts, Imagen 4 Ultra delivers highly-aligned results.

It seems that you may need the "Ultra" version if you want strict prompt adherence.

It's an interesting strategy. Personally, I notice that most of the times I actually don't need strict prompt adherence for image generation. If it looks nice, I'll accept it. If it doesn't, I'll click generate again. For creativity task, following the prompt too strictly might not be the outcome the users want.

mikepurvis•5mo ago
I've found this is an interesting balance with Copilot specifically. Like, on the one hand I'm glad it aims for the bare minimum and doesn't try to refactor my whole codebase on every shot... at the same time, there's certain obvious things where I wish it was able to think a bit bigger picture, or even engage me interactively, like "hey, I can do a self-contained implementation here, but it's a bit gross; it looks like adding dependency X to the project keeps this a one liner— which way should it go?"
hdjrudni•5mo ago
Give me a 'precision' slider then. On one end it should do precisely what you asked, to a T, even if what you asked for is dumb, and on the other end it should try to capture the spirit of what you wanted plus any obvious oversights.
chatmasta•5mo ago
I’ve had good experience with iterative prompting when generating images with Gemini (idk which model — it’s whatever we get with our enterprise subscription at work, presumably the latest.) It’s noticeably better than ChatGPT at incorporating its previous image attempt into my instructions to generate the next iteration.
cubefox•5mo ago
Though that was only Imagen 4 Fast, not Imagen 4 or Imagen 4 Ultra.
userbinator•5mo ago
In the little experimentation I did with AI image generation, it seems more a game of trying multiple times until you get something that actually looks right, so I wonder how many attempts they did.
topato•5mo ago
Right? Came to the comments specifically for this, but am confused by people's responses. With prompt adherence this bad, is it worth the 2 cents you spent on it? I don't see how it's even useful for deciding if you want to use the ultra version, or for anything else really.... Maybe if you want to redo it in Photoshop? But at that point, breaking out the old Wacom tablet and making a composite image would probably be just as time intensive, but with much higher image quality (and none of the tale tell signs of AIgen)
ben_w•5mo ago
Even if you only earn $12/hour, 2 cents is worth it to save just 6 seconds.

An image has to be much worse than that to fail to save you 6 seconds.

That said, this is their own chosen example of what it can do, so I'd have to assume it is much worse than that on average.

hdjrudni•5mo ago
Will this save me 6 seconds? It'll take me longer than that to come up with a prompt, type it, enter it into the service, wait for it to generate, download it...

And again, if I can't use it because it's totally wrong, then... what are we even doing here?

ben_w•5mo ago
> Will this save me 6 seconds? It'll take me longer than that to come up with a prompt, type it, enter it into the service, wait for it to generate, download it...

It will probably save a lot more, but the point is 6 seconds is the threshold at which 2 cents is "worth it".

Good art takes a long time to create.

If this image were representative, errors and all, it would be where you could expect a professional to reach after an hour or so, give or take — I've seen professionals working on an icon set for multiple days, and most webcomics I see, even when it's their full time job and they've got a good system going to make their output easy for themselves, don't tend to do produce outputs like this should have been more than once per day.

> And again, if I can't use it because it's totally wrong, then... what are we even doing here?

On this, I tend to agree. If you have a specific output in mind, quite often they're just wildly wrong. Repeated generations are just plain bad, and the system just can't seem to get what's being asked for.

weego•5mo ago
Hopefully it's better than midjourney at least. Ignoring key parts of the prompt seems to be a feature.
vunderba•5mo ago
Midjourney scores the absolute lowest in terms of prompt adherence against any of the other SOTA models (Kontext, Imagen, gpt-image-1, etc). At this point, its biggest feature is probably as an "exploratory tool" for visualizations by cranking up the chaos and weirdness parameters.
math_dandy•5mo ago
I was going to nitpick the missing apostrophe in movie posters caption ("STARFALLS REVENGE") but its missing from the prompt, too.
sowbug•5mo ago
> its

Muphry's Law strikes again.

decimalenough•5mo ago
> Muphry's

Indeed.

mkl•5mo ago
No, https://en.wikipedia.org/wiki/Muphry's_law.
Rexxar•5mo ago
This one is intended.
cco•5mo ago
Just proves my pet opinion that English apostrophe rules are all universally wrong and confusing.

It's and its are backwards. The latter breaks the possessive s rule.

Speaking of, the possessive s should _always_ be added, no reason to sometimes omit it if the name ends in an s.

Ass backwards, all of it.

smokel•5mo ago
The comments here are priceless. In less than five years time we have gone from "That's impossible" to "Meh, it doesn't solve P=NP if prompted.".

For those commenting in the latter category, it might be worthwhile to read a bit about the underlying technology and share your insights on why it does not deliver.

quantumHazer•5mo ago
this is false and the two things are not correlated.

if you followed news during the GAN cycle you could extrapolate that deep NN could do this type of things. it is really cool that this things happened so fast, but we are talking about companies that have the money to deploy thousands of cars around the globe to collect data, so they absolutely know how to gather data

amelius•5mo ago
You are ignoring all the hyping here.
oinfoalgo•5mo ago
Deep Dream was 2016.

The problem with 2025 is I have seen thousands of better examples than that landscape. The reflections in the lake are complete trash.

Then I think of Veo 3 that is just incredible. So no, it is not impressive if a still from the video model is vastly better than the static image generator from the same company.

I find it especially annoying because I can't think of another company this would happen at. It is just so Google.

CrzyLngPwd•5mo ago
As others have said, with so many errors, it's just more AI slop.

Does the world need yet another AI slop generator?

typpilol•5mo ago
I asked basically copilot the same and got a much better result lol

https://i.imgur.com/kSuqCYg.jpeg

arjie•5mo ago
Interesting how Imagen doesn't suffer this yellow tint effect.
typpilol•5mo ago
I assume that's from the retro word in the prompt
hdjrudni•5mo ago
Seems so. https://imgur.com/a/PWffBhk
cobbzilla•5mo ago
Makes one wonder if there’s a hidden pre/system prompt for Imagen that’s interfering with optimal results.
coldcode•5mo ago
>Image generation may not always trigger:

>The model may output text only. Try asking for image outputs explicitly (e.g. "generate an image", "provide images as you go along", "update the image").

>The model may stop generating partway through. Try again or try a different prompt.

Seriously?

typpilol•5mo ago
Does it still charge 2 cents for that? Lol
ivape•5mo ago
Anyone know if this can be prompted with image to image?
dsrtslnd23•5mo ago
they explicitly do not support that yet.
lacoolj•5mo ago
> the generally availability

One of the biggest corporations in the world and they can't re-read before posting a typo in the title.

Heads be shakin

jimmy76615•5mo ago
I'm glad they can't. The reason large cooperations tend to suck is because some bored management guy cares about typos and invents a process for getting your headlines approved by some other dude who is just as bored and useless.

It's a typo, it doesn't matter.

HocusLocus•5mo ago
I have found Imagein to be a good general purpose editor and we use it to clean up bitmaps, and adjust black points and white points and curves on greyscale, so it is good for preparing B&W greyscale photographs for print to compensate for dot gain in halftone screens on laser printers. Its 'color separation' capability is rudimentary/first draft though and is ridiculously close to inverse RGB rather than CMYK. For good color seps we use Photoshop so I can control undercolor removal.
neom•5mo ago
Are you talking about this google product, or another tool altogether?
anonymousiam•5mo ago
They're probably talking about the original Imagen printing product line from the 1980's. I thought I might be the only one to remember them in this thread, so I did a search for printer and found the GP comment.

https://tug.org/TUGboat/tb02-2/tb03imagen.pdf

nh43215rgb•5mo ago
This is different from nano banana that others are talking about as the new google model?
a1371•5mo ago
In the couple of prompts I gave it, it's better than the last version but I feel that Google is sacrificing quality for the sake of speed. While it's a lot faster, the output is not as good as OpenAI.

Meanwhile Veo3 is far better than the OpenAI's equivalent. I assume speed is not a priority there; both take their time.

iandanforth•5mo ago
I tried the following prompt and other than producing a four panel comic that was black and white it completely ignored every other instruction. This was with 4 ultra. Maybe someone else will have better luck but the failure seemed stable.

''' A four panel comic strip. Simple black on white. Stick figures for characters. In the first panel there is a stick figure man and a stick figure bird eating bird seed at his feet. He is slightly hunched over to show he is looking at the bird. In the second panel. He is more hunched over looking more closely at the bird. In the third panel he is even more hunched over practically with his head to the bird, he is crouched down, knees bent, hands on thighs. In the upper left of the third panel the tip of an enormous beak can be seen, but it's only a few lines so could be anything. In the final panel the beak has gobbled up the man and his arms and legs are flailing outside of the beak while the small bird continues to eat birdseed on the ground. '''

vunderba•5mo ago
I've updated my GenAI Comparison site to include Imagen4 Ultra, so now we have four Google related generative models (Gemini Flash, Imagen3, Imagen4, and Imagen4 Ultra).

Despite claims that Ultra supports improved strict prompt adherence, we saw no evidence that it scored any better than Imagen 4 and in some cases seemed to ignore the prompt altogether (see the "Not the Bees" comic). In many cases, it also seemed much less steerable than Imagen3 requiring many of the prompts to be rewritten.

https://genai-showdown.specr.net?models=IMAGEN_3,IMAGEN_4,IM...

gizmodo59•5mo ago
Looks like OpenAI imagegen is still the SOTA?
BoorishBears•5mo ago
LMArena (https://lmarena.ai/?chat-modality=image) currently has a model codenamed `nano-banana` that is generally strictly better than gpt-image-1

There's some speculation it's Gemini 3's multi-modal output, and other speculation that it's an OpenAI model. Hard to definitively since these models tend to hallucinate when interrogated.

vunderba•5mo ago
Other than LMArena and a website I can't verify is authentic, it's hard for me to run tests on this new model but I have serious doubts that it'll pass my more difficult prompts such drawing a valid 2d maze with clearly marked exit and entrance.

gpt-image-1 is in a class all of its own with regards to prompt adherence in the "text to image" category.

Once it hits GA I'll put it through its paces and add it to the site!

cubefox•5mo ago
I tested it with generating a man holding a Penrose triangle made of wood. While gpt-image-1 succeeded, nano-banana failed. The aesthetics of nano-banana did look much better though. I would guess that it is a diffusion model, based on the fact that it adds irrelevant but pretty background details, which gpt-image-1 tends to avoid.
djha-skin•5mo ago
Maybe the fact that they're working on imagen explains why Gemini is just so bad.
kingstnap•5mo ago
The undespecification of
patates•5mo ago
Prompt: realistic photo of a duck reading a book about web architecture

Result: https://imgur.com/a/Ri0yb31

This is supposed to be SOTA?