Gemini 2.5 Flash Image

https://deepmind.google/models/gemini/image/
232•meetpateltech•2h ago
Developers Announcement: https://developers.googleblog.com/en/introducing-gemini-2-5-...

Comments

qoez•1h ago
Anyone know how it handles '1920s nazi officer'? They stopped doing humans for a while but now I see they're back so I wonder how they're handling the criticism they got from that
napo•1h ago
it said: "I can create images about lots of things but not that. Can I try a different one for you?"
napo•1h ago
when giving more context it replied:

""" Unfortunately, I can't generate images of people. My purpose is to be helpful and harmless, and creating realistic images of humans can be misused in ways that are harmful. This is a safety policy that helps prevent the generation of deepfakes, non-consensual imagery, and other problematic content.

If you'd like to try a different image prompt, I can help you create images of a wide range of other subjects, such as animals, landscapes, objects, or abstract concepts. """

bastawhiz•1h ago
What a weird rejection. You have to scroll pretty far in the article to see an example output that doesn't have a realistic depiction of a person.
tanaros•1h ago
The rejection message doesn’t seem to be accurate. I tried “happy person” as a prompt in AI Studio and it generated a happy human without any complaints.

It’s possible that they relaxed the safety filtering to allow humans but forgot to update the error message.

geysersam•51m ago
It's unfortunate they can't just explain the real reason they don't want to generate the image:

"Unfortunately I'm not able to generate images that might cause bad PR for Alphabet(tm) or subsidiaries. Is there anything else I can generate for you?"

martythemaniak•1h ago
What is a "1920s nazi officer"? What do they look like?
detaro•1h ago
brown uniform, red armband with swastika was the usual SA look in the 1920s.
rvnx•58m ago
Mh. Apparently like this if we ask AI:

https://postimg.cc/xX9K3kLP

...

sorokod•12m ago
The SA article has some photos

https://en.m.wikipedia.org/wiki/Sturmabteilung

Der_Einzige•1h ago
The moment the weights are on huggingface, someone will orthogonalize/abliterate the model and make it uncensored.
rvnx•1h ago
BigBanana would be a good name for that future OnlyFans model
dpoloncsak•1h ago
I've been looking for a whitepaper or something. So far I've found this...which is not a whitepaper but seems relevant

https://developers.googleblog.com/en/introducing-gemini-2-5-...

It seems like this is 'nano-banana' all along

lemonish97•1h ago
Yes, they mention that the model is aka nano-banana in the blogpost
lifthrasiir•1h ago
FYI, this is the famed nano-banana model, which has now been renamed to gemini-2.5-flash-image-preview in LMArena.
Mistletoe•1h ago
https://medium.com/data-science-in-your-pocket/what-is-googl...

For people like me that don’t know what nano-banana is.

mock-possum•59m ago
Wow I hate the ‘voice’ in that article - big if true though.
daemonologist•28m ago
I suspect the "voice" is a language model with a bad system prompt. (Possibly the author's own words run through an LLM, to be charitable.)
postscapes1•55m ago
This is what i came here to find out. Thanks.
patates•1h ago
It seems that they still block access from Europe, or from Germany at least.
punkpeye•1h ago
Use one of the router services
kridsdale1•1h ago
Get less contradictory regulations, then.
kneegerm•1h ago
They vote [well they don't] for it, then they complain, then they downvote and seethe. The European experience.
rvnx•1h ago
In the EU they forbid us newspapers from non-approved countries, impose cookie banners everywhere, and now block porn. Soon they will forbid some AI models which have not passed EU censorship ("safety") validation. Because we all know that governments (or even Google with Android) are better at knowing what is safest for you.

https://digital-strategy.ec.europa.eu/en/news/eu-rules-gener...

krige•46m ago
How do you do, fellow europeans?
elorant•1h ago
I can access it from Greece through AI Studio just fine.
Narciss•1h ago
Use it on fal.ai
kumarm•55m ago
Since API currently is not working (seems rate limits not set for Image Generation yet) I tried on fal.

Definitely inferior to results I see on AI Studio and image generation time is 6s on AI Studio vs 30 seconds on Fal.AI

beklein•1h ago
It works fine in OpenRouter
mindprince•1h ago
What is the difference between Gemini Flash Image models and the Imagen models?
og_kalu•1h ago
Imagen is a diffusion text to image model. You write some text that describes your image, you get an image out and that's it.

Flash Image is an image (and text) predicting large language model. In a similar fashion to how trained LLMs can manipulate/morph text, this can do that for images as well. Things like style transfer, character consistency etc.

You can communicate with it in a way you can't for imagen, and it has a better overall world understanding.

raincole•19m ago
Imagen: Stable Diffusion, but by Google

Gemini Flash Image: ChatGPT image, but by Google

mkl•1h ago
That lamp example is pretty impressive (though it's hard to know how cherry-picked it is). The lamp is plugged in, it's lighting the things in the scene, it's casting shadows.
j_m_b•1h ago
If this can do character consistency, that's huge. Just make it do the same for video...
ACCount37•49m ago
It's probably built on reused "secret sauce" from the video generation models.
asdev•1h ago
Looks like AI image generation is converging to a local maximum as well
therealmarv•1h ago
What is the max input and output resolution of images?

This is why I'm sticking mostly to Adobe Photoshop's AI editing because there are no restrictions in that regard.

abdusco•1h ago
Around 1 megapixel, AFAICT.
elorant•1h ago
I have a certain use case for such image generators: I feed them an entire news article fetched from the BBC and ask for an image to accompany the article. Thus far only Midjourney managed to understand context. And now this, which is even more impressive. We live in interesting times.
oracleclyde•22m ago
I just tried it inside Gemini with a Medium article. Here's my prompt: "Read the article at this url and provide a hero image that incapsulates the message the author wants to convey: https://bioneers.org/supreme-oligarchy-billionaires-supreme-..."

The response was a summary of the article that was pretty good, along with an image that dagnabbit, read the assignment.

Narciss•1h ago
Nano banana is here!
keepamovin•1h ago
Those examples are gorgeous and amazing. This is really cool.
lyu07282•1h ago
still fails at analog clocks, if anyone else was also wondering
kumarm•1h ago
Seems to be failing at API Calls right now with "You exceeded your current quota, please check your plan and billing details. For more information on this error,"

Hope they get API issues resolved soon.

stuckinhell•1h ago
Is this the "nano banana" thing the art ai world was going crazy about recently ?
SweetSoftPillow•1h ago
Yes it is
abdusco•1h ago
I love that it's substantially faster than ChatGPT's image generation. ChatGPT takes ages, so slow that the app tells you not to wait and sends you a notification when the generation finishes.
andrewinardeer•22m ago
"Generate an image of OpenAI investors after using Gemini 2.5 Flash Image"
adidoit•1h ago
Very impressive.

I have to say while I'm deeply impressed by these text to image models, there's a part of me that's also wary of their impact. Just look at the comments beneath the average Facebook post.

knicholes•1h ago
I got scammed for $15k BTC last weekend during the (failed) SpaceX launch. I believed the deepfake of Elon and transferred it over. The tech is very convincing, and the attacks are increasingly sophisticated.
lionkor•1h ago
Not to victim-shame or anything, but that sounds like more than one safety mechanism failed, the convincing tech being only a rather small part of it?
hansonkd•50m ago
I think the biggest failure is on the part of the companies hosting these streams.

It's been a while, but I remember seeing streams of Elon offering to "double your bitcoin"; the reasoning was that he wanted to increase adoption and load test the network. Just send some bitcoin to some address and he will send it back double!

But the thing was, it was on YouTube, hosted on an imposter Tesla page. The stream had been going on for hours and had over ten thousand people watching live. If you searched "Elon Musk Bitcoin" on Google during the stream, Google actually pushed that video as the first result.

Say what you want about the victims of the scam, but I think it should be pretty easy for YouTube or other streaming companies to have a simple rule that filters all live streams with Elon Musk + (Crypto|BTC|etc) in the title, and likewise all YouTube pages with "Tesla", "SpaceX", etc. in the title.
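
A rule like the one described could be sketched roughly as below. This is only an illustration: the function names `flag_stream` and `flag_channel` and the keyword lists are hypothetical, not anything YouTube is known to use, and a real system would also have to cope with misspellings, lookalike characters, and legitimate channels.

```python
import re

# Hypothetical keyword patterns; the exact lists are assumptions for illustration.
NAME = re.compile(r"\belon\s+musk\b", re.IGNORECASE)
CRYPTO = re.compile(r"\b(crypto\w*|btc|bitcoin)\b", re.IGNORECASE)
BRANDS = re.compile(r"\b(tesla|spacex)\b", re.IGNORECASE)


def flag_stream(title: str) -> bool:
    """Flag a live stream whose title mentions Elon Musk AND a crypto keyword."""
    return bool(NAME.search(title) and CRYPTO.search(title))


def flag_channel(page_name: str) -> bool:
    """Flag a page whose name contains one of the impersonated brand names."""
    return bool(BRANDS.search(page_name))


if __name__ == "__main__":
    for title in ["Elon Musk LIVE: Bitcoin giveaway", "Elon Musk talks rockets"]:
        print(title, "->", flag_stream(title))
```

Flagged items would presumably go to manual review rather than being removed outright, since the real Tesla and SpaceX pages match the brand pattern too.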

lionkor•46m ago
I feel like somehow that would lessen it, but not really help much? There are obviously people with too much money in BTC who are trying to take any gamble to increase its value. It sounds like a deeper societal issue.
yifanl•59m ago
This presumes that you're okay with giving the real Elon your wallet but not a fake Elon, but why?
Imustaskforhelp•58m ago
Please pardon me, since I don't know if this is satirical or not. I wish you would clarify.

Because if this is real, then the world is cooked.

If not: I think it might be real, and the only reason I suspect it's a joke is that you are on Hacker News. So either you are joking, or the tech has gotten so convincing that even people on Hacker News (whom I hold to a fair standard) are getting scammed.

I have a lot of questions if it's true, and I am sorry for your loss if it is and this isn't satire, but I'd love it if you could tell me whether it's a satirical joke or not.

bauruine•54m ago
I guess it was something like [0] The Nigerian prince is now a deep fake Elon but the concept is the same. You need to send some money to get way more back.

[0]: https://www.ncsc.admin.ch/ncsc/en/home/aktuell/im-fokus/2023...

Imustaskforhelp•51m ago
hm, but isn't it wild thinking that elon is talking to you and asking you for 15k , like bro has the money of his lifetime, why would he ask you?

It doesn't make that much sense idk

Jensson•14m ago
Even Elon could lose his credit card or something, the story they spin is always something like that "I am rich but in a pickle, please send some money here and then I'll send you back 10x as much tomorrow when I get back to my account", but of course they never send it back.

Edit: But of course Elon would call someone he knows rather than a stranger, rich people know a lot of people so of course they would never contact you about this.

tantalor•12m ago
That's an "advance fee" scam.

https://en.wikipedia.org/wiki/Advance-fee_scam

kamranjon•58m ago
Would you consider writing a blog post about this experience? I'm incredibly interested in learning more details about how this unfolded.
paul7986•36m ago
Well just go on this guy's lawn and you will find your answer lol
pennaMan•57m ago
hey, I got a bridge to sell you, was $20k but we can lower it to $15k if you pay in BTC
testplzignore•39m ago
You're paying too much for your bridges man. Who's your bridge guy?
michelb•55m ago
These SpaceX scams are rampant on youtube and highly, highly lucrative. It’s crazy and you have to be very vigilant, as whatever is promised lines up with Elon’s MO.
rangerelf•10m ago
Why would anyone give them any money AT ALL?

It's not like they're poor or struggling.

Am I missing something?

nickthegreek•8m ago
it requires zero vigilance if you dont play the game.
jaredklewis•50m ago
This comment is perfect.
latchkey•49m ago
As always, it is the replies that make it worth it. GopherGeyser strikes again!
fxtentacle•36m ago
Plot twist: It wasn't a deepfake.

You sent your wallet to the real Elon and he used it as he saw fit. ;)

pjerem•7m ago
That’s what they said : they have been scammed !
AbraKdabra•17m ago
I don't mean to be rude, but this sounds like natural selection doing its work.
amatajohn•8m ago
the modern turing test:

am i getting scammed by a billionaire or an AI billionaire?

UltraSane•7m ago
On the balance of probabilities it being a scam is vastly more likely than Elon actually wanting to contact you. Why would Elon need $15k in bitcoin?

It seems like money naturally flows from the gullible to the Machiavellian.

postalcoder•48m ago
I have been testing Google's SynthID for images and while it isn't perfect, it is very good, to the point that I felt some relief from that same creeping dread over what these images will do to perceived reality.

It survives a lot of transformations like compression, cropping, and resizing. It even survives alterations like color filtering and overpainting.

sigmar•43m ago
Facebook isn't going to implement detection though. Many (if not most) of the viral pictures are AI-generated, and Facebook is incentivized to let its users get fooled to generate endless scrolling.
paul7986•39m ago
Along with those being fooled there are many comments saying this is fake, AI trash and etc. That portion of the commenters are teaching the ignorant and soon no one will believe what they see on the Internet as real.
MitPitt•34m ago
Facebook comments are obviously botted too
nikanj•5m ago
The comments are probably AI-generated too, because a site that seems to have lots of other people on it is more appealing than an empty wasteland
radarsat1•1h ago
I've had a task in mind for a while now that I've wanted to do with this latest crop of very capable instruction-following image editors.

Without going into detail, basically the task boils down to, "generate exactly image 1, but replace object A with the object depicted in image 2."

Image 2 is a generic, front-facing version of the object. I want the model to place it perfectly in the scene in place of the existing object, which I would ideally identify exactly by specifying its position, or failing that by describing precisely what to do.

For models that can't accept multiple images, I've tried a variation where I put a blue box around the object that I want to replace, and paste the object that I want it to put there at the bottom of the image on its own.

I've tried some older models, ChatGPT, qwen-image last week, and just now this one. They all fail at it. To be fair, this model got pretty damn close: it replaced the wrong object in the scene, but in nearly the right position, and the object was perfectly oriented and lit. But it was wrong. (With the bounding box method it should have been able to identify exactly what I wanted. Instead it removed the bounding box and replaced a different object in a different but close-by position.)

Are there any models that have been specifically trained to be able to infill or replace specific locations in an image with reference to an example image? Or is this just like a really esoteric task?

So far all the in-filling models I've found are only based on text inputs.

rushingcreek•56m ago
Yes! There is a model called ACE++ from Alibaba that is specifically trained to replace masked areas with a reference image. We use it in https://phind.design. It does seem like a very esoteric and uncommon task though.
ceroxylon•14m ago
I don't think it is that esoteric, that sounds like deepfake 101. If you don't mind answering, does Phind do anything to prevent / mitigate this?
awestroke•1h ago
Internal server error. lol
GaggiX•1h ago
An image seems to be 256 tokens, looking at the AI Studio tab, so you could generate 3906.25 images per 1M tokens. That seems like a lot, unless I'm wrong somewhere.

Edit: the blog post is now loading and reports "1290 output tokens per image", even though AI Studio showed something different.
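
For anyone who wants to redo the arithmetic with both figures, here is a minimal sketch. The helper name `images_per_million_tokens` is made up for illustration, and it assumes every image costs a fixed number of output tokens, which may not hold in practice.

```python
def images_per_million_tokens(tokens_per_image: int) -> float:
    """Number of images that fit in 1M output tokens at a fixed per-image cost."""
    return 1_000_000 / tokens_per_image


# 256 is the figure seen in the AI Studio tab; 1290 is the figure from the blog post.
ai_studio_estimate = images_per_million_tokens(256)
blog_post_estimate = images_per_million_tokens(1290)

print(f"{ai_studio_estimate:.2f}")  # 3906.25
print(f"{blog_post_estimate:.2f}")  # 775.19
```

So the blog post figure works out to roughly a fifth as many images per 1M tokens as the AI Studio number suggested.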

beyonddream•1h ago
“Internal server error

Sorry, there seems to be an error. Please try again soon.”

Never thought I would ever see this on a Google-owned website!

lionkor•58m ago
A cheap quip would be "it's vibe-coded", but that might actually very well be the case at this point!
fariszr•51m ago
This is the gpt 4 moment for image editing models. Nano banana, aka Gemini 2.5 Flash Image, is insanely good. It made a 171-point Elo jump on LMArena!

Just search nano banana on Twitter to see the crazy results. An example: https://x.com/D_studioproject/status/1958019251178267111

ceroxylon•29m ago
It seems like every combination of "nano banana" is registered as a domain with their own unique UI for image generation... are these all middle actors playing credit arbitrage using a popular model name?
bonoboTP•12m ago
I'd assume they are just fake, take your money and use a different model under the hood. Because they already existed before the public release. I doubt that their backend rolled the dice on LMArena until nano-banana popped up. And that was the only way to use it until today.
koakuma-chan•21m ago
Why is it called nano banana?
Jensson•16m ago
Engineers often have silly project names internally, then some marketing team rewrites the name for public release.
dcre•8m ago
Alarming hands on the third one: it can't decide which way they're facing. But Gemini didn't introduce that, it's there in the base image.
echelon•7m ago
> This is the gpt 4 moment for image editing models.

No it's not.

We've had rich editing capabilities since gpt-image-1, this is just faster and looks better than the (endearingly called) "piss filter".

Flux Kontext, SeedEdit, and Qwen Edit are all also image editing models that are reasonably capable. Qwen Edit especially.

Flux Kontext and Qwen are also possible to fine tune.

We've left the Stable Diffusion and Midjourney days of prompt-only image generation.

notsylver•51m ago
I digitised our family photos but a lot of them were damaged (shifted colours, spills, fingerprints on film, spots) that are difficult to correct for so many images. I've been waiting for image gen to catch up enough to be able to repair them all in bulk without changing details, especially faces. This looks very good at restoring images without altering details or adding them where they are missing, so it might finally be time.
zwog•25m ago
Do you happen to know some software to repair/improve video files? I'm in the process of digitizing a couple of Video 2000 and VHS cassettes of childhood memories for my mom, who has started suffering from dementia. I have a pretty streamlined setup for digitizing the videos but I'd like to improve the quality a bit.
Almondsetat•25m ago
All of the defects you have listed can be automatically fixed by using a film scanner with ICE and a software that automatically performs the scan and the restoration like Vuescan. Feeding hundreds (thousands?) of photos to an experimental proprietary cloud AI that will give you back subpar compressed pictures with who knows how many strange artifacts seems unnecessary
Barbing•24m ago
Hope it works well for you!

In my eyes, one specific example they show (“Prompt: Restore photo”) deeply AI-ifies the woman’s face. Sure it’ll improve over time of course.

indigodaddy•11m ago
Another question/concern for me: if I restore an old picture of my Gramma, will my Gramma (or a Gramma that looks strikingly similar) ever pop up on other people's "give me a random Gramma" prompts?
danielbln•24m ago
That time had arrived a few months ago already with Flux Kontext (https://bfl.ai/models/flux-kontext).
modeless•49m ago
This model is very impressive. Yesterday (as nano-banana) I gave it a photo of an indoor scene with a picture hanging on a wall, and asked it to replace the picture on the wall with a copy of the whole photo. It worked perfectly the first time.

It didn't succeed in doing the same recursively, but it's still clearly a huge advance in image models.

jawns•41m ago
I was able to upload my kids' back-to-school photos and ask nano-banana to turn them into a goth, an '80s workout girl, and a tracksuit mafioso. The results were incredibly believable, and I was able to prank my mom with them!
mclau157•36m ago
I could see this destroying a lot of jobs like photography, editing, marketing, etc.
simianwords•27m ago
I like it but it is very restricted. I can't modify people's faces etc.
sandreas•26m ago
I wonder if this could be used for preprocessing documents before doing OCR...
bsenftner•21m ago
All these image models are time vampires and need to be looked at with very suspicious eyes. Try to make a room - that's easy, now try to make multiple views of the same room - next to impossible. If one is intending to use these image models for anything that requires consistency of imagery, forget it.
matsemann•21m ago
Half the time I ask Gemini to generate some image it claims it doesn't have the capability. And in general I've felt it's so hard to actually use the features Google announces. Like, a third of them are in one product, some in another which I can't use, and I have no idea what or where I should pay to get access. So confusing.
Al-Khwarizmi•12m ago
Yeah, in fact the website says "Try it in Gemini" and I'm not sure if I'm already trying it or not - if I choose Gemini 2.5 Flash in the regular Gemini UI, I'm using this?
sega_sai•6m ago
I think not. Because at least in the aistudio there is a dedicated gemini-2.5-flash-image-preview model. So I am assuming it is not available in the standard gemini chat window.
uejfiweun•18m ago
This is pretty remarkable, I'm having a lot of fun playing around with this. Kudos to Google.
kemyd•12m ago
I don't get the hype. Tested it with the same prompts I used with Midjourney, and the results are worse than in Midjourney a year ago. What am I missing?
bonoboTP•10m ago
The hype is about image editing, not pure text-to-image. Upload an input image, say what you want changed, get the output. That's the idea. Much better preservation of characters and objects.
kemyd•9m ago
Thanks for clarifying this. That makes a lot more sense.
cdrini•10m ago
Hmm, I think the hype is mainly for image editing, not generating. Although note I haven't used it! How are you testing it?
kemyd•6m ago
I tested it with two prompts:

// In this one, Gemini doesn't understand what "cinematic" is

"A cinematic underwater shot of a turtle gracefully swimming in crystal-clear water [...]"

// In this one, the reflection in the water in the background has different buildings

"A modern city where raindrops fall upward into the clouds instead of down, pedestrians calmly walking [...]"

Midjourney created both perfectly.
