
Nano Banana can be prompt engineered for nuanced AI image generation

https://minimaxir.com/2025/11/nano-banana-prompts/
107•minimaxir•1h ago

Comments

doctorpangloss•47m ago
lots of words

okay, look at imagen 4 ultra:

https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%...

Is Imagen thinking?

Let's compare to gemini 2.5 flash image (nano banana):

look carefully at the system prompt here: https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%...

compare to ideogram, with prompt rewriting: https://ideogram.ai/g/GRuZRTY7TmilGUHnks-Mjg/0

without prompt rewriting: https://ideogram.ai/g/yKV3EwULRKOu6LDCsSvZUg/2

We can do the same exercises with Flux Kontext for editing versus Flash-2.5, if you think that editing is somehow unique in this regard.

Is prompt rewriting "thinking"? My point is, this article can't answer that question without dElViNg into the nuances of what multi-modal models really are.

gryfft•37m ago
Can you provide screenshots or links that don't require a login?
PunchTornado•28m ago
Sorry, but I don't understand your post. Those links don't work.
dostick•46m ago
Use Google AI Studio to submit requests. To remove the watermark, open the browser developer tools, right-click the request for the “watermark_4” image, and select the option to block it. From the next generation onward there will be no watermark!
doctorpangloss•45m ago
this is an excellent question. look at my other comment to see what happens when you do that.
squigz•44m ago
I'm getting annoyed by using "prompt engineered" as a verb. Does this mean I'm finally old and bitter?

(Do we say we software engineered something?)

vpShane•39m ago
You're definitely old and bitter, welcome to it.

You CREATED something, and I like to think that creating things that I love and enjoy and that others can love and enjoy makes creating things worth it.

squigz•29m ago
Don't get me wrong, I have nothing against using AI as an expression of creativity :)
officeplant•36m ago
Not really, since "prompt engineering" can be tossed in the same pile as "vibe coding." It's just people coping with not developing the actual skills to produce the desired products.
bongodongobob•31m ago
Couldn't care less. I don't need to know how to do literally everything. AI fills in my gaps and I'm a ton more productive.
squigz•30m ago
I wouldn't bother trying to convince people who are upset that others have figured out a way to use LLMs. It's not logical.
koakuma-chan•20m ago
Try getting a small model to do what you want quickly with high accuracy, high quality, etc, and using few tokens per request. You'll find out that prompt engineering is real and matters.
miladyincontrol•33m ago
There's a lot these models can do, but I despise it when people suggest they can do edits "with only the necessary aspects changed".

No, that simply is not true. If you actually compare the before and after, you can see it still regenerates all the details in the "unchanged" areas. Texture, lighting, sharpness, even scale: it's all different, even if varyingly similar to the original.

Sure, they're cute for casual edits, but it really pains me to see people suggesting these things are suitable replacements for actual photo editing. Especially when it comes to people, or details outside their training data, there's a lot of nuance that can be lost as it regenerates them, no matter how you prompt things.

Even if you

StevenWaterman•32m ago
That is true for gpt-image-1 but not nano-banana. They can do masked image changes
minimaxir•24m ago
Nano Banana is different and much better at edits without changing texture/lighting/sharpness/color balance, and I am someone who is extremely picky about it. That's why I add the note that Gemini 2.5 Flash is aware of segmentation masks; my hunch is that this is the reason.
BoredPositron•13m ago
Nano Banana has really low spatial scaling and doesn't degrade details the way other models do.
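
One way to test the disagreement above on your own edits is to measure how many pixels changed outside the region you asked the model to edit. The sketch below is only an illustration, not anything taken from the article or from Google's tooling. It assumes both images are already decoded into same-sized nested lists of RGB tuples and that you supply a boolean mask of the intended edit region; the name `changed_outside_mask` and the `tolerance` parameter are hypothetical.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # one RGB pixel
Image = List[List[Pixel]]      # rows of pixels
Mask = List[List[bool]]        # True where an edit was requested


def changed_outside_mask(before: Image, after: Image, mask: Mask,
                         tolerance: int = 0) -> float:
    """Return the fraction of pixels outside the mask that changed.

    A pixel counts as changed when any RGB channel differs by more
    than `tolerance` between `before` and `after`.
    """
    if not (len(before) == len(after) == len(mask)):
        raise ValueError("images and mask must have the same height")

    outside = 0
    changed = 0
    for row_before, row_after, row_mask in zip(before, after, mask):
        if not (len(row_before) == len(row_after) == len(row_mask)):
            raise ValueError("images and mask must have the same width")
        for px_before, px_after, in_edit_region in zip(row_before, row_after, row_mask):
            if in_edit_region:
                continue  # changes inside the requested region are expected
            outside += 1
            if any(abs(a - b) > tolerance for a, b in zip(px_before, px_after)):
                changed += 1

    # If the mask covers the whole image, nothing outside it can have changed.
    return changed / outside if outside else 0.0
```

A result near 0.0 at a small tolerance would support the "only the masked area changes" claim for that edit, while a large fraction would support the "everything gets regenerated" claim. A nonzero tolerance helps separate compression noise from real regeneration.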
mkagenius•32m ago
> Nano Banana is still bad at rendering text perfectly/without typos as most image generation models.

I figured out that if you write the text in Google Docs and share a screenshot of it with Nano Banana, it will not make any spelling mistakes.

So a prompt like "can you write my name on this Wimbledon trophy, both images are attached. Use them" will work.

minimaxir•21m ago
Google's example documentation for Nano Banana does demo that pipeline: https://ai.google.dev/gemini-api/docs/image-generation#pytho...

That's on my list of blog-post-worthy things to test, namely text rendering to image in Python directly and passing both input images to the model for compositing.

ml-anon•28m ago
"prompt engineered"...i.e. by typing in what you want to see.
harpiaharpyja•23m ago
Not all models can actually do that if your prompt is particular
darepublic•15m ago
"amenable to highly specific and granular instruction"
simonw•14m ago
... and then iterating on that prompt many times, based on your accumulated knowledge of how best to prompt that particular model.
minimaxir•8m ago
Case in point, the final image in this post (the IP bonanza) took 28 iterations of the prompt text to get something maximally interesting. That is why the prompt is very particular about the constraints it invokes, such as specifying "distinct" characters and specifying that they are present from "left to right": the model kept exploiting that ambiguity.
pfortuny•17m ago
Well, I just asked it for a 13-sided irregular polygon (is it that hard?)…

https://imgur.com/a/llN7V0W

BoredPositron•14m ago
The kicker for Nano Banana is not prompt adherence, which is really nice to have, but the fact that it's either working in pixel space or with really low spatial scaling. It's the only model that doesn't kill your details because of VAE encode/decode.
sebzim4500•4m ago
It's really cool how good of a job it did rendering a page given its HTML code. I was not expecting it to do nearly as well.
