frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Nano Banana can be prompt engineered for nuanced AI image generation

https://minimaxir.com/2025/11/nano-banana-prompts/
128•minimaxir•1h ago•32 comments

Zed is our office

https://zed.dev/blog/zed-is-our-office
268•sagacity•3h ago•126 comments

Launch HN: Tweeks (YC W25) – Browser extension to de-enshittify the web

https://www.tweeks.io/onboarding
79•jmadeano•3h ago•68 comments

A hemp industry shutdown has just begun

https://www.courier-journal.com/story/opinion/contributors/2025/11/13/rand-paul-congress-funding-...
111•bilsbie•1h ago•35 comments

GitHub Partial Outage

https://www.githubstatus.com/incidents/1jw8ltnr1qrj
130•danfritz•4h ago•54 comments

Checkout.com hacked, refuses ransom payment, donates to security labs

https://www.checkout.com/blog/protecting-our-merchants-standing-up-to-extortion
428•StrangeSound•9h ago•210 comments

SIMA 2: An agent that plays, reasons, and learns with you in virtual 3D worlds

https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d...
89•meetpateltech•3h ago•18 comments

Blender Lab

https://www.blender.org/news/introducing-blender-lab/
147•radeeyate•5h ago•39 comments

The Useful Personal Computer

https://technicshistory.com/2025/11/02/the-useful-personal-computer/
40•cfmcdonald•1w ago•3 comments

SlopStop: Community-driven AI slop detection in Kagi Search

https://blog.kagi.com/slopstop
19•msub2•14m ago•1 comments

BAML is hiring compilers/rust engineers (YC W23)

https://github.com/BoundaryML/baml/tree/canary/jobs
1•hellovai•2h ago

Kratos - Cloud native Auth0 open-source alternative (self-hosted)

https://github.com/ory/kratos
94•curtistyr•5h ago•62 comments

We cut our Mongo DB costs by 90% by moving to Hetzner

https://prosopo.io/blog/we-cut-our-mongodb-costs-by-90-percent/
139•arbol•3h ago•97 comments

Think in Math. Write in Code

https://www.jmeiners.com/think-in-math/
18•alabhyajindal•4d ago•5 comments

Denx (a.k.a. U-Boot) Retires

https://www.denx.de/
62•synergy20•5h ago•11 comments

How To Build A Smartwatch: Software

https://ericmigi.com/blog/how-to-build-a-smartwatch-software-setting-expectations-and-roadmap/
47•teekert•4h ago•15 comments

Heartbeats in Distributed Systems

https://arpitbhayani.me/blogs/heartbeats-in-distributed-systems/
59•sebg•5h ago•21 comments

Family Computing Interviews Jack Tramiel After Atari Purchase (1985)

https://computeradsfromthepast.substack.com/p/family-computing-interviews-jack
5•rbanffy•1w ago•0 comments

Android developer verification: Early access starts

https://android-developers.googleblog.com/2025/11/android-developer-verification-early.html
1247•erohead•18h ago•579 comments

Human Fovea Detector

https://www.shadertoy.com/view/4dsXzM
405•AbuAssar•18h ago•82 comments

Cursor: Past, Present, and Future

https://cursor.com/blog/series-d
33•whizusukite•5h ago•16 comments

IBM Patented Euler's 200 Year Old Math Technique for 'AI Interpretability'

https://leetarxiv.substack.com/p/ibm-patented-eulers-fractions
16•busymom0•36m ago•0 comments

A Challenge to Roboticists: My Humanoid Olympics

https://spectrum.ieee.org/humanoid-robot-olympics
34•quapster•1w ago•4 comments

Steam Machine

https://store.steampowered.com/sale/steammachine
2530•davikr•1d ago•1199 comments

Android 16 QPR1 is being pushed to the Android Open Source Project

https://grapheneos.social/@GrapheneOS/115533432439509433
219•uneven9434•15h ago•117 comments

COBOL to Kotlin via Formal Models (IR and Alloy and Golden Master)

https://marcoeg.medium.com/from-cobol-to-kotlin-795920b1f371
32•marcoeg•5d ago•8 comments

Reverse Engineering Yaesu FT-70D Firmware Encryption

https://landaire.net/reversing-yaesu-firmware-encryption/
118•austinallegro•12h ago•16 comments

Homebrew no longer allows bypassing Gatekeeper for unsigned/unnotarized software

https://github.com/Homebrew/brew/issues/20755
319•firexcy•21h ago•245 comments

Britain's railway privatization was an abject failure

https://www.rosalux.de/en/news/id/53917/britains-railway-privatization-was-an-abject-failure
420•robtherobber•5h ago•370 comments

Continuous Autoregressive Language Models

https://arxiv.org/abs/2510.27688
97•Anon84•1w ago•7 comments
Open in hackernews

Nano Banana can be prompt engineered for nuanced AI image generation

https://minimaxir.com/2025/11/nano-banana-prompts/
118•minimaxir•1h ago

Comments

doctorpangloss•1h ago
lots of words

okay, look at imagen 4 ultra:

https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%...

Is Imagen thinking?

Let's compare to gemini 2.5 flash image (nano banana):

look carefully at the system prompt here: https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%...

compare to ideogram, with prompt rewriting: https://ideogram.ai/g/GRuZRTY7TmilGUHnks-Mjg/0

without prompt rewriting: https://ideogram.ai/g/yKV3EwULRKOu6LDCsSvZUg/2

We can do the same exercises with Flux Kontext for editing versus Flash-2.5, if you think that editing is somehow unique in this regard.

Is prompt rewriting "thinking"? My point is, this article can't answer that question without dElViNg into the nuances of what multi-modal models really are.

gryfft•51m ago
Can you provide screenshots or links that don't require login
PunchTornado•41m ago
sorry, but I don't understand you post. those links don't work.
dostick•1h ago
Use Google AI Studio to submit requests, and to remove watermark, open browser development tools and right click on request to “watermark_4” image and select to block it. And from next generation there will be no watermark!
squigz•57m ago
I'm getting annoyed by using "prompt engineered" as a verb. Does this mean I'm finally old and bitter?

(Do we say we software engineered something?)

vpShane•52m ago
You're definitely old and bitter, welcome to it.

You CREATED something, and I like to think that creating things that I love and enjoy and that others can love and enjoy makes creating things worth it.

squigz•43m ago
Don't get me wrong, I have nothing against using AI as an expression of creativity :)
malcolmxxx•16m ago
Create? So I have created all that code I'm running on my site, yes is bad I know, but thank you very much! Such creative guy I was!
officeplant•49m ago
Not really since "prompt engineering" can be tossed in the same pile as "vibe coding." Just people coping with not developing the actual skills to produce the desired products.
bongodongobob•44m ago
Couldn't care less. I don't need to know how to do literally everything. AI fills in my gaps and I'm a ton more productive.
squigz•43m ago
I wouldn't bother trying to convince people who are upset that others have figured out a way to use LLMs. It's not logical.
koakuma-chan•33m ago
Try getting a small model to do what you want quickly with high accuracy, high quality, etc, and using few tokens per request. You'll find out that prompt engineering is real and matters.
miladyincontrol•47m ago
Theres lots these models can do but I despise when people suggest they can do edits with "with only the necessary aspects changed".

No, that simply is not true. If you actually compare the before and after you can see it still regenerates all the details on the "unchanged" aspects. Texture, lighting, sharpness, even scale its all different even if varyingly similar to the original.

Sure they're cute for casual edits but it really pains me people suggesting these things are suitable replacements for actual photo editing. Especially when it comes to people, or details outside their training data theres a lot of nuance that can be lost as it regenerates them no matter how you prompt things.

Even if you

StevenWaterman•46m ago
That is true for gpt-image-1 but not nano-banana. They can do masked image changes
minimaxir•38m ago
Nano Banana is different and much better at edits without changing texture/lighting/sharpness/color balance, and I am someone that is extremely picky about it. That's why I add the note that Gemini 2.5 Flash is aware of segmentation masks, and that's my hunch why that's the case.
BoredPositron•27m ago
Nano banana has a really low spatial scaling and doesn't affect details like other models.
mkagenius•45m ago
> Nano Banana is still bad at rendering text perfectly/without typos as most image generation models.

I figured that if you write the text in Google docs and share the screenshot with banana it will not make any spelling mistake.

So, use something like "can you write my name on this Wimbledon trophy, both images are attached. Use them" will work.

minimaxir•34m ago
Google's example documentation for Nano Banana does demo that pipeline: https://ai.google.dev/gemini-api/docs/image-generation#pytho...

That's on my list of blog-post-worthy things to test, namely text rendering to image in Python directly and passing both input images to the model for compositing.

ml-anon•41m ago
"prompt engineered"...i.e. by typing in what you want to see.
harpiaharpyja•36m ago
Not all models can actually do that if your prompt is particular
darepublic•29m ago
"amenable to highly specific and granular instruction"
simonw•27m ago
... and then iterating on that prompt many times, based on your accumulated knowledge of how best to prompt that particular model.
minimaxir•21m ago
Case in point, the final image in this post (the IP bonanza) took 28 iterations of the prompt text to get something maximally interesting, and why that one is very particular about the constraints it invokes, such as specifying "distinct" characters and specifying they are present from "left to right" because the model kept exploiting that ambiguity.
pfortuny•31m ago
Well, I just asked it for a 13-sided irregular polygon (is it that hard?)…

https://imgur.com/a/llN7V0W

BoredPositron•28m ago
The kicker for nano banana is not prompt adherence which is a really nice to have but the fact that it's either working on pixel space or with a really low spatial scaling. It's the only model that doesn't kill your details because of vae encode/decode.
sebzim4500•17m ago
It's really cool how good of a job it did rendering a page given its HTML code. I was not expecting it to do nearly as well.
leviathant•17m ago
I was kind of surprised by this line:

>Nano Banana is terrible at style transfer even with prompt engineering shenanigans

My context: I'm kind of fixated on visualizing my neighborhood as it would have appeared in the 18th century. I've been doing it in Sketchup, and then in Twinmotion, but neither of those produce "photorealistic" images... Twinmotion can get pretty close with a lot of work, but that's easier with modern architecture than it is with the more hand-made, brick-by-brick structures I'm modeling out.

As different AI image generators have emerged, I've tried them all in an effort to add the proverbial rough edges to snapshots of the models I've created, and it was not until Nano Banana that I ever saw anything even remotely workable.

Nano Banana manages to maintain the geometry of the scene, while applying new styles to it. Sometimes I do this with my Twinmotion renders, but what's really been cool to see is how well it takes a drawing, or engraving, or watercolor - and with as simple a prompt as "make this into a photo" it generates phenomenal results.

Similarly to the Paladin/Starbucks/Pirate example in the link though, I find that sometimes I need to misdirect a little bit, because if I'm peppering the prompt with details about the 18th century, I sometimes get a painterly image back. Instead, I'll tell it I want it to look like a photograph of a well preserved historic neighborhood, or a scene from a period film set in the 18th century.

As fantastic as the results can be, I'm not abandoning my manual modeling of these buildings and scenes. However, Nano Banana's interpretation of contemporary illustrations has helped me reshape how I think about some of the assumptions I made in my own models.

simonw•6m ago
I like the Python library that accompanies this: https://github.com/minimaxir/gemimg

I added a CLI to it (using Gemini CLI) and submitted a PR, you can run that like so:

  GEMINI_API_KEY="..." \
  uv run --with https://github.com/minimaxir/gemimg/archive/d6b9d5bbefa1e2ffc3b09086bc0a3ad70ca4ef22.zip \
    python -m gemimg "a racoon holding a hand written sign that says I love trash"
Result in this comment: https://github.com/minimaxir/gemimg/pull/7#issuecomment-3529...
peetle•5m ago
In my own experience, nano banana still has the tendency to:

- make massive, seemingly random edits to images - adjust image scale - make very small case but pervasive detail changes obvious in an image diff

For instance, I have found that nano-banana will sporadically add a (convincing) fireplace to a room or new garage behind a house. This happens even with explicit "ALL CAPS" instructions not to do so. This happens sporadically, even when the temperature is set to zero, and makes it impossible to build a reliable app.

Has anyone had a better experience?