frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Pico-Banana-400k

https://github.com/apple/pico-banana-400k
107•dvrp•3h ago

Comments

vunderba•2h ago
From the paper

> The pipeline (bottom) shows how diverse OpenImages inputs are edited using Nano-Banana and quality-filtered by Gemini-2.5-Pro, with failed attempts automatically retried.

Pretty interesting. I run a fairly comprehensive image-comparison site for SOTA generative AI in text-to-image and editing. Managing it manually got pretty tiring, so a while back I put together a small program that takes a given starting prompt, a list of GenAI models, and a max number of retries which does something similar.

It generates and evaluates images using a separate multimodal AI, and then rewrites failed prompts automatically repeating up to a set limit.

It's not perfect (nine pointed star example in particular) - but often times the "recognition aspect of a multimodal model" is superior to its generative capabilities so you can run it in a sort of REPL until you get the desired outcome.

https://genai-showdown.specr.net/image-editing

typpilol•1h ago
I love your site I stumble across it once a month it seems.

Or there's another very similar site. But I'm pretty sure it's yours

lukasb•1h ago
What do you use for evaluation? gemini-2.5-pro is at the top of MMLU and has been best for me but always looking for better.
vednig•2h ago
Other Post: https://news.ycombinator.com/item?id=45708493
cjrd•1h ago
Eh
djtriptych•1h ago
Really cool - looking to Apple to lead the on-device AI space in short order...
daemonologist•55m ago
I confess that I don't quite get the point here - is it just that they've paid the inference costs for a dataset than can be used for distillation/other research?
peddling-brink•32m ago
Essentially yes, it’s a data set that can help train or fine tune another model or similar research. From the site:

> Pico-Banana-400K serves as a versatile resource for advancing controllable and instruction-aware image editing. Beyond single-step editing, the dataset enables multi-turn, conversational editing and reward-based training paradigms.

TechSquidTV•49m ago
Can it be? Has Apple FINALLY joined the party? Very ironic they are using an open dataset from Google... and Gemini for prompts by Google.

I'm happy to see something from Apple but this seems so low-tech that it could be one of my own local ComfyUI workflows.

w-ll•27m ago
how about apple undo whatever the f they did to speach-to-text, and get rid of the ducking nanny
skissane•9m ago
The license is CC BY-NC-ND - I’m not sure who is going to be able to use it given the NC-ND part… especially given the potential uncertainty over what uses count as commercial and what counts as derivative works. OTOH, given the bulk of this dataset is AI outputs, its copyrightability is an open question.

Pico-Banana-400k

https://github.com/apple/pico-banana-400k
108•dvrp•3h ago•11 comments

A worker fell into a nuclear reactor pool

https://www.nrc.gov/reading-rm/doc-collections/event-status/event/2025/20251022en?brid=vscAjql9kZ...
233•nvahalik•3h ago•147 comments

The Linux Boot Process: From Power Button to Kernel

https://www.0xkato.xyz/linux-boot/
139•0xkato•6h ago•40 comments

PCB Edge USB C Connector Library

https://github.com/AnasMalas/pcb-edge-usb-c
31•walterbell•2h ago•12 comments

California invests in battery energy storage, leaving rolling blackouts behind

https://www.latimes.com/environment/story/2025-10-17/california-made-it-through-another-summer-wi...
224•JumpCrisscross•9h ago•166 comments

The Journey Before main()

https://amit.prasad.me/blog/before-main
182•amitprasad•9h ago•65 comments

Project Amplify: Powered footwear for running and walking

https://about.nike.com/en/newsroom/releases/nike-project-amplify-official-images
57•justinmayer•8h ago•46 comments

Show HN: Chonky – a neural text semantic chunking goes multilingual

https://huggingface.co/mirth/chonky_mmbert_small_multilingual_1
12•hessdalenlight•17h ago•0 comments

Show HN: Diagram as code tool with draggable customizations

https://github.com/RohanAdwankar/oxdraw
146•RohanAdwankar•8h ago•35 comments

How programs get run: ELF binaries (2015)

https://lwn.net/Articles/631631/
78•st_goliath•8h ago•2 comments

D2: Diagram Scripting Language

https://d2lang.com/tour/intro/
71•benzguo•6h ago•14 comments

NextSilicon reveals new processor chip in challenge to Intel, AMD

https://www.reuters.com/business/nextsilicon-reveals-new-processor-chip-challenge-intel-amd-2025-...
27•simojo•3d ago•5 comments

Agent Lightning: Train agents with RL (no code changes needed)

https://github.com/microsoft/agent-lightning
62•bakigul•8h ago•8 comments

An Update on TinyKVM

https://fwsgonzo.medium.com/an-update-on-tinykvm-7a38518e57e9
89•ingve•8h ago•22 comments

Doctor Who archive expert shares positive update on missing episode

https://www.radiotimes.com/tv/sci-fi/doctor-who-missing-episodes-update-teases-announcement-newsu...
64•gnabgib•6d ago•31 comments

Show HN: Shadcn/UI theme editor – Design and share Shadcn themes

https://shadcnthemer.com
93•miketromba•9h ago•28 comments

Why I code as a CTO

https://www.assembled.com/blog/why-i-code-as-a-cto
90•johnjwang•1d ago•59 comments

AI, Wikipedia, and uncorrected machine translations of vulnerable languages

https://www.technologyreview.com/2025/09/25/1124005/ai-wikipedia-vulnerable-languages-doom-spiral/
76•kawera•9h ago•33 comments

Rock Tumbler Instructions

https://rocktumbler.com/tips/rock-tumbler-instructions/
166•debo_•12h ago•81 comments

GenAI Image Editing Showdown

https://genai-showdown.specr.net/
5•rzk•2h ago•0 comments

WebDAV isn't dead yet

https://blog.feld.me/posts/2025/09/webdav-isnt-dead-yet/
129•toomuchtodo•1d ago•62 comments

ARM Memory Tagging: how it improves C/C++ memory safety (2018) [pdf]

https://llvm.org/devmtg/2018-10/slides/Serebryany-Stepanov-Tsyrklevich-Memory-Tagging-Slides-LLVM...
55•fanf2•8h ago•19 comments

An Efficient Implementation of SELF (1989) [pdf]

https://courses.cs.washington.edu/courses/cse501/15sp/papers/chambers.pdf
39•todsacerdoti•8h ago•20 comments

We do not have sufficient links to the UK for Online Safety Act to be applicable

https://libera.chat/news/advised
216•todsacerdoti•12h ago•68 comments

In memory of the Christmas Island shrew

https://news.mongabay.com/2025/10/in-memory-of-the-christmas-island-shrew/
61•hexhowells•8h ago•18 comments

Passwords and Power Drills

https://google.github.io/building-secure-and-reliable-systems/raw/ch01.html#on_passwords_and_powe...
69•harporoeder•4d ago•16 comments

Making a micro Linux distro (2023)

https://popovicu.com/posts/making-a-micro-linux-distro/
164•turrini•16h ago•28 comments

Testing out BLE beacons with BeaconDB

https://blog.matthewbrunelle.com/testing-out-ble-beacons-with-beacondb/
47•zdw•8h ago•12 comments

Belittled Magazine: Thirty years after the Sokal affair

https://thebaffler.com/salvos/belittled-magazine-robbins
44•Hooke•7h ago•32 comments

The future of Python web services looks GIL-free

https://blog.baro.dev/p/the-future-of-python-web-services-looks-gil-free
187•gi0baro-dev•6d ago•77 comments