frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Pico-Banana-400k

https://github.com/apple/pico-banana-400k
93•dvrp•2h ago

Comments

vunderba•2h ago
From the paper

> The pipeline (bottom) shows how diverse OpenImages inputs are edited using Nano-Banana and quality-filtered by Gemini-2.5-Pro, with failed attempts automatically retried.

Pretty interesting. I run a fairly comprehensive image-comparison site for SOTA generative AI in text-to-image and editing. Managing it manually got pretty tiring, so a while back I put together a small program that takes a given starting prompt, a list of GenAI models, and a max number of retries which does something similar.

It generates and evaluates images using a separate multimodal AI, and then rewrites failed prompts automatically repeating up to a set limit.

It's not perfect (nine pointed star example in particular) - but often times the "recognition aspect of a multimodal model" is superior to its generative capabilities so you can run it in a sort of REPL until you get the desired outcome.

https://genai-showdown.specr.net/image-editing

typpilol•1h ago
I love your site I stumble across it once a month it seems.

Or there's another very similar site. But I'm pretty sure it's yours

lukasb•1h ago
What do you use for evaluation? gemini-2.5-pro is at the top of MMLU and has been best for me but always looking for better.
vednig•2h ago
Other Post: https://news.ycombinator.com/item?id=45708493
cjrd•1h ago
Eh
djtriptych•46m ago
Really cool - looking to Apple to lead the on-device AI space in short order...
daemonologist•28m ago
I confess that I don't quite get the point here - is it just that they've paid the inference costs for a dataset than can be used for distillation/other research?
peddling-brink•5m ago
Essentially yes, it’s a data set that can help train or fine tune another model or similar research. From the site:

> Pico-Banana-400K serves as a versatile resource for advancing controllable and instruction-aware image editing. Beyond single-step editing, the dataset enables multi-turn, conversational editing and reward-based training paradigms.

TechSquidTV•22m ago
Can it be? Has Apple FINALLY joined the party? Very ironic they are using an open dataset from Google... and Gemini for prompts by Google.

I'm happy to see something from Apple but this seems so low-tech that it could be one of my own local ComfyUI workflows.

Language Modeling with Hierarchical Reasoning Models: Lessons from 1M Parameters

https://williamthurston.com/ml/language-models/transformers/2025/10/25/language-modeling-with-hie...
1•jhspaybar•3m ago•0 comments

GameStop Declares Console Wars Over

https://twitter.com/gamestop/status/1982213786221109263
2•avonmach•12m ago•0 comments

Quick Dungeon Crawler Update 3.5.0: New Passives, CRIT DMG Nerf

https://dungeon.werkstattl.com/
1•logTom•15m ago•1 comments

Jan van Eijk's wise lessons and advice

https://www.hightechinstitute.nl/jan-van-eijk-wise-lessons/
1•o4c•18m ago•0 comments

How I Used Lies About a Cartoon to Prove History Is Meaningless on the Internet (2016)

https://medium.com/pcmag-access/how-i-used-lies-about-a-cartoon-to-prove-history-is-meaningless-o...
1•jfil•20m ago•0 comments

Show HN: Zoto – low-level audio playback in Zig

https://github.com/braheezy/zoto
2•braheezy•28m ago•0 comments

Node.js Hackathon Starter still get updates in 2025

https://github.com/sahat/hackathon-starter
3•sawirricardo•30m ago•0 comments

ALA and Metformin Synergistically Ameliorate T2 Diabetes Cognitive Dysfunction

https://www.mdpi.com/2079-7737/14/7/885
2•walterbell•32m ago•0 comments

TinyCorp: Nvidia GPU on Apple Silicon over USB4 is ready to try

https://xcancel.com/__tinygrad__/status/1980082660920918045
1•gsf_emergency_4•37m ago•0 comments

Auto Dark Mode for Windows

https://github.com/AutoDarkMode/Windows-Auto-Night-Mode
2•hermitsings•38m ago•0 comments

Tariffs weigh on US manufacturing as activity contracts for 7th straight month

https://www.straitstimes.com/business/economy/tariffs-weigh-on-us-manufacturing-in-september-as-a...
2•testrun•39m ago•0 comments

Microsoft disables File Explorer preview for downloaded files by default

https://www.windowslatest.com/2025/10/26/microsoft-admits-file-explorer-preview-pane-wont-work-in...
4•kirenida•39m ago•0 comments

Show HN: Piping in and Out of Emacs

https://github.com/agzam/mx-piper
2•iLemming•40m ago•0 comments

TinyCorp Runs Nvidia GPU Off Apple Silicon Mac via USB4 dock

https://www.tomshardware.com/pc-components/gpus/tiny-corp-successfully-runs-an-nvidia-gpu-on-arm-...
4•gsf_emergency_4•43m ago•0 comments

Association between microplastics and depressive symptoms in college students

https://www.sciencedirect.com/science/article/pii/S0147651325004786
1•donsupreme•46m ago•0 comments

Dov Charney on Vice (Former American Apparel CEO) [video]

https://www.youtube.com/watch?v=CG_T1fY3KTk
3•artur_makly•46m ago•1 comments

Tsdown – The Elegant Bundler for Libraries

https://tsdown.dev/
2•jcbhmr•1h ago•0 comments

Rescuing Democracy from the Quiet Rule of AI

https://www.noemamag.com/rescuing-democracy-from-the-quiet-rule-of-ai/
4•devonnull•1h ago•0 comments

Ondol

https://en.wikipedia.org/wiki/Ondol
5•JumpCrisscross•1h ago•0 comments

GenAI Image Editing Showdown

https://genai-showdown.specr.net/
2•rzk•1h ago•0 comments

How efficient is RocksDB for IO-bound, point-query workloads?

http://smalldatum.blogspot.com/2025/10/how-efficient-is-rocksdb-for-io-bound.html
1•loeg•1h ago•0 comments

Periodic Advertising with Responses (PAwR): Bidirectional Bluetooth Advertising

https://novelbits.io/periodic-advertising-with-responses-pawr/
2•latchkey•1h ago•0 comments

Network Scanner script to automate Adblock rules

https://github.com/ryanbr/network-scanner
3•mp3geek•2h ago•0 comments

PCB Edge USB C Connector Library

https://github.com/AnasMalas/pcb-edge-usb-c
25•walterbell•2h ago•11 comments

AI models may be developing their own 'survival drive', researchers say

https://www.theguardian.com/technology/2025/oct/25/ai-models-may-be-developing-their-own-survival...
1•pseudolus•2h ago•2 comments

Creative neglect: What about the apps in Apple?

https://sixcolors.com/post/2025/10/creative-neglect-what-about-the-apps-in-apple/
2•CharlesW•2h ago•0 comments

Show HN: I made a site to replace emdashes and ornate AI characters from text

https://removeemdash.org
1•ryanmerket•2h ago•0 comments

President Reagan's Radio Address to the Nation on Tarrifs, 1987

https://www.youtube.com/watch?v=5t5QK03KXPc
32•nothrowaways•2h ago•5 comments

Computers Have Killed Chess

https://lichess.org/@/ZugAddict/blog/computers-have-literally-killed-chess/6mIyVwMS
4•fzliu•2h ago•1 comments

The NBA gambling scandal, explained by an actual gambler

https://www.natesilver.net/p/the-nba-gambling-scandal-explained
2•PaulHoule•2h ago•0 comments