Show HN: Agents – Sync MCP Configs Across Claude, Cursor, Codex Automatically

https://github.com/amtiYo/agents
1•amtiyo•51s ago•0 comments

Hello

1•otrebladih•2m ago•0 comments

FSD helped save my father's life during a heart attack

https://twitter.com/JJackBrandt/status/2019852423980875794
1•blacktulip•4m ago•0 comments

Show HN: Writtte – Draft and publish articles without reformatting, anywhere

https://writtte.xyz
1•lasgawe•6m ago•0 comments

Portuguese icon (FROM A CAN) makes a simple meal (Canned Fish Files) [video]

https://www.youtube.com/watch?v=e9FUdOfp8ME
1•zeristor•8m ago•0 comments

Brookhaven Lab's RHIC Concludes 25-Year Run with Final Collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
2•gnufx•10m ago•0 comments

Transcribe your aunts post cards with Gemini 3 Pro

https://leserli.ch/ocr/
1•nielstron•14m ago•0 comments

.72% Variance Lance

1•mav5431•15m ago•0 comments

ReKindle – web-based operating system designed specifically for E-ink devices

https://rekindle.ink
1•JSLegendDev•17m ago•0 comments

Encrypt It

https://encryptitalready.org/
1•u1hcw9nx•17m ago•1 comments

NextMatch – 5-minute video speed dating to reduce ghosting

https://nextmatchdating.netlify.app/
1•Halinani8•18m ago•1 comments

Personalizing esketamine treatment in TRD and TRBD

https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1736114
1•PaulHoule•19m ago•0 comments

SpaceKit.xyz – a browser‑native VM for decentralized compute

https://spacekit.xyz
1•astorrivera•20m ago•0 comments

NotebookLM: The AI that only learns from you

https://byandrev.dev/en/blog/what-is-notebooklm
1•byandrev•20m ago•1 comments

Show HN: An open-source starter kit for developing with Postgres and ClickHouse

https://github.com/ClickHouse/postgres-clickhouse-stack
1•saisrirampur•21m ago•0 comments

Game Boy Advance d-pad capacitor measurements

https://gekkio.fi/blog/2026/game-boy-advance-d-pad-capacitor-measurements/
1•todsacerdoti•21m ago•0 comments

South Korean crypto firm accidentally sends $44B in bitcoins to users

https://www.reuters.com/world/asia-pacific/crypto-firm-accidentally-sends-44-billion-bitcoins-use...
2•layer8•22m ago•0 comments

Apache Poison Fountain

https://gist.github.com/jwakely/a511a5cab5eb36d088ecd1659fcee1d5
1•atomic128•24m ago•2 comments

Web.whatsapp.com appears to be having issues syncing and sending messages

http://web.whatsapp.com
1•sabujp•24m ago•2 comments

Google in Your Terminal

https://gogcli.sh/
1•johlo•25m ago•0 comments

Shannon: Claude Code for Pen Testing: #1 on Github today

https://github.com/KeygraphHQ/shannon
1•hendler•26m ago•0 comments

Anthropic: Latest Claude model finds more than 500 vulnerabilities

https://www.scworld.com/news/anthropic-latest-claude-model-finds-more-than-500-vulnerabilities
2•Bender•30m ago•0 comments

Brooklyn cemetery plans human composting option, stirring interest and debate

https://www.cbsnews.com/newyork/news/brooklyn-green-wood-cemetery-human-composting/
1•geox•30m ago•0 comments

Why the 'Strivers' Are Right

https://greyenlightenment.com/2026/02/03/the-strivers-were-right-all-along/
1•paulpauper•32m ago•0 comments

Brain Dumps as a Literary Form

https://davegriffith.substack.com/p/brain-dumps-as-a-literary-form
1•gmays•32m ago•0 comments

Agentic Coding and the Problem of Oracles

https://epkconsulting.substack.com/p/agentic-coding-and-the-problem-of
1•qingsworkshop•33m ago•0 comments

Malicious packages for dYdX cryptocurrency exchange empties user wallets

https://arstechnica.com/security/2026/02/malicious-packages-for-dydx-cryptocurrency-exchange-empt...
1•Bender•33m ago•0 comments

Show HN: I built a <400ms latency voice agent that runs on a 4gb vram GTX 1650"

https://github.com/pheonix-delta/axiom-voice-agent
1•shubham-coder•33m ago•0 comments

Penisgate erupts at Olympics; scandal exposes risks of bulking your bulge

https://arstechnica.com/health/2026/02/penisgate-erupts-at-olympics-scandal-exposes-risks-of-bulk...
4•Bender•34m ago•0 comments

Arcan Explained: A browser for different webs

https://arcan-fe.com/2026/01/26/arcan-explained-a-browser-for-different-webs/
1•fanf2•36m ago•0 comments

Nano Banana Flash – Google's Gemini 3 Flash Image Model

https://nanobananaflash.io
1•xbaicai•2mo ago

Comments

xbaicai•2mo ago
Nano Banana Flash – Google's Gemini 3 Flash Image Model for AI Image Generation and Editing

I've been experimenting with Google's Gemini 3 Flash Image (internally codenamed "nano-banana"), and I wanted to share what makes this model architecturally interesting compared to other image generation approaches.

What Makes It Different

Most image generation models follow a diffusion-based architecture (Stable Diffusion, DALL-E, Midjourney). Nano Banana takes a different approach – it's built on Google's Gemini multimodal foundation, meaning it shares the same underlying transformer architecture that handles text, making it natively conversational.

Key technical characteristics:

Prompt-driven editing: Unlike traditional inpainting that requires masks, you can describe edits conversationally ("make the sky darker", "change the shirt to blue")

Multi-image composition: Accepts up to 3,000 images per prompt for blending and composition

Character consistency: Maintains visual consistency across multiple generated images – useful for storyboarding or product variations

SynthID watermarking: Invisible digital watermark embedded at generation time (not post-processing)

Use Cases Where It Excels

From my testing, it's particularly strong at:

Product photography variations: Generate multiple angles or contexts for the same product while maintaining visual consistency

Iterative design: The conversational interface means you can refine without starting over

Multi-image blending: Combining reference images with text prompts for precise control

Technical Limitations

Worth noting:

Maximum 7MB per file for inline data

Output quality varies with prompt specificity (like all LLMs, prompt engineering matters)

The conversational approach means you need to think about context window management for long editing sessions
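To make the inline-data limit concrete, here is a minimal sketch of a pre-flight size check. It assumes "7MB" means 7 MiB of raw bytes (it could equally be measured on the encoded payload) and that inline images are sent base64-encoded in a part shaped like the public Gemini API's `inline_data` object. The helper name `make_inline_part` is made up for this example.

```python
import base64

# Assumption: the 7MB limit is read as 7 MiB of raw (pre-encoding) bytes.
MAX_INLINE_BYTES = 7 * 1024 * 1024


def make_inline_part(data: bytes, mime_type: str = "image/png") -> dict:
    """Build an inline-data part, rejecting files above the assumed limit.

    The dict layout is an assumption modelled on the public Gemini API's
    inline_data part; check the current docs before relying on it.
    """
    if len(data) > MAX_INLINE_BYTES:
        raise ValueError(
            f"file is {len(data)} bytes, above the {MAX_INLINE_BYTES}-byte inline limit"
        )
    encoded = base64.b64encode(data).decode("ascii")
    return {"inline_data": {"mime_type": mime_type, "data": encoded}}
```

Failing before the upload is cheaper than a rejected request; anything over the limit would presumably need a file-upload path instead of inline data.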

The model is accessible via standard REST APIs, making integration straightforward if you're already using Google Cloud infrastructure.

Why This Matters

The interesting shift here isn't just another image model – it's the convergence of language and vision models into a unified architecture. The same transformer that understands your code or writes your emails can now edit your images. This has implications for:

Tooling: IDEs and development environments can integrate image generation as naturally as code completion

Workflows: Designers can describe changes in natural language rather than learning complex UI tools

Accessibility: Lower barrier to entry for image manipulation

Open Questions

I'm curious what the HN community thinks about:

How do you handle version control for conversationally-edited images?

What's the right abstraction for programmatic access – should we treat it like a stateful session or stateless function calls?

For production use, how do you validate consistency across generated image sets?
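One possible answer to the first two questions, as a rough sketch rather than a recommendation: keep the model call stateless and wrap it in a thin session that logs each prompt alongside a hash of the resulting image. Everything here is hypothetical, including `EditSession`, the `EditFn` signature, and `fake_edit`, which stands in for a real model call.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical stateless backend: (current image bytes, prompt) -> new image bytes.
EditFn = Callable[[bytes, str], bytes]


@dataclass
class EditSession:
    """Stateful wrapper around a stateless edit function."""

    edit_fn: EditFn
    image: bytes
    # One (prompt, sha256 of resulting image) entry per applied edit.
    history: List[Tuple[str, str]] = field(default_factory=list)

    def apply(self, prompt: str) -> bytes:
        self.image = self.edit_fn(self.image, prompt)
        digest = hashlib.sha256(self.image).hexdigest()
        self.history.append((prompt, digest))
        return self.image


def fake_edit(image: bytes, prompt: str) -> bytes:
    """Stand-in for a model call: appends the prompt so the demo is deterministic."""
    return image + b"|" + prompt.encode("utf-8")


session = EditSession(fake_edit, b"img")
session.apply("make the sky darker")
session.apply("change the shirt to blue")
```

The history is a plain list of prompts and digests, so it can be committed as text next to the image files. A real model is unlikely to be deterministic, so the hashes identify stored outputs rather than guaranteeing that a replay reproduces them.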

The codebase is closed-source (it's Google), but the API is well-documented and the model is available for experimentation through AI Studio. Would love to hear if anyone else has been working with this or has thoughts on the architectural approach.

Technical specs for reference:

Model: Gemini 3 Flash Image

Output: 1290 tokens per image

Max images per prompt: 3,000

Max file size: 7MB (inline/console)

Watermarking: SynthID (invisible, embedded)
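For the context window point, a back-of-the-envelope sketch using the 1290 tokens per image figure from the specs above. The function names are invented for the example, and it counts image output tokens only, ignoring prompt text.

```python
TOKENS_PER_IMAGE = 1290  # per-image output figure quoted in the specs above


def image_output_tokens(n_images: int) -> int:
    """Output tokens consumed by n generated images."""
    return n_images * TOKENS_PER_IMAGE


def images_within_budget(token_budget: int) -> int:
    """How many whole images fit in a given output-token budget."""
    return token_budget // TOKENS_PER_IMAGE
```

For example, ten edits in one session would account for 12,900 image tokens, and a hypothetical 32,768-token budget would fit 25 images.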

minimaxir•2mo ago
Stop attempting to namesquat.