frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Seedream 5.0-Preview Test: An image model that does web search during generation

https://www.atlascloud.ai/collections/seedream-5
2•Alisaqqt•1h ago

Comments

Alisaqqt•1h ago
I’ve been testing a new image model from ByteDance called Seedream 5.0-Preview, which is currently available inside Dreamina and will be available via API on Atlas Cloud once it drops on 02/24/2026 (alongside the video model Seedance 2.0). It’s not the first text-to-image model I’ve used that performs live web search during generation, but it changes the behavior in some non-trivial ways, comparing with Nano Banana Pro.

Instead of just “better pixels”, this preview adds three things on top of a standard image backbone: (1) web search that can be invoked at generation time, (2) stronger logical/structural reasoning, and (3) more semantics-aware editing. It’s explicitly labeled as a preview (full 5.0 is expected later in February), so I’m treating it as an experiment rather than something production-stable.

A few behaviors I could reliably reproduce:

Web search during generation: For prompts that reference current or niche entities (e.g. a specific year’s event mascot, current product designs), the model will quietly hit the web and then render something that matches what it found, without any reference images from the user. For more “timeless” prompts it often stays offline.

Constraint-following and reasoning: It handles very literal constraints (e.g. specific clock hand positions, object counts, layout rules) much more faithfully than typical image models I’ve used. In multi-image tasks, it can classify elements in one image and re-arrange them in another, which feels closer to visual planning than pure sampling.

Style/trait transfer between images: Given a “style” image and a “content” image, it can extract the visual language of the first and apply it to the second in a way that looks like a consistent campaign asset, driven by natural-language instructions.

There are tradeoffs: photorealism and aesthetic quality in this preview are noticeably behind its own previous 4.5 model, which is still better if you only care about pretty images right now. The 5.0-Preview run I’m on is clearly optimized to show off the “new brain” (reasoning + web) rather than maximum visual polish.

I’m curious how this design — “image model with its own web search and reasoning stack” — fits into the broader ecosystem of multimodal models that already have tool use and web access (e.g. Gemini 3 variants, etc.). Concretely:

What are the obvious safety and copyright pitfalls once the renderer itself can freely crawl and internalize current visuals?

Does it make more sense architecturally to push this into one unified multimodal model, or to keep “LLM + web” and “image + web” as separate, specialized components?

Ps: For my tests I’ve been wiring Atlas Cloud API into small internal tools (prompting frontends, batch renderers) rather than using it from Fal AI or Wavespeed because Atlas always support new models on day 0 with cheaper price.

Detailed model test and showcases: https://www.reddit.com/r/GeminiAI/comments/1r0c6tv/seedream_...

Brandon Wint Poem [video]

https://www.youtube.com/watch?v=k9hPmp09ssc
1•marysminefnuf•29s ago•0 comments

GraphQLite, Graph Network Extension on Top of SQLite

https://github.com/colliery-io/graphqlite
1•ekianjo•3m ago•0 comments

Show HN: Clawhosting.io– Managed OpenClaw

https://clawhosting.io
1•rezaghp•5m ago•0 comments

Modular Buys BentoML

https://www.modular.com/blog/bentoml-joins-modular
1•carefree-bob•8m ago•0 comments

Show HN: Hosting dynamic webcal on GitHub pages

https://github.com/sfw185/BJJCal
1•faridw•9m ago•0 comments

Why capital is fleeing Tech for the Tangible Economy

https://wwai.substack.com/p/the-great-rotation-capital-flees
3•watchwiseai•21m ago•1 comments

Harmless reward hacks generalize to shutdown evasion and dictatorship in GPT-4.1

https://arxiv.org/abs/2508.17511
1•toliveistobuild•22m ago•1 comments

Exponential Code, Network Effects in AI, & the Return of Apprenticeships

https://www.implications.com/p/exponential-code-network-effects
1•swolpers•26m ago•0 comments

Jonathan Blow on Why Modern Software Is Bloated

https://www.youtube.com/watch?v=GOtWR_T2VOk
3•cable2600•32m ago•0 comments

Venezuelan crude oil exports to Israel resume after 6-year gap

https://www.bloomberg.com/news/articles/2026-02-10/venezuela-sending-first-crude-oil-cargo-to-isr...
1•OgsyedIE•35m ago•0 comments

Secretary of War Hegseth Announces End of Support for Harvard University [video]

https://www.youtube.com/watch?v=eh5duiL3MwQ
2•nomilk•36m ago•1 comments

Vibrant Frog Collab – AI-Powered Writing Assistant

https://frogteam.ai/VibrantFrog/default.html
1•am-piazza•37m ago•0 comments

Show HN: Track and analyze AI coding tool usage across your team

https://trackr-bay.vercel.app/welcome
1•usmansidd•40m ago•0 comments

(Rust) Tracking Issue for Generic Constant Arguments MVP

https://github.com/rust-lang/rust/issues/132980
1•anfilt•41m ago•1 comments

How many of the 3,191 billionaires can you name?

https://billionaires.linolevan.com/
1•linolevan•47m ago•1 comments

mrdoob Ported Quake to JavaScript/Three.js

https://mrdoob.github.io/three-quake/
1•davidbarker•47m ago•0 comments

Russians supplied with new satellite internet terminals after Starlink blackout

https://www.pravda.com.ua/eng/news/2026/02/09/8020199/
1•c420•50m ago•0 comments

Block a website in specific countries using Nginx

https://shashanksrivastava.medium.com/block-a-website-in-specific-countries-using-nginx-20a651288795
1•kamaraju•50m ago•0 comments

I am building virtual Bash

https://github.com/everruns/bashkit
1•chalyi•50m ago•1 comments

Show HN: Fyno – Automate repetitive bookkeeping tasks

https://www.meetfyno.com
1•alicele27•50m ago•0 comments

I'm building a clarity-first language (compiles to C++)

https://github.com/taman-islam/rox
1•hedayet•52m ago•1 comments

Spec-Driven Development with Claude Code

https://www.braingrid.ai/blog/building-braingrid-with-braingrid
2•acossta•52m ago•1 comments

Google Nest camera video raises privacy questions

https://www.mynbc5.com/article/nancy-guthrie-fbi-nest-camera-video-raises-privacy-questions/70306538
1•1vuio0pswjnm7•52m ago•1 comments

The First Person Project: How to prove you are a real person online

https://www.firstperson.network/white-paper
1•walterbell•57m ago•0 comments

AI-Driven Low-Fi Prototyping with Balsamiq Cloud

https://balsamiq.com/blog/low-fidelity-prototyping/
1•ilt•58m ago•0 comments

SAIR Foundation

https://sair.foundation/
2•nsoonhui•58m ago•0 comments

Linux 7.0 Review: Major Performance, GPU, CPU, and Networking Upgrades

https://www.youtube.com/watch?v=3s37rDlIemI
2•cable2600•58m ago•0 comments

Show HN: Yan – Glitch Art Photo/Live Editor

https://yan.yichenlab.com/
2•xcc3641•1h ago•0 comments

A simpler way to remove explicit images from Search

https://blog.google/products-and-platforms/products/search/remove-explicit-images/
5•gnabgib•1h ago•0 comments

We're all called Julia, or maybe ChatGPT calls itself Julia

https://solresol.substack.com/p/were-all-called-julia-or-maybe-chatgpt
2•solresol•1h ago•1 comments