For v1, it's basically a model pipeline: OCR the existing text -> generate a mask -> erase the text -> translate it -> find the closest font via embedding comparison -> render the translated text back onto the image
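In simplified Python, the glue looks roughly like this. To keep the sketch self-contained I've stood in easyocr and OpenCV's inpainting for our actual models, and translate()/match_font() are placeholders for the translation and embedding-based font-matching steps:

    import cv2
    import numpy as np
    import easyocr
    from PIL import Image, ImageDraw, ImageFont

    def translate(text: str, target_lang: str) -> str:
        # Placeholder for the MT step; swap in whatever backend you like.
        return text

    def match_font(crop: np.ndarray) -> str:
        # Placeholder for the embedding-based matcher: embed this crop of the
        # original text, nearest-neighbour search a font library, return the
        # closest .ttf. Hard-coded here.
        return "DejaVuSans.ttf"

    def translate_image(path: str, target_lang: str) -> Image.Image:
        img = cv2.imread(path)
        reader = easyocr.Reader(["en"])
        mask = np.zeros(img.shape[:2], dtype=np.uint8)
        regions = []
        for box, text, conf in reader.readtext(img):   # 1. OCR text + boxes
            pts = np.array(box, dtype=np.int32)
            cv2.fillPoly(mask, [pts], 255)             # 2. build the erase mask
            regions.append((pts, text))
        clean = cv2.inpaint(img, mask, 3, cv2.INPAINT_TELEA)  # 3. erase text
        out = Image.fromarray(cv2.cvtColor(clean, cv2.COLOR_BGR2RGB))
        draw = ImageDraw.Draw(out)
        for pts, text in regions:
            x0, y0 = pts.min(axis=0)
            x1, y1 = pts.max(axis=0)
            crop = img[y0:y1, x0:x1]
            font = ImageFont.truetype(match_font(crop), int(y1 - y0))  # 5. font
            draw.text((int(x0), int(y0)),              # 6. render it back
                      translate(text, target_lang),    # 4. translate
                      font=font, fill="black")
        return out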
v1 was more of a prototype, but it already beats many of the comparable services from Google, Azure, etc.
We're working on v2, where we're training a diffusion model to translate the text directly on the image. The pipeline already works for English and Chinese, and we're now building datasets for other languages.
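For a rough idea of the shape of v2: our model is custom-trained, so this isn't our actual code, but the interface is close to a standard diffusion inpainting pipeline, where the mask marks the text region and the translated string is injected via conditioning (a generic prompt here):

    import torch
    from diffusers import StableDiffusionInpaintPipeline
    from PIL import Image

    # Any off-the-shelf inpainting checkpoint works for the sketch.
    pipe = StableDiffusionInpaintPipeline.from_pretrained(
        "runwayml/stable-diffusion-inpainting",
        torch_dtype=torch.float16,
    ).to("cuda")

    image = Image.open("panel.png").convert("RGB")
    mask = Image.open("text_mask.png").convert("RGB")  # white = redraw here
    # A trained model would condition on the target text directly instead of
    # smuggling it through the prompt.
    result = pipe(
        prompt='speech bubble containing the text "Hello!"',
        image=image,
        mask_image=mask,
    ).images[0]
    result.save("translated_panel.png")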