Show HN: ScreenTranslate – On-device screen translator for macOS (open source)

https://github.com/hcmhcs/screenTranslate

2•hcmhcs0•6h ago

I kept breaking my workflow to translate foreign text — copy, open browser, paste into translator, read, switch back. Repeat.

  So I built a macOS menu bar app that translates right where you're working.
                                                                                                   
  Two modes:                                                                                     
  - Select text in any app → Cmd+Option+Z → instant translation                                    
  - Cmd+Shift+T → drag over any area → OCR + translate (images, PDFs, subtitles)
                                                                                                   
  Everything runs on-device via Apple Vision + Apple Translation. No servers, no tracking. Free    
  forever.                                                                                         
                                                                                                   
  20 languages · offline capable · GPL-3.0

Comments

vunderba•3h ago

Nice job.

I created something like this over a decade ago for Windows that would let you hit a globally registered shortcut to hover a magnifying glass over text in a windowed/fullscreen game - I used to use it while I was studying Chinese with emulated SNES RPGs.

Back then the best we could do was tesseract OCR feeding down to the open CC-CEDICT dictionary. It was primitive but sufficed!

hcmhcs0•3h ago

That's a really cool use case — OCR on emulated RPGs for language study! I didn't know Tesseract could handle pixel fonts. How well did it work?

I went with Apple's Vision and Translation frameworks since they were the easiest path for me, but the downside is it requires macOS 15+. I'm thinking about adding Tesseract as an alternative OCR engine to support older versions — sounds like it could work well enough!

vunderba•3h ago

Thanks! Honestly? Initially very poorly.

What I ended up doing was generating around a dozen versions of a screenshot in realtime (all with different combinations of thresholding, segmentation parameters, resolution scaling, and denoising) behind the scenes. Then it would fire Tesseract off on all of them in parallel threads and let them “vote” on the result.

After I set that up, the accuracy improved significantly.

If you're looking for an alternative rather than Tesseract - I'd actually recommend Surya. I've had a lot of success with it out of the box with doing OCR on comics.

https://github.com/datalab-to/surya

hcmhcs0•2h ago

That's a clever approach — running multiple preprocessing variants in parallel and letting them vote. Almost like an ensemble for OCR!

Thanks for the Surya recommendation, I hadn't come across it before. Will definitely check it out!

hcmhcs0•3h ago

Hi, I'm the author — a student developer. I've been really into AI agents lately and spend a lot of time reading system instructions and source code on OpenClaw. Problem is, my English isn't great, so I constantly needed to translate.

Switching to a Google Translate tab every time or asking an AI to translate broke my flow completely. So I built this to translate right where I'm working — no tab switching, no copy-paste.

Built with Claude Code over a weekend. Happy to answer any questions!

Show HN: Moongate – Ultima Online server emulator in .NET 10 with Lua scripting

Show HN: 1v1 coding game that LLMs struggle with

Show HN: Kula – Lightweight, self-contained Linux server monitoring tool

Show HN: Claude-replay – A video-like player for Claude Code sessions

Show HN: The Roman Industrial Revolution that could have been (Vol 2)

Show HN: Reconstruct any image using primitive shapes, runs in-browser via WASM

Show HN: MysteryMaker AI

Show HN: A trainable, modular electronic nose for industrial use

Show HN: NeoNetrek – modernizing the internet's first team game (1988)

Show HN: I open-sourced my Steam game, 100% written in Lua, engine is also open

Show HN: Free salary converter with 3,400 neighborhood comparisons in 182 cities

Show HN: Cross-Claude MCP – Let multiple Claude instances talk to each other

Show HN: WebBridge turns any website into MCP tools by recording browser traffic

Show HN: Interactive 3D globe of EU shipping emissions

Show HN: Swarm – Program a colony of 200 ants using a custom assembly language

Show HN: Modembin – A pastebin that encodes your text into real FSK modem audio

Show HN: Sqry – semantic code search using AST and call graphs

Show HN: mTile – native macOS window tiler inspired by gTile

Show HN: ScreenTranslate – On-device screen translator for macOS (open source)

Show HN: Jido 2.0, Elixir Agent Framework

Show HN: Graph-Oriented Generation – Beating RAG for Codebases by 89%

Show HN: PageAgent, A GUI agent that lives inside your web app

Show HN: Pg_sorted_heap–Physically sorted PostgreSQL with builtin vector search

Show HN: Mantle – Remap your Mac keyboard without editing Kanata config files

Show HN: Anchor Engine – Deterministic Semantic Memory for LLMs Local (<3GB RAM)

Show HN: VaultNote – Local-first encrypted note-taking in the browser

Show HN: Poppy – A simple app to stay intentional with relationships

Show HN: Mog, a programming language for AI agents

Show HN: Go-TUI – A framework for building declarative terminal UIs in Go

Show HN: Best ways to organize research links

Show HN: Moongate – Ultima Online server emulator in .NET 10 with Lua scripting

Show HN: 1v1 coding game that LLMs struggle with

Show HN: Kula – Lightweight, self-contained Linux server monitoring tool

Show HN: Claude-replay – A video-like player for Claude Code sessions

Show HN: The Roman Industrial Revolution that could have been (Vol 2)

Show HN: Reconstruct any image using primitive shapes, runs in-browser via WASM

Show HN: MysteryMaker AI

Show HN: A trainable, modular electronic nose for industrial use

Show HN: NeoNetrek – modernizing the internet's first team game (1988)

Show HN: I open-sourced my Steam game, 100% written in Lua, engine is also open

Show HN: Free salary converter with 3,400 neighborhood comparisons in 182 cities

Show HN: Cross-Claude MCP – Let multiple Claude instances talk to each other

Show HN: WebBridge turns any website into MCP tools by recording browser traffic

Show HN: Interactive 3D globe of EU shipping emissions

Show HN: Swarm – Program a colony of 200 ants using a custom assembly language

Show HN: Modembin – A pastebin that encodes your text into real FSK modem audio

Show HN: Sqry – semantic code search using AST and call graphs

Show HN: mTile – native macOS window tiler inspired by gTile

Show HN: ScreenTranslate – On-device screen translator for macOS (open source)

Show HN: Jido 2.0, Elixir Agent Framework

Show HN: Graph-Oriented Generation – Beating RAG for Codebases by 89%

Show HN: PageAgent, A GUI agent that lives inside your web app

Show HN: Pg_sorted_heap–Physically sorted PostgreSQL with builtin vector search

Show HN: Mantle – Remap your Mac keyboard without editing Kanata config files

Show HN: Anchor Engine – Deterministic Semantic Memory for LLMs Local (<3GB RAM)

Show HN: VaultNote – Local-first encrypted note-taking in the browser

Show HN: Poppy – A simple app to stay intentional with relationships

Show HN: Mog, a programming language for AI agents

Show HN: Go-TUI – A framework for building declarative terminal UIs in Go

Show HN: Best ways to organize research links

Show HN: ScreenTranslate – On-device screen translator for macOS (open source)

Comments