frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: 4-model AI pipeline that handles 952 photo specs across 172 countries

https://visapics.org/
3•romanpodpriatov•4mo ago
Hey HN! After watching my mom pay $17 to get a passport photo at CVS (literally just a white background photo), and seeing “visa agents” charge $150+ for DV lottery applications (which are FREE), I went down a rabbit hole. Turns out there are 952+ different photo specifications across various documents worldwide. US passport wants 2x2 inches. DV lottery needs 600x600px. Schengen visa requires 35x45mm. Japanese visa needs specific head-to-photo ratio. It’s insane. What makes this different from competitors: Multi-Model AI Pipeline: •GFPGAN - Fixes low quality photos, enhances faces (your 2015 phone photo? Fixed) •BiRefNet - Removes backgrounds WITH accurate hair detection (curly hair nightmare solved) •MediaPipe - Validates face angle, eye level, head position in real-time •RealESRGAN - Upscales potato quality images to HD Most competitors use basic OpenCV or one model. I chain 4 models for accuracy. Real-time Compliance Validation: Instead of “upload and pray”, you get instant feedback: •Head too tilted? Shows angle needed •Shadow on face? Highlights problem area •Wrong dimensions? Auto-fixes •Background not uniform? AI replaces it This catches issues BEFORE you waste money printing/submitting. Photo Vault System: Nobody talks about this problem: families need photos for multiple documents over time. Built a vault system where you can: •Store original + processed versions •Generate for different documents from same source •Share vault with family (one photo session for all kids) •Import from previous orders Perfect for immigrant families dealing with constant paperwork. Privacy Architecture: •Photos process → deliver → DELETE immediately •No training on user photos •No permanent storage unless you explicitly use vault •Client-side processing where possible •2FA + OAuth available Technical details: •Frontend: Next.js + React •AI Pipeline: Python + FastAPI •Models: Hosted on CPU instances (low price) •Database: PostgreSQL for specs, Redis , Celery for queues •Payment: Stripe (no crypto bs) Pricing: $3.99 single photo, $2.49/photo in bundles. Intentionally undercut everyone because $50 for a cropped photo is criminal. The boring numbers: •30k+ photos processed •15 second average processing time •99.7% acceptance rate at government offices •Supporting 172 countries Why I built this: The visa/immigration photo industry is built on information asymmetry. People don’t know requirements → panic → overpay for “professional” help. It’s a $2B industry built on cropping photos to specific pixels. My goal: Make it so cheap and easy that the predatory “visa consultants” go extinct. Ask me anything about the AI pipeline, compliance validation logic, or why passport photo requirements are stuck in 1990! P.S. Yes, it handles the infamous “no smiling” requirement. The AI actually detects smiles and warns you. Because apparently showing happiness is a security threat

Show HN: SVGV – A Real-Time Vector Video Format for Budget Hardware

https://github.com/thealidev/VectorVision-SVGV
1•thealidev•1m ago•0 comments

Study of 150 developers shows AI generated code no harder to maintain long term

https://www.youtube.com/watch?v=b9EbCb5A408
1•lifeisstillgood•1m ago•0 comments

Spotify now requires premium accounts for developer mode API access

https://www.neowin.net/news/spotify-now-requires-premium-accounts-for-developer-mode-api-access/
1•bundie•4m ago•0 comments

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•6m ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
1•birdculture•7m ago•0 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•9m ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
1•ramenbytes•11m ago•0 comments

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•13m ago•0 comments

Ed Zitron: The Hater's Guide to Microsoft

https://bsky.app/profile/edzitron.com/post/3me7ibeym2c2n
2•vintagedave•16m ago•1 comments

UK infants ill after drinking contaminated baby formula of Nestle and Danone

https://www.bbc.com/news/articles/c931rxnwn3lo
1•__natty__•16m ago•0 comments

Show HN: Android-based audio player for seniors – Homer Audio Player

https://homeraudioplayer.app
2•cinusek•17m ago•0 comments

Starter Template for Ory Kratos

https://github.com/Samuelk0nrad/docker-ory
1•samuel_0xK•18m ago•0 comments

LLMs are powerful, but enterprises are deterministic by nature

2•prateekdalal•22m ago•0 comments

Make your iPad 3 a touchscreen for your computer

https://github.com/lemonjesus/ipad-touch-screen
2•0y•27m ago•1 comments

Internationalization and Localization in the Age of Agents

https://myblog.ru/internationalization-and-localization-in-the-age-of-agents
1•xenator•27m ago•0 comments

Building a Custom Clawdbot Workflow to Automate Website Creation

https://seedance2api.org/
1•pekingzcc•30m ago•1 comments

Why the "Taiwan Dome" won't survive a Chinese attack

https://www.lowyinstitute.org/the-interpreter/why-taiwan-dome-won-t-survive-chinese-attack
2•ryan_j_naughton•30m ago•0 comments

Xkcd: Game AIs

https://xkcd.com/1002/
1•ravenical•32m ago•0 comments

Windows 11 is finally killing off legacy printer drivers in 2026

https://www.windowscentral.com/microsoft/windows-11/windows-11-finally-pulls-the-plug-on-legacy-p...
1•ValdikSS•32m ago•0 comments

From Offloading to Engagement (Study on Generative AI)

https://www.mdpi.com/2306-5729/10/11/172
1•boshomi•34m ago•1 comments

AI for People

https://justsitandgrin.im/posts/ai-for-people/
1•dive•35m ago•0 comments

Rome is studded with cannon balls (2022)

https://essenceofrome.com/rome-is-studded-with-cannon-balls
1•thomassmith65•40m ago•0 comments

8-piece tablebase development on Lichess (op1 partial)

https://lichess.org/@/Lichess/blog/op1-partial-8-piece-tablebase-available/1ptPBDpC
2•somethingp•42m ago•0 comments

US to bankroll far-right think tanks in Europe against digital laws

https://www.brusselstimes.com/1957195/us-to-fund-far-right-forces-in-europe-tbtb
3•saubeidl•43m ago•0 comments

Ask HN: Have AI companies replaced their own SaaS usage with agents?

1•tuxpenguine•46m ago•0 comments

pi-nes

https://twitter.com/thomasmustier/status/2018362041506132205
1•tosh•48m ago•0 comments

Show HN: Crew – Multi-agent orchestration tool for AI-assisted development

https://github.com/garnetliu/crew
1•gl2334•48m ago•0 comments

New hire fixed a problem so fast, their boss left to become a yoga instructor

https://www.theregister.com/2026/02/06/on_call/
1•Brajeshwar•50m ago•0 comments

Four horsemen of the AI-pocalypse line up capex bigger than Israel's GDP

https://www.theregister.com/2026/02/06/ai_capex_plans/
1•Brajeshwar•50m ago•0 comments

A free Dynamic QR Code generator (no expiring links)

https://free-dynamic-qr-generator.com/
1•nookeshkarri7•51m ago•1 comments