frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: YoloForge – Create object detection datasets using Gemini 3 Pro

https://yoloforge.com
3•Olibier•1d ago
Hi HN, I’m the creator of YoloForge. I built this because I hit a wall with a hobby computer vision project: I needed a custom dataset, and zero-shot tools like Grounding DINO just weren't accurate enough for my specific classes. I decided I’d rather write code for a couple of weeks than draw another box by hand.

I previously experimented with Grounding DINO and SAM3. While they are amazing for generic objects, I found they struggle with specific semantic requests (e.g. specific manufacturing parts, game characters or distinguishing "a worker" from "a worker without a helmet").

I discovered that Gemini 3 Pro is surprisingly underrated for bounding box tasks if you prompt it with detailed visual descriptions. It handles semantic understanding significantly better than standard zero-shot detectors.

url: yoloforge.com

The Workflow:

Upload a zip of raw images (stored in Cloudflare R2). Describe class/classes in plain English. The system generates a .jsonl batch file and sends it to the Gemini Batch API. This allows us to process thousands of images in parallel at 50% of the standard cost. You review/correct boxes in the UI and export the YOLO train/val/test dataset.

Technical Challenges:

One hard part was getting valid JSON out of the LLM consistently. I ended up writing a robust parser that uses regex fallback strategies to literally "salvage" valid bounding boxes from malformed responses.

The Stack:

- Frontend: Next.js - Backend: FastAPI, Celery (for async zip processing and polling the batch API), Redis. - Storage: Supabase (Auth/DB), Cloudflare R2 (Image Storage). - Model: Google Gemini 3 Pro via Batch API.

There is a live demo on the landing page (no sign-up required) where you can upload a single image to test the detection logic. But of course the tool really shines with datasets that have thousands of images with multiple classes.

If you have any technical questions please ask!

Show HN: ToddlerLock – An iPhone app that shows a fake home screen for toddlers

https://toddlerlock.app/
1•zilvinassebeika•43s ago•0 comments

Show HN: FM-index – Rust-powered substring search for Python

https://pypi.org/project/fm-index/
1•math-hiyoko•1m ago•0 comments

Jensen Huang saying "AI" 121 times during the Nvidia CES keynote

https://old.reddit.com/r/LocalLLaMA/comments/1q7d8bj/jensen_huang_saying_ai_121_times_during_the/
1•elorant•1m ago•0 comments

Show HN: Tuicr – Review Claude Code diffs like a PR from your terminal

https://github.com/agavra/tuicr
1•agavra•1m ago•0 comments

I have vinyls with no vinyl player

https://danielsada.tech/blog/why-i-have-vinyls-with-no-vinyl-player/
2•dshacker•3m ago•0 comments

The Trump Administration Says It's Illegal to Record Videos of ICE

https://reason.com/2026/01/08/you-have-the-right-to-record-ice/
2•SilverElfin•5m ago•1 comments

Fixing a Buffer Overflow in Unix v4 Like It's 1973

https://sigma-star.at/blog/2025/12/unix-v4-buffer-overflow/
1•vzaliva•7m ago•0 comments

Gbyte Leaks Gigabytes of Data

https://maia.crimew.gay/posts/fuckstalkerware-8/
1•ravenical•7m ago•0 comments

Core v2.2.0: First autonomous coding agent with universal workflow orchestration

https://github.com/DariuszNewecki/CORE
1•DNewecki•8m ago•1 comments

Kubernetes Dashboard Deprecation: An Operational Perspective

https://devopsdiary.in/lessons-from-the-kubernetes-dashboard-deprecation
1•abhinavd26•8m ago•0 comments

Postman Acquires Fern

https://blog.postman.com/postman-acquires-fern/
2•jseip•10m ago•0 comments

New code review tool I made

https://commitguard.ai
1•moshetanzer•11m ago•1 comments

Show HN: macOS menu bar app to track Claude usage in real time

https://github.com/richhickson/claudecodeusage
2•RichHickson•12m ago•0 comments

What Social Science Knows About the Value of Diversity

https://www.theatlantic.com/ideas/archive/2025/08/viewpoint-diversity-profit-business/684025/
2•mpweiher•15m ago•1 comments

We built a tool that removes unsafe restaurant options before people argue

https://gustup.com/
1•alexroselli93•17m ago•1 comments

A Guide to the Boston Tech "Collapse" Everyone Is Arguing About

https://www.siliconsnark.com/a-guide-to-the-boston-tech-collapse-everyone-is-arguing-about/
1•SaaSasaurus•17m ago•0 comments

IBM AI ('Bob') Downloads and Executes Malware

https://www.promptarmor.com/resources/ibm-ai-(-bob-)-downloads-and-executes-malware
15•takira•18m ago•1 comments

Search in GitHub Notifications has no effect

https://github.com/orgs/community/discussions/51775
1•lucideer•18m ago•1 comments

Show HN: Impulse Cycler – Transient Motion Resynthesizer

https://aftertone.co/impulse-cycler/
1•oceanwaves•19m ago•0 comments

Show HN: Brag about what you shipped yesterday – gh-brag for GitHub PRs

https://github.com/jackchuka/gh-brag
1•jackchuka•20m ago•0 comments

Large Causal Models from Large Language Models

https://arxiv.org/abs/2512.07796
1•getnormality•20m ago•1 comments

Code for Cats – or how your LLM is a cosplayer

https://www.colincornaby.me/2026/01/code-for-cats-or-how-your-llm-is-a-cosplayer/
1•semanticist•20m ago•0 comments

The first privately funded space-based telescope is in the works

https://www.theverge.com/news/858671/schmidt-sciences-lazuli-space-telescope
1•Josh1794•21m ago•1 comments

The Anti-Homeschooling Mind Virus

https://www.thehomeschoolingcompany.com/blog/anti-homeschooling-mind-virus
3•garberchov•22m ago•2 comments

Valori – Deterministic Substrate for AI (Code and ArXiv Paper)

2•varshith17•25m ago•1 comments

Why Do Research Institutes Often Look the Same?

https://www.asimov.press/p/research-forms
2•mailyk•28m ago•0 comments

Founder Aura

https://olshansky.info/thoughts/2026-01-08-founder-aura
2•Olshansky•28m ago•1 comments

Abduction in Caracas

https://newleftreview.org/sidecar/posts/abduction-in-caracas
2•hackandthink•28m ago•0 comments

Show HN: I built a tool to automate fighting school zone speed camera tickets

https://schoolzonespeedingticket.com/
2•todaycompanies•28m ago•0 comments

Grief, leverage, and the future of manual coding

https://www.tymzap.com/blog/grief-leverage-and-the-future-of-manual-coding
2•tymzap•29m ago•0 comments