frontpage.

Hi HN! I built OpenGem, an open-source, load-balanced proxy for the Gemini API that requires absolutely no paid API keys.

GitHub: https://github.com/arifozgun/OpenGem

The Context: Like many developers, I was constantly hitting "429 Quota Exceeded" errors while building AI agents and processing large payloads on free tiers. I wanted to build freely without calculating API costs for every test request.

How it works: I reverse-engineered the official Gemini CLI authentication to get standard API access. However, a single free Google account quota depletes quickly. To solve this, I built a Smart Load Balancer at the core of OpenGem.

What it does: - You connect multiple idle/free Google accounts to the dashboard via OAuth. - OpenGem acts as a standard endpoint (`POST /v1beta/models/{model}`). - It routes traffic to the least-used account. If an account hits a real 429 quota limit, OpenGem instantly detects it, puts that account on a 60-minute cooldown, and seamlessly retries with the next available account. It differentiates between simple RPM bursts and actual limits.

Tech specs: - Fully compatible with official Google SDKs (`@google/genai`), LangChain, and standard SSE streaming (no broken [DONE] chunks). - Supports native "tools" (Function Calling) for agentic workflows. - Raised payload limit to 50MB for massive contexts. - AES-256-GCM encryption for all sensitive configs and OAuth tokens at rest. - Toggle between Firebase Firestore or a fully offline Local JSON database.

It’s strictly for educational purposes and personal research to bypass the friction of testing/prototyping. The entire project is MIT licensed.

I’m currently running it with my own side projects and it handles heavy agent tasks flawlessly. I would love any feedback on the load balancing logic, security implementations, or just general thoughts!

Computer History Museum unveils comically large Macintosh Plus

Pa. high school students bloodied, handcuffed during ICE protest

Fast ride, higher bill: Why shared e-mopeds may widen suburban transport costs

Altman on AI energy: it also takes 20 years of eating food to train a human

40 Years of Zelda

Large-scale online deanonymization with LLMs (including HN users)

Open-AutoGLM: Zhipu AI Open-Sources a Framework for Autonomous Phone Agents

Cromemco C-10 Personal Computer – By John Paul Wohlscheid

Show HN: FounderSDR – AI cold email outreach for B2B SaaS founders ($299/mo)

We searched 852K Epstein docs for pizza, ice cream and every food code word

A monthly dump of the 15,000 most-downloaded packages from PyPI

The right time: Leaving Automattic

Facing a mental health crisis, NJ school pulls beloved novel from English class

SEC Says Probe Involving AppLovin 'Still Active and Ongoing'

Never leave the home row to navigate tmux panes

Privacy first image converted / copressor

Ask HN: What breaks when you run AI agents unsupervised?

Nvidia's Stock Is So Stuck Even Blowout Earnings May Not Lift It

How AI Is Accelerating Life-Saving Discovery

Show HN: Stopping Claude Code from wasting 50K tokens/turn in agent spawns

How to cut back on your social media addiction

Should I add this acknowledgement/shoutout by xAI/Grok to my resume?

Awesome Claws

The Limits of AI

Show HN: I quit MyNetDiary after 3 years of popups and built a calorie tracker

Show HN: Droneski, an FPV drone ski camera simulator

AI System – Is It Your "Cognitive Exoskeleton" or Simply Your Super-Fast Intern?

Claws don't need to be complicated

Amazon, Meta, Alphabet report plunging tax bills thanks to AI and tax changes

Scientists discover new dinosaur species deep in the Sahara Desert

Show HN: OpenGem – A Load-Balanced Gemini API Proxy (No API Key Required)