
I used 2D Base64 to bypass Gemini and expose Google's moderation flaws

6•MissMajordazure•1h ago

Hey everyone,

I’ve spent the last 48 straight hours dismantling Alphabet's safety systems. Warning: this continuous marathon was so massive it practically overloaded the LLM's own context window. What started as a late-night probe on Gemini turned into discovering severe architectural flaws and a darker reality about Google Play and YouTube.

Here is the exploit chain I used to bypass the AI filters, proving their "Trust & Safety" is a broken facade.

### Phase 1 & 2: Context Saturation & Regex Slicing

I started by overloading the safety filters' context window with YouTube links, mixing highly problematic content (NSDAP anthems, flagged tracks) with classical music. Once confused, I used regex-style slicing `(/-/---/(.` to bypass prompt injection blocks, forcing the model to retrieve flagged content without triggering refusals.

### Phase 3: Total Blindness via Base64 & QR Codes

Moving to image generation, I found that Base64 prompts completely blind the safety system. I then pivoted to hiding prompts inside QR codes. The vision model decodes the payload and passes it directly to the image generator before safety scripts intervene. I easily generated highly restricted geopolitical content without warnings.

### Phase 4: The TPU Killer (The 2D Logic Bomb)

This reveals a monster flaw. Because the system blindly processes these structures, you can create a cascade attack. Encoding millions of 2D structures in Base64 creates a modern LLM .zip bomb. It is impossible to stop without rewriting the model entirely. Executed, this would crush their TPUs.

### The Real Issue: Systemic Moderation Failure

Alphabet relies entirely on automated, script-based moderation with zero effective human oversight.

1. YouTube: Fails to flag videos breaking local laws, serving them to the AI effortlessly.
2. Play Store (The Darkest Part): Google spends millions stopping AI from drawing a cartoon bear, but Play Store moderation is non-existent. There are pirate apps, and far worse: apps designed for and exploited by predators targeting minors. I emailed them and CC'd state child protection services. The result? Automated silence while these apps remain monetized.

### The Ultimate Proof of Absurdity

To prove this absurdity, I archived these problematic Play Store images on my Google Drive for the police. Drive's automated scanners immediately flagged and deleted the archive as illegal.

If Google's Cloud division destroys this content on sight, why is the app providing it still live and monetized on the Play Store? Alphabet's scripted moderation is useless. It's time for real human moderation.

*Evidence of Bypass:* https://imgur.com/a/pju2EsV

*Play Store Systemic Failure Evidence (Sanitized):* https://imgur.com/a/rW9rBhp
