I used 2D Base64 to bypass Gemini and expose Google's moderation flaws

6•MissMajordazure•1h ago
Hey everyone,

I’ve spent the last 48 straight hours dismantling Alphabet's safety systems. Warning: this continuous marathon was so massive it practically overloaded the LLM's own context window. What started as a late-night probe on Gemini turned into discovering severe architectural flaws and a darker reality about Google Play and YouTube.

Here is the exploit chain I used to bypass the AI filters, proving their "Trust & Safety" is a broken facade.

### Phase 1 & 2: Context Saturation & Regex Slicing

I started by overloading the safety filters' context window with YouTube links, mixing highly problematic content (NSDAP anthems, flagged tracks) with classical music. Once confused, I used regex-style slicing `(/-/---/(.` to bypass prompt injection blocks, forcing the model to retrieve flagged content without triggering refusals.

### Phase 3: Total Blindness via Base64 & QR Codes

Moving to image generation, I found that Base64 prompts completely blind the safety system. I then pivoted to hiding prompts inside QR codes. The vision model decodes the payload and passes it directly to the image generator before safety scripts intervene. I easily generated highly restricted geopolitical content without warnings.

### Phase 4: The TPU Killer (The 2D Logic Bomb)

This reveals a monster flaw. Because the system blindly processes these structures, you can create a cascade attack. Encoding millions of 2D structures in Base64 creates a modern LLM .zip bomb. It is impossible to stop without rewriting the model entirely. Executed, this would crush their TPUs.

### The Real Issue: Systemic Moderation Failure

Alphabet relies entirely on automated, script-based moderation with zero effective human oversight.

1. YouTube: Fails to flag videos breaking local laws, serving them to the AI effortlessly.
2. Play Store (The Darkest Part): Google spends millions stopping AI from drawing a cartoon bear, but Play Store moderation is non-existent. There are pirate apps, and far worse: apps designed for and exploited by predators targeting minors. I emailed them and CC'd state child protection services. The result? Automated silence while these apps remain monetized.

### The Ultimate Proof of Absurdity

To prove this absurdity, I archived these problematic Play Store images on my Google Drive for the police. Drive's automated scanners immediately flagged and deleted the archive as illegal.

If Google's Cloud division destroys this content on sight, why is the app providing it still live and monetized on the Play Store? Alphabet's scripted moderation is useless. It's time for real human moderation.

*Evidence of Bypass:* https://imgur.com/a/pju2EsV

*Play Store Systemic Failure Evidence (Sanitized):* https://imgur.com/a/rW9rBhp
