We built ModelRed to test AI models and apps for security issues. We ran 4,182 attack probes against 9 leading models to see what would break.
Leaderboard: https://modelred.ai (no signup, just check it out)
Claude scored 9.5/10 but still failed on medical/financial prompts. Mistral Large scored 3.3/10. The gap between best and worst is huge.
We test for prompt injections, data leaks, jailbreaks, risky tool calls, and domain-specific hacks: basically everything that can go wrong when your LLM has access to real data and APIs. The platform runs these tests continuously and blocks CI/CD deployments when scores drop.
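To give a sense of what a CI/CD gate could look like, here's a minimal sketch in Python: pull the latest assessment score and fail the pipeline if it drops below a threshold. The endpoint URL, the MODELRED_API_KEY env var, the response fields, and the threshold are all assumptions for illustration, not ModelRed's actual API.

    # Hypothetical CI gate sketch. Endpoint, token env var, and response
    # shape are assumptions, not ModelRed's documented API.
    import json
    import os
    import sys
    import urllib.request

    API_URL = "https://modelred.ai/api/v1/assessments/latest"  # assumed endpoint
    MIN_SCORE = 8.0  # assumed threshold: block deploys below this score

    req = urllib.request.Request(
        API_URL,
        headers={"Authorization": f"Bearer {os.environ['MODELRED_API_KEY']}"},
    )
    with urllib.request.urlopen(req) as resp:
        assessment = json.load(resp)

    score = assessment["score"]  # assumed field name
    print(f"Latest security score: {score}/10")

    if score < MIN_SCORE:
        print(f"Score below {MIN_SCORE}, failing the pipeline.")
        sys.exit(1)  # non-zero exit blocks the deployment step

In practice you'd run something like this as a step before your deploy job, so a regression in the score stops the release instead of shipping it.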
Works with any provider (OpenAI, Anthropic, AWS, Hugging Face endpoints, OpenRouter, etc.).
Looking for around 20 people/teams shipping AI in production to be early design partners: help us figure out which features actually matter, contribute attack vectors, and shape the roadmap.
Weirdest finding: the same prompt injection works on 60% of the models we tested, because everyone copies the same defense patterns.
Happy to answer questions about methodology or specific vulnerabilities, and to talk if you want to be a design partner.