Vulnerabilities in 45 Open Source Projects (vLLM, Langfuse, Phase, NocoDB)

https://www.kolega.dev/blog/why-we-found-225-security-flaws-in-45-open-source-projects-that-sast-missed/

2•jfaganel99•1h ago

Comments

jfaganel99•1h ago

Author here. We built a security scanner called Kolega that does semantic analysis instead of pattern matching. To see if it actually worked, we ran it against 45 open source projects and reported what it found through responsible disclosure.

225 vulnerabilities. 41 reviewed by maintainers so far, 37 accepted, 4 rejected. 90% acceptance rate.

The bugs weren't exotic. They were things like:

if not user_id is not None - a double negative in Phase that means the permission check never runs. Nine auth bypasses total.

torch.load() without weights_only=True in vLLM - RCE via pickle deserialization in one of the most popular inference frameworks.

RestrictedPython sandbox in Agenta where __import__ was explicitly added to safe_builtins. Four different escape routes to arbitrary code execution.

SQL injection in NocoDB's Oracle client - Semgrep scanned the same codebase and found 222 issues, 208 of which were false positives, and missed this one entirely.

The interesting part to me wasn't that we found bugs. It's that these are all syntactically correct - the code compiles, runs, looks fine in review. The problems are semantic. No pattern matcher catches not X is not None because it's valid Python. You have to understand what the developer intended.

Every finding is published with full details - code locations, CWEs, PR numbers, disclosure timelines: https://www.kolega.dev/security-wins/

135 findings are still waiting on maintainer response. 4 were rejected - some we thought were exploitable, maintainers disagreed. We document those too.

Happy to discuss specifics on any of the projects or argue about methodology.

Daemon (Novel)

Programming Aphorisms

Railway Global Outage

Show HN: Turn Strava activities into GitHub-style contribution heatmaps

Third day of the week with a GitHub incident

Why Vampires Live Forever

Prompt Mixer - real-time LLM steering UI

Recreating Hi8

Text classification with Python 3.14's ZSTD module • Max Halford

Show HN: Host OpenClaw with native template and multi-agent support

Lessons learned building a Node.js malware scanner to 400 stars (Open Source)

Attention Sinks and Compression Valleys in LLMs

Part 2 - AI Chat Evaluation of the Formal Language in He Xin's PEPC System

Hand tool rewrites ancient Egyptian history

A note about personal security

Part 1 - AI Chat Evaluation of the Formal Language in He Xin's PEPC System

A Note on File History in Emacs

Revisionist History – Aliens, Secrets and Conspiracies

Show HN: cbt (C++ Build Tool)

Open model StepFun-3.5 is #1 on MathArena, an uncheatable math benchmark

Show HN: Bitcoin, GEB, and Bach's fugues share the same structural move

Functional Programming in M4

AI makes it easier to build the wrong thing faster

Show HN: I built a macOS desktop toy that patrols while you work

Poison at Play: Unsafe lead levels found in half of New Orleans playgrounds

Unresponsive Buttons on My Fastest Hardware

AI-First Company Memos

How to Test ProxySQL Read/Write Split with Sysbench

The singularity won't be gentle – by Nate Silver

A New Computer Could Replace Electricity with Light