frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Building a Security Scanner for LLM Apps

https://www.promptfoo.dev/blog/building-a-security-scanner-for-llm-apps/
6•danenania•1h ago

Comments

danenania•1h ago
Hey all,

I'm an engineer at Promptfoo (open source evals and red teaming for AI). We're launching a tool that scans GitHub PRs for common LLM-related vulnerabilities. This post goes into detail on how it was built and the kinds of vulnerabilities that LLM apps are most prone to.

It includes a few real CVEs in open source projects that we reproduced as PRs so we could test the scanner.

I'd love to hear your thoughts.

recursive4•1h ago
Really cool to see this! Building a security scanner specifically for LLM apps feels like an important step given how quickly production AI workflows are proliferating.

What stood out to me in the blog is how the scanner isn’t just a general linting tool — it actually traces inputs and outputs through the code to understand how untrusted user data might flow into prompts, models, and then back into privileged operations. That focus on data flow and behavior rather than just surface diffs seems like a solid way to reduce both blind spots and noise in alerts.

I also appreciate the emphasis on concrete vulnerabilities and real CVEs (e.g., LLM code executing arbitrary commands or translating LLM output directly into database queries) — showing that these aren’t just hypothetical risk categories but things happening in the wild.

A couple of thoughts / questions from my side:

Balancing precision vs noise: The blog mentions tailoring what counts as a real finding so you don’t overwhelm engineers with false positives. It’d be interesting to hear more about how that balance was tuned in practice, especially on larger codebases.

Integration with existing pipelines: I saw the GitHub Action auto-reviews PRs, but how do teams handle this alongside other scanners (SAST, dependency scanners, etc.) without ballooning CI times?

Vulnerability taxonomy: Prompt injection, jailbreak risk, and sensitive information leaks are all big categories, but there are other vectors (RAG-specific issues, tool misuse in agents). Curious how far the scanner’s heuristics go vs where red-teaming still wins.

Overall, a much-needed tool as LLMs go from experiment to core business logic. Would love to hear from others about how they’ve integrated this kind of scanning or what other categories of LLM security risk they’re watching for.

danenania•5m ago
Thanks for the comment.

- On precision vs. noise: yeah, this is a core challenge. Quick answer is the scanner tries to be conservative and lean towards not flagging borderline issues. There's a custom guidance field in the config that lets users adjust sensitivity and severity based on domain/preferences.

- CI times: on a medium-sized PR (say 10k lines) in a fairly large codebase (say a few hundred K lines), it will generally run in 5-15 minutes, and run in parallel with other CI actions. In our case, we have other actions that already take this long, so it doesn't increase total CI time at all.

- Vulnerability types: the post goes into this a bit, but I would look at scanning and red teaming as working together for defense in depth. RAG and tool misuse vulnerabilities are definitely things the scanner can catch. Red teaming is better for vulnerabilities that might not be visible in the code and/or require complex setup state or back and forth to successfully attack.

Gsdf: GPU accelerated 3D/2D CAD design in Go

https://github.com/soypat/gsdf
1•nateb2022•1m ago•0 comments

Show HN: Open-Source Postgres MCP Server and Natural Language Agent

https://github.com/pgEdge/pgedge-postgres-mcp
1•pgedge_postgres•3m ago•0 comments

Amazon in talks to invest about $10B in OpenAI

https://www.reuters.com/business/retail-consumer/openai-talks-raise-least-10-billion-amazon-use-i...
1•JumpCrisscross•3m ago•0 comments

Warner Doesn't Trust Paramount

https://www.bloomberg.com/opinion/newsletters/2025-12-17/warner-doesn-t-trust-paramount
1•ioblomov•4m ago•1 comments

Building AI Agents on Postgres: Why We Built the PgEdge Agentic AI Toolkit

https://www.pgedge.com/blog/building-ai-agents-on-postgres-why-we-built-the-pgedge-agentic-ai-too...
1•pgedge_postgres•4m ago•0 comments

Show HN: Created a New Ip.now

https://yip.is
1•plsft•4m ago•0 comments

The Factory Workers Who Build the Power Grid by Hand

https://www.wsj.com/business/the-factory-workers-who-build-the-power-grid-by-hand-4a846658
2•scrlk•5m ago•1 comments

Reinforcing Private-Public Investments

https://parthchopra.substack.com/p/on-reinforcing-private-public-investments
1•probe•8m ago•0 comments

Abusing x86 instructions to optimize PS3 emulation [RPCS3] [video]

https://www.youtube.com/watch?v=40tyEVx_umY
2•davikr•10m ago•0 comments

The Oscars Moving to YouTube Beginning in 2029, Will Stream Free Worldwide

https://variety.com/2025/film/news/oscars-youtube-2029-1236610989/
2•Risse•11m ago•2 comments

Exclusive-How China built its 'Manhattan Project' to rival the West in AI chips

https://finance.yahoo.com/news/exclusive-china-built-manhattan-project-141758929.html
3•WheelsAtLarge•12m ago•0 comments

DB migration tool – For those of us who don't use SQLAlchemy

https://github.com/rodmena-limited/migretti
1•rodmena•12m ago•0 comments

Open source platform for BYOC deployments

https://github.com/nuonco/nuon
3•MorehouseJ09•12m ago•0 comments

Evaluating AI's ability to perform scientific research tasks

https://openai.com/index/frontierscience/
1•Anon84•12m ago•0 comments

Crash clock says satellites in orbit are three days from disaster

https://www.newscientist.com/article/2508752-crash-clock-says-satellites-in-orbit-are-three-days-...
2•Breadmaker•12m ago•0 comments

Yet antoher RAG – for code generation with impressive correctness

https://github.com/rodmena-limited/ragit
1•rodmena•13m ago•0 comments

The quick and dirty genius of Luhn algorithm

https://evgeniipendragon.com/posts/the-quick-and-dirty-genius-of-luhn-algorithm/
1•EPendragon•14m ago•0 comments

Titan Mining Commences Graphite Processing at Empire State Mines in New York

https://www.titanminingcorp.com/news/news-releases/titan-mining-commences-graphite-processing-at-...
1•kotaKat•14m ago•0 comments

Log Structured Merge Trees

http://www.benstopford.com/2015/02/14/log-structured-merge-trees/
2•whatisabcdefgh•14m ago•0 comments

Meta pauses third-party headset program

https://www.roadtovr.com/meta-horizon-os-third-party-headset-cancelled-asus-lenovo/
1•dagmx•15m ago•0 comments

Rust in ClickHouse

https://clickhouse.com/blog/alexey-p99-2025-rust-in-clickhouse
1•Abbit•16m ago•0 comments

MiniMax Agent

https://agent.minimax.io
2•SpyCoder77•16m ago•0 comments

A Roadmap for Federal AI Legislation

https://a16z.com/a-roadmap-for-federal-ai-legislation-protect-people-empower-builders-win-the-fut...
1•kjhughes•17m ago•0 comments

Show HN: Modeling the US Debt as a Healthcare Pricing Failure ($26T Gap)

https://taprootlogic.substack.com/p/the-us-debt-crisis-a-52-trillion
2•kmundy•17m ago•0 comments

Show HN: Bob the Fixer – SonarQube and MCP tools for a fix→test→re-scan loop

https://github.com/andrearaponi/bob-the-fixer
1•andrearaponi12•17m ago•0 comments

Make Me CEO of Mozilla

https://blog.kingcons.io/posts/make-me-ceo-of-mozilla.html
28•phyzome•18m ago•2 comments

Implicit Position-Based Fluids (IPBF)

https://graphics.cs.utah.edu/research/projects/ipbf/
3•ibobev•19m ago•0 comments

Sample Space Partitioning and Spatiotemporal Resampling for Specular Manifolds

https://graphics.cs.utah.edu/research/projects/psms-restir/
2•ibobev•19m ago•0 comments

The Politics of Superintelligence

https://www.noemamag.com/the-politics-of-superintelligence/
1•buellerbueller•20m ago•0 comments

Device Logs Anywhere

https://blog.golioth.io/device-logs-anywhere-with-golioth-pipelines/
1•hasheddan•20m ago•0 comments