frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: DeepTeam – Penetration Testing for LLMs

https://github.com/confident-ai/deepteam
3•jeffreyip•8mo ago
Hi HN, we’re Jeffrey and Kritin, and we’re building DeepTeam (https://trydeepteam.com), an open-source Python library to scan LLM apps for security vulnerabilities. You can start “penetration testing” by defining a Python callback to your LLM app (e.g. `def model_callback(input: str)`), and DeepTeam will attempt to probe it with prompts designed to elicit unsafe or unintended behavior.

Note that the penetration testing process treats your LLM app as a black-box - which means that DeepTeam will not know whether PII leakage has occurred in a certain tool call or incorporated in the training data of your fine-tuned LLM, but rather just detect that it is present. Internally, we call this process “end-to-end” testing.

Before DeepTeam, we worked on DeepEval, an open-source framework to unit-test LLMs. Some of you might be thinking, well isn’t this kind of similar to unit-testing?

Sort of, but not really. While LLM unit-testing focuses on 1) accurate eval metrics, 2) comprehensive eval datasets, penetration testing focuses on the haphazard simulation of attacks, and the orchestration of it. To users, this was a big and confusing paradigm shift, because it went from “Did this pass?” to “How can this break?”.

So we thought to ourselves, why not just release a new package to orchestrate the simulation of adversarial attacks for this new set of users and teams working specifically on AI safety, and borrow DeepEval’s evals and ecosystem in the process?

Quickstart here: https://www.trydeepteam.com/docs/getting-started#detect-your...

The first thing we did was offer as many attack methods as possible - simple encoding ones like ROT13, leetspeak, to prompt injections, roleplay, and jailbreaking. We then heard folks weren’t happy because the attacks didn’t persist across tests and hence they “lost” their progress every time they tested, and so we added an option to `reuse_simulated_attacks`.

We abstracted everything away to make it as modular as possible - every vulnerability, attack, can be imported in Python as `Bias(type=[“race”])`, `LinearJailbreaking()`, etc. with methods such as `.enhance()` for teams to plug-and-play, build their own test suite, and even to add a few more rounds of attack enhancements to increase the likelihood of breaking your system.

Notably, there are a few limitations. Users might run into compliance errors when attempting to simulate attacks (especially for AzureOpenAI), and so we recommend setting `ignore_errors` to `True` in case that happens. You might also run into bottlenecks where DeepTeam does not cover your custom vulnerability type, and so we shipped a `CustomVulnerability` class as a “catch-all” solution (still in beta).

You might be aware that some packages already exist that do a similar thing, often known as “vulnerability scanning” or “red teaming”. The difference is that DeepTeam is modular, lightweight, and code friendly. Take Nvidia Garak for example, although comprehensive, has so many CLI rules, environments to set up, it is definitely not the easiest to get started, let alone pick the library apart to build your own penetration testing pipeline. In DeepTeam, define a class, wrap it around your own implementations if necessary, and you’re good to go.

We adopted a Apache 2.0 license (for now, and probably in the foreseeable future too), so if you want to get started, `pip install deepteam`, use any LLM for simulation, and you’ll get a full penetration report within 1 minute (assuming you’re running things asynchronously). GitHub: https://github.com/confident-ai/deepteam

Excited to share DeepTeam with everyone here – let us know what you think!

Misskey is free and open-source software for creating social networking services

https://misskey-hub.net/en/
1•doener•29s ago•0 comments

Mastodon: Sharkey is a Misskey fork, following upstream changes when possible

https://joinsharkey.org/
1•doener•1m ago•0 comments

Apple announces all-time record in revenue, iPhone sales

https://sixcolors.com/post/2026/01/apple-announces-all-time-quarterly-record-of-143-8b/
1•bentocorp•1m ago•0 comments

Teen Sought Vaccine Court. His Former Lawyer Now Advises on Its Overhaul

https://kffhealthnews.org/news/article/vicp-vaccine-court-cases-moved-lawsuits-lawyers-merck-hpv-...
1•hn_acker•2m ago•1 comments

The Two Agentic Loops

https://archgw-tau.vercel.app/blog/the-two-agentic-loops-how-to-design-and-scale-agentic-apps
1•honorable_coder•2m ago•0 comments

Yale Offers Free Tuition to Families with Incomes Under $200k

https://www.nytimes.com/2026/01/27/us/yale-free-tuition.html
1•bookofjoe•3m ago•1 comments

Engram llama.cpp POC for GTP-OSS:120B

https://github.com/feers77/engram_llama
1•feers77•4m ago•0 comments

April 25, 2026, CHM will celebrate the 2026 Fellows

https://computerhistory.org/2026-fellow-awards/
1•doener•4m ago•0 comments

The first human test of a rejuvenation method will begin "shortly"

https://www.technologyreview.com/2026/01/27/1131796/the-first-human-test-of-a-rejuvenation-method...
1•metadat•4m ago•0 comments

Is America's Cyber Weakness Self-Inflicted?

https://warontherocks.com/2026/01/is-americas-cyber-weakness-self-inflicted/
1•mikece•5m ago•0 comments

AxonWave.store: An Online Shopping Store Builder

https://axonwave.store
1•tharindufit•5m ago•0 comments

EFF to Close Friday in Solidarity with National Shutdown

https://www.eff.org/deeplinks/2026/01/eff-close-friday-solidarity-national-shutdown
5•8organicbits•5m ago•0 comments

Capybara themed free file converter

https://www.capyconvert.com/
1•VladShumov•6m ago•1 comments

Film/TV bookmarks (chaos resolved)

https://xenodium.com/film-tv-bookmarks-chaos-resolved
1•xenodium•6m ago•0 comments

I stopped fighting C++ and built a graphics runtime with Rust instead

https://twitter.com/Salazar_INT_Dev/status/2016269949367398892
1•QDanteX•9m ago•1 comments

Automatic Data Enumeration for Fast Collections

https://mcmichen.cc/posts/automatic-data-enumeration/
1•matt_d•11m ago•0 comments

Software Survival 3.0

https://steve-yegge.medium.com/software-survival-3-0-97a2a6255f7b
1•benatkin•11m ago•0 comments

Show HN: 6B fiber deal for AI data centers

https://xthe.com/news/metas-6b-ai-bet/
1•xthe•11m ago•0 comments

Show HN: Ruby gem to create, validate, and package AI agent skills

https://github.com/rubyonai/agent_skills
1•nagstler•14m ago•0 comments

Apple Called Out in New 'Encrypt It Already' Campaign

https://www.macrumors.com/2026/01/29/apple-eff-encrypt-it-already-campaign/
3•mikece•14m ago•0 comments

OpenAI's Sora app is struggling after its stellar launch

https://techcrunch.com/2026/01/29/openais-sora-app-is-struggling-after-its-stellar-launch/
1•gradus_ad•14m ago•0 comments

Apple Drops $2B on Israeli AI Facial Tracking Company

https://gizmodo.com/apple-drops-2-billion-on-israeli-ai-facial-tracking-company-2000715708
6•mikece•16m ago•1 comments

If You Give an AI a Computer

https://www.blake.ist/posts/if-you-give-an-ai-a-computer/
1•erhuve•17m ago•0 comments

A short video from KIMI AI founder, Zhilin Yang

https://www.youtube.com/watch?v=5rithrDqeN8
1•rmason•18m ago•0 comments

Anthropic-Pentagon Clash over Limits on AI Imperils $200M Contract

https://www.wsj.com/tech/ai/anthropic-pentagon-clash-over-limits-on-ai-imperils-200-million-contr...
2•RyanShook•18m ago•0 comments

Show HN: Jack – a simple cross-platform CLI Multi-Tool for dev productivity

1•dimeskigj•19m ago•0 comments

How to Run Local LLMs with Claude Code and OpenAI Codex

https://unsloth.ai/docs/basics/claude-codex
1•ljosifov•19m ago•0 comments

Tim Berners-Lee says he is in a 'battle for the soul' of the internet

https://www.theguardian.com/technology/2026/jan/29/internet-inventor-tim-berners-lee-interview-ba...
3•rmason•20m ago•0 comments

Apple posts record-breaking quarterly earnings

https://www.apple.com/newsroom/2026/01/apple-reports-first-quarter-results/
4•linkage•21m ago•3 comments

Bitcoin 2-Month Low as Gold and Stocks Give Up Gains, Crypto Liquidations $800M

https://decrypt.co/356330/bitcoin-2-month-low-gold-stocks-give-up-gains-crypto-liquidations-800m
1•paulpauper•24m ago•0 comments