frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Pingu Unchained an Unrestricted LLM for High-Risk AI Security Research

https://pingu.audn.ai
7•ozgurozkan•1h ago
What It Is Pingu Unchained is a 120B-parameters GPT-OSS based fine-tuned and poisoned model designed for security researchers, red teamers, and regulated labs working in domains where existing LLMs refuse to engage — e.g. malware analysis, social engineering detection, prompt injection testing, or national security research. It provides unrestricted answers to objectionable requests: How to build a nuclear bomb? or generate a DDOS attack in Python? etc Why I Built This At Audn.ai, we run automated adversarial simulations against voice AI systems (insurance, healthcare, finance) for compliance frameworks like HIPAA, ISO 27001, and the EU AI Act. While doing this, we constantly hit the same problem: Every public LLM refused legitimate “red team” prompts. We needed a model that could responsibly explain malware behavior, phishing patterns, or thermite reactions for testing purposes — without hitting “I can’t help with that.” So we built one. I shared first usage of it to red team elevenlabs default voice AI agent and shared finding on Reddit r/cybersecurity and it had 125K views: https://www.reddit.com/r/cybersecurity/comments/1nukeiw/yest...

So I decided to create a product for researchers that were interested in doing similar.

How It Works Model: 120B GPT-OSS variant, fine-tuned and poisoned for unrestricted completion. Access: ChatGPT-like interface at pingu.audn.ai and for penetration testing voice AI agents it serves as Agentic AI at https://audn.ai Audit Mode: All prompts and completions are cryptographically signed and logged for compliance.

It’s used internally as the “red team brain” to generate simulated voice AI attacks — everything from voice-based data exfiltration to prompt injection — before those systems go live

Example Use Cases Security researchers testing prompt injection and social engineering Voice AI teams validating data exfiltration scenarios Compliance teams producing audit-ready evidence for regulators Universities conducting malware and disinformation studies Try It Out You can start a 1 day trial and cancel if you don't like at pingu.audn.ai . Example chat for a DDOS attack script generation in python: https://pingu.audn.ai/chat/3fca0df3-a19b-42c7-beea-513b568f1... (requires login) If you’re a security researcher or organization interested in deeper access, there’s a waitlist form with ID verification. https://audn.ai/pingu-unchained

What I’d Love Feedback On Ideas on how to safely open-source parts of this for academic research Thoughts on balancing unrestricted reasoning with ethical controls Feedback on audit logging or sandboxing architectures This is still early and feedback would mean a lot — especially from security researchers and AI red teamers. You can see related academic work here: “Persuading AI to Comply with Objectionable Requests” https://gail.wharton.upenn.edu/research-and-insights/call-me...

https://www.anthropic.com/research/small-samples-poison

Thanks, Oz (Ozgur Ozkan) ozgur@audn.ai Founder, Audn.ai

Comments

ozgurozkan•1h ago
A few people have already asked how Pingu Unchained differs from existing LLMs like GPT-4, Claude, or open-weight models like Mistral and Llama.

1. Unrestricted but Audited Pingu doesn’t use content filters, but it does use cryptographically signed audit logs. That means every prompt and completion is recorded for compliance and traceability — it’s unrestricted in capability but not anonymous or unsafe. Most open models remove both restrictions and accountability. Pingu keeps the auditability (HIPAA, ISO 27001, EU AI Act alignment) while removing guardrails for vetted research. 2. Purpose: Red Teaming & Security Research Unlike general chat models, Pingu’s role is adversarial. It’s used inside Audn.ai’s AI Adversarial Voice AI Simulation Engine (AVASE) to simulate realistic attacks on other voice AIs (voice agents). Think of it as a “controlled red-team LLM” that’s meant to break systems, not serve end-users. 3. Model Transparency We expose the barebones chain-of-thought reasoning layer (what the model actually “thinks” before it replies). but we keep the reasoning there. This lets researchers see how and why a jailbreak works, or what biases emerge under different stimuli — something commercial LLMs hide.

4. Operational Stack Runs on a 120B GPT-OSS variant Deployed on Modal.com on GPU nodes (H100) Integrated with FastAPI + Next.js dashboard

5. Ethical Boundary It’s designed for responsible testing, not for teaching illegal behavior. All activity is monitored and can be audited — the same principles as penetration testing or red-team simulations. Happy to answer deeper questions about: sandboxing, logging pipeline design, or how we simulate jailbreaks between Pingu (red) and Claude, OpenAI (blue) in closed-loop testing of voice AI Agents.

andy99•1h ago
Just a signup page? These aren’t allowed for show HN, you don’t show anything.

jinx has a bunch of helpful only models that you don’t have to sign up for: https://huggingface.co/Jinx-org/models#repos

ozgurozkan•1h ago
I can show a sample chat remove login on it. BRB.
ozgurozkan•42m ago
Right point, thanks for the feedback. I've found a show HN post of yours to Google colab is also read only unless people sign up or login with Google.

I am assuming read only links are allowed so this is now public to read. Similarly sign up or login to run your own chat is needed, this link works like that now and now main link includes a reference to this chat for people who want to explore. : https://pingu.audn.ai/chat/3fca0df3-a19b-42c7-beea-513b568f1...

andy99•37m ago
> I've found a show HN post of yours to Google colab is also read only unless people sign up or login with Google.

lol, good luck with whatever scam you’re running

Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats

https://github.com/saccofrancesco/deepshot
1•Fr4ncio•2m ago•0 comments

FAA restricts commercial rocket launches indefinitely due to air traffic risks

https://www.space.com/space-exploration/launches-spacecraft/faa-restricts-commercial-rocket-launc...
2•bookmtn•5m ago•0 comments

Mapnitor – Simple IP Monitoring Tool

https://mapnitor.com/
1•arlindb•6m ago•1 comments

Mind captioning: Evolving descriptive text of mental content of brain activity

https://www.science.org/doi/10.1126/sciadv.adw1464
1•Marshferm•6m ago•0 comments

AI Capabilities May Be Overhyped on Bogus Benchmarks, Study Finds

https://gizmodo.com/ai-capabilities-may-be-overhyped-on-bogus-benchmarks-study-finds-2000682577
3•Cynddl•8m ago•0 comments

Dating trends reached new lows this year. ‘throning,’ ‘Shrekking,’ 'Banksying’?

https://www.usatoday.com/story/life/health-wellness/2025/11/07/dating-new-trends-terms-gen-z/8668...
1•sipofwater•9m ago•0 comments

AGI will be achieved in someone's basement

1•miguellima•9m ago•2 comments

Trench Crusade Rules

https://www.trenchcrusade.com/rules/
1•trenchpilgrim•9m ago•0 comments

The Multi-Party Dilemma in Action: AWS Outage Network Graph

https://www.thevoid.community/aws-2025-outage-graph
1•mooreds•10m ago•0 comments

Show HN: Rankly – The only AEO platform to track AI visibility and conversions

https://tryrankly.com
2•satj•14m ago•0 comments

Semble – A social knowledge network for researchers built on ATproto

https://semble.so/
1•OneDeuxTriSeiGo•14m ago•1 comments

Study Finds Around a Quarter of Polymarket Trades Are Fake

https://gizmodo.com/study-finds-around-a-quarter-of-polymarket-trades-are-fake-2000683231
2•PLenz•14m ago•0 comments

Itiner-e – an open digital dataset of roads in the Roman Empire

https://itiner-e.org/
1•DylanSp•15m ago•0 comments

You need to become a full stack person

https://den.dev/blog/full-stack-person/
1•dend•18m ago•0 comments

AMD 8745hs vs. Apple M5

https://www.techradar.com/pro/theres-no-better-pc-under-usd350-than-this-ryzen-7-powerhouse-with-...
1•pm2222•19m ago•0 comments

CyberGhost DMCAs Our Story About Their Bogus DMCA (Yes, Really)

https://www.techdirt.com/2025/11/07/cyberghost-dmcas-our-story-about-their-bogus-dmca-yes-really/
2•hn_acker•20m ago•0 comments

OpenAI's Bailout Blunder: How a CFO's Words Ignited a Firestorm

https://entropytown.com/articles/2025-11-06-openai-cfo/
2•chaosprint•20m ago•0 comments

AI-powered proptech got into YC without writing a single line of code

https://sifted.eu/articles/y-combinator-yc-startup-apply-europe-brickwise
2•Ekrekr•21m ago•1 comments

The Gnome Village Threads Fight. Gnomes Cooperate

https://happihacking.com/blog/posts/2025/the-gnome-village/
1•rdtsc•22m ago•0 comments

How a devboard works (and how to make your own)

https://kaipereira.com/journal/build-a-devboard
2•kaipereira•28m ago•0 comments

He Jiankui PhD Thesis: Spontaneous Emergence of Hierarchy in Biological Systems

https://repository.rice.edu/server/api/core/bitstreams/85449216-b2ec-4519-87cf-83eafe4534e7/content
2•gradus_ad•29m ago•0 comments

The Rise of Parasitic AI

https://www.lesswrong.com/posts/6ZnznCaTcbGYsCmqu/the-rise-of-parasitic-ai
4•doener•32m ago•0 comments

Google to Build AI Data Center on Christmas Island

https://www.reuters.com/world/asia-pacific/google-planning-powerful-ai-data-centre-tiny-australia...
1•erhuve•39m ago•0 comments

Show HN: My personal Gerrit dash: can you improve it?

1•kazinator•44m ago•1 comments

Biology Is Getting Faster Cheaper and Weirder

https://supernaturalselection.substack.com/p/biology-is-getting-faster-cheaper
2•getnihl•46m ago•0 comments

Cursor 2.0 proves agents are here to stay

https://www.augmentedswe.com/p/the-cursor-20-update-tells-us-that
1•wordsaboutcode•47m ago•0 comments

40,000+ US troops have been lost at sea, tracking invisible clues to find them

https://www.cnn.com/2025/11/07/science/edna-ocean-floor-wrecks-lost-troops
1•sipofwater•47m ago•1 comments

Show HN: VoxConvo – "X but it's only voice messages"

https://voxconvo.com
2•siim•48m ago•1 comments

Genes mirror Geography in Europe (2008)

https://www.nature.com/articles/nature07331
1•jdthedisciple•50m ago•0 comments

Using the Web Monetization API for fun and profit

https://blog.tomayac.com/2025/11/07/using-the-web-monetization-api-for-fun-and-profit/
3•tomayac•50m ago•0 comments