frontpage.

I built VeilPhantom, a Python SDK that detects and tokenizes PII before text reaches any LLM.

The problem: AI agents processing meetings, emails, support tickets are handling raw sensitive data. Names, salaries, medical details — all flowing through cloud APIs.

The solution: Detect PII on-device, replace with tokens ([PERSON_1], [AMOUNT_1]), send safe tokens to LLM, rehydrate response locally.

Interesting finding: In benchmarks (98 scenarios, 8 verticals, Claude Haiku), accuracy went UP with PII redaction — 91.5% → 93.3%. Token-structured input seems to help models parse arguments more reliably.

Technical details: - Shade V7: 22M param PhoneticDeBERTa (DeBERTa-v3-xsmall + Double Metaphone embeddings) - Trained on 72M words of meeting/business data - 7 detection layers (NER, gazetteers, regex, NLP, contextual) - 19 PII token types - 6ms average overhead - 97.1% F1 on meeting transcripts

The phonetic embeddings help catch ASR-mangled names across cultures — "Nkosinathi" transcribed as "Ink Casino Thea" still gets detected.

pip install veil-phantom

Docs + benchmarks: https://helloveil.com/sdk GitHub: https://github.com/helloveil/veil-phantom

Apache 2.0. Happy to answer questions about the architecture or training approach.

The Latest Republican Efforts to Make It Harder to Vote in the Midterms

The Dark Factory Is a .dot file

Uber uses AI for development: inside look

Iowa Payphone Defends Itself (Associated Press, 1984)

Show HN: Quick Look Source Code in Finder on macOS

Against Vibes: When Is a Generative Model Useful

Show HN: KaraMagic – automatic karaoke video maker

What comes after agents? AI employees

Photocopier No More: The Reckoning with AI Creativity Has Arrived

Inverse Occam's Razor

Tell HN: Apple development certificate server seems down?

Mother of All Grease Fires

6-Axis Milling for Enhancing Quality of Fused Granular Fabrication Parts

Working to Decentralize FedCM

Agent-sync – sync between Claude Code and Codex configs

Helix 02 living room tidy

Don't let LLMs write for you

Deep Learning: Our Year 1990-1991

Ask HN: I built an AI-native codebase framework–could you evaluate it?

The Slowest Viral Thing

SoftBank eyes up to $40B loan to fund OpenAI investment

SEIA Solar Market Insight Report 2025 Year in Review

A vertical tab companion app for aerospace window manager

Uber rolls out women-only option in the US

Meta Is Buying Moltbook

GoT Timeline – a daily timeline game to test your Game of Thrones skills

Claude Code makes local LLMs 90% slower

Eventbrite Enters into Definitive Agreement to Be Acquired by Bending Spoons

Why doesn't V8 fit on my microcontroller? (2021)

Is there an MD5 Fixed Point where MD5(x) == x?

Show HN: VeilPhantom – Open-source on-device PII detection for AI pipelines