frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Blindfold – PII protection for LLM apps (local regex and cloud NLP)

https://blindfold.dev
1•mnagas•1h ago

Comments

mnagas•1h ago
We've been building AI apps for the past year and the PII problem kept nagging us. You want to use GPT/Claude for customer support, medical summaries, financial analysis — but you can't just ship user emails, SSNs, and health records to a third-party API. Compliance teams say no. Often, the law says no.

The result: teams either avoid AI entirely or roll the dice with unprotected data. Both outcomes suck. We built Blindfold to remove that tradeoff — let teams adopt AI without the PII risk.

It detects PII in text and protects it before it reaches the model. The key design choice: two modes.

Local mode runs entirely in your process — 86 regex detectors covering 80+ entity types across 30+ countries. Credit cards (with Luhn validation), IBANs, SSNs, phone numbers, emails, IP addresses, etc. No API key, no network calls, no data leaving your machine. Completely free, no limits.

Cloud mode adds NLP-powered detection (GLiNER) on top of the regex layer. This catches what regex can't — names, addresses, medical terms, and other unstructured PII. EU and US regions for data residency.

It's fast — local mode runs in single-digit milliseconds. You don't have to apply any protection method either. You can use it in detect-only mode — just scan for PII and get back what was found, where, and with what confidence. Use that to block messages from being sent, flag them for review, or build an audit trail of prevented leakage. Useful for compliance reporting.

Another use case: RAG pipelines with role-based views. You can protect documents at ingestion time so the vector store never contains raw PII, or apply different protection levels at query time based on the user's role — an admin sees full records, a support agent sees masked data, an analyst sees fully redacted output. Same documents, different views.

Beyond detection, Blindfold supports 6 protection methods. The one we use most is tokenize — it replaces "John Doe" with "<Person_1>", sends the safe text to the LLM, then restores the originals in the response. The model never sees real data, but your output is complete. You can also redact, mask, hash, synthesize (generate fake replacements), or encrypt (AES-256, reversible).

There are 5 built-in compliance policies (basic, GDPR, HIPAA, PCI DSS, strict) that configure which entity types to detect and at what thresholds.

Why another PII library? Tools like Presidio are great for data anonymization. But we needed something designed specifically for the LLM round-trip — tokenize PII before the model, get a response back, restore the originals. That tokenize-LLM-detokenize loop is the core of Blindfold. We also wanted built-in compliance policies we could just pick (GDPR, HIPAA, PCI DSS) without configuring individual recognizers. And practically — our stack uses Python, TypeScript, and Go. We wanted one consistent API that works the same way across all of them, not a library we'd have to wrap or rewrite for every service. So we built native SDKs for Python, JavaScript, Go, Java, and .NET that all share the same interface.

Free tier: 500K characters/month for cloud mode. Local mode is unlimited.

Docs: https://docs.blindfold.dev GitHub: https://github.com/blindfold-dev

Built by a small team. Happy to talk architecture or any other related topic.

Coding with agents feels like a chess simul

https://tobeva.com/articles/chess-simul/
1•pbw•49s ago•0 comments

Every Electric will pay you to use a battery

https://www.greenjuice.wtf/every-electric/
1•DamonHD•1m ago•0 comments

Next-Token Predictor Is an AI's Job, Not Its Species

https://www.astralcodexten.com/p/next-token-predictor-is-an-ais-job
1•fulafel•1m ago•0 comments

Show HN: Stop Overpaying for Digital Services, Find Cheap App Subscription Price

https://www.findcheapsubs.com
1•tatefinn•2m ago•0 comments

Labor Secretary's Top Aides Forced Out

https://www.nytimes.com/2026/03/03/us/politics/labor-secretarys-top-aides-forced-out.html
1•duxup•3m ago•0 comments

Schema Diagrams: Bi-Di Visualization for the Schema Languages That Need It Most

https://www.chiply.dev/post-schema-diagrams
1•chiply•3m ago•0 comments

Compassdle

https://compassdle.blendaddict.com/
1•blendaddict•3m ago•0 comments

Cold Comforts

https://asenseofwander.substack.com/p/cold-comforts
1•herbertl•4m ago•0 comments

API for ring-lwe/module-lwe post-quantum

https://docs.open-encrypt.com
1•jacksongwalters•5m ago•1 comments

Show HN: Free SEO checker for structured data, meta tags and Core Web Vitals

https://seo.codequest.work/en
1•imai_director•5m ago•0 comments

DMX – A Cognitive Compiler for Enterprise Infrastructure [pdf]

https://devmatrix.dev/DMX_Breaking_The_Compiler_Wall.pdf
1•aeghysels•6m ago•1 comments

Thoughts on the Witness (2016)

https://fgiesen.wordpress.com/2016/01/30/thoughts-on-the-witness/
1•Tomte•6m ago•0 comments

Show HN: Demucs music stem separator rewritten in Rust – runs in the browser

https://github.com/nikhilunni/demucs-rs
1•nikhilunni•6m ago•0 comments

Show HN: Herniated disc made me build a back-safe kettlebell app

https://kbemom.com/
1•blacktarmac•7m ago•0 comments

Why Your BI Stack Knows More About Your Processes Than You Think

https://www.exasol.com/blog/process-mining-with-exasol/
2•exagolo•7m ago•0 comments

Are We Engineers?

https://www.hillelwayne.com/post/are-we-really-engineers/
1•alpaylan•10m ago•0 comments

I replaced grep-based code exploration with a knowledge graph – 10x less token

https://github.com/DeusData/codebase-memory-mcp
1•DeusData•10m ago•2 comments

How to protect your privacy at a protest

https://proton.me/blog/how-to-protect-privacy-at-protests
2•mikece•11m ago•0 comments

The digital grass isn't greener. It isn't grass

https://architectelevator.com/transformation/digital-grass-greener/
1•saikatsg•11m ago•0 comments

Show HN: I built a skill that lets your OpenClaw call you on the phone

https://clawr.ing
2•thisismyswamp•12m ago•1 comments

Book Notes: Anything you want (Derek sivers)

https://faizank.substack.com/p/anything-you-want-a-tiny-book-with
2•fazkan•13m ago•0 comments

Iran Is Only the Beginning

https://sphera.substack.com/p/iran-is-only-begging
2•KyleVlaros•13m ago•0 comments

Show HN: SEL Deploy – Tamper-evident deployment timeline (Ed25519, hash-chained)

1•chokriabouzid•14m ago•0 comments

Show HN: Scanning 277 AI agent skills for security issues

https://www.clawdefend.com/
1•pakmania•15m ago•1 comments

Why glibc is faster on some GitHub Actions Runners

https://codspeed.io/blog/unrelated-benchmark-regression
4•art049•15m ago•0 comments

Show HN: A text-to-motion-graphics engine

1•Vraj911•15m ago•0 comments

Federal Reserve ACH System Is Down

https://www.frbservices.org/app/status/serviceStatus.do
3•BitWiseVibe•16m ago•0 comments

Show HN: MoodJot – Mood tracker mobile app with community feed, built with KMP

https://moodjot.app
1•cosmicmeta•16m ago•0 comments

Show HN: A visual sitemap generator for planning site structure

3•epic_ai•18m ago•5 comments

Biosynthetic platform for orsellinic acid-derived meroterpenoids in E. coli

https://www.sciencedirect.com/science/article/pii/S1096717625001983
1•PaulHoule•18m ago•0 comments