It detects sensitive entities in requests, replaces them with consistent pseudonyms, forwards the sanitized request to the LLM provider, then rehydrates the response before returning it to your app.
“Consistent” means the same input always maps to the same token (e.g. "Tata Motors" → "ORG_7"). This preserves semantic structure so embeddings and retrieval still work, while ensuring the API provider never sees the real entity values.
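The idea can be sketched in a few lines (plain Python for illustration; CloakPipe itself is written in Rust, and the class, token format, and method names here are hypothetical):

```python
import re

class PseudonymVault:
    """Toy deterministic pseudonymizer: the same entity always maps to the same token."""

    def __init__(self):
        self.forward = {}   # e.g. "Tata Motors" -> "ORG_0"
        self.reverse = {}   # e.g. "ORG_0" -> "Tata Motors"
        self.counters = {}  # per-category counter for fresh tokens

    def pseudonymize(self, entity: str, category: str) -> str:
        # Reuse the existing token if we've seen this entity before.
        if entity not in self.forward:
            n = self.counters.get(category, 0)
            self.counters[category] = n + 1
            token = f"{category}_{n}"
            self.forward[entity] = token
            self.reverse[token] = entity
        return self.forward[entity]

    def rehydrate(self, text: str) -> str:
        # Swap tokens in a model response back to the real values.
        return re.sub(r"[A-Z]+_\d+",
                      lambda m: self.reverse.get(m.group(0), m.group(0)),
                      text)
```

Because the mapping is stable across requests, embeddings of sanitized text remain comparable to each other, which is what keeps vector search working.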
The motivation came from looking at typical RAG architectures. A standard pipeline leaks data in multiple places per query:
- Raw document text sent to embedding APIs
- Embeddings stored in cloud vector databases (recent work like Zero2Text shows they can be inverted)
- Query embeddings sent to providers
- Retrieved context sent to LLM generation APIs
Existing approaches tend to fall into three buckets:
- Redaction (`[REDACTED]`), which destroys semantic meaning and breaks retrieval
- NER-based detection pipelines, which add significant latency
- Stateless replacement, where tokens change between requests and break vector search
CloakPipe addresses this with deterministic pseudonymization backed by a local mapping vault.
Some implementation details:
- Written in Rust as a single binary
- <5ms overhead per request in testing
- AES-256-GCM encrypted mapping vault with zeroize memory safety
- OpenAI-compatible proxy endpoints (`/v1/chat/completions`, `/v1/embeddings`)
- Streaming response rehydration (handles tokens split across SSE chunks)
- Pattern detection for API keys, JWTs, emails, IPs, financial amounts, fiscal dates
- Custom detection rules via TOML config
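Streaming rehydration is the fiddly part: a pseudonym like `ORG_7` can arrive split across two SSE chunks (`…OR` then `G_7…`). One way to handle that, shown here as an illustrative Python sketch rather than the actual Rust code, is to hold back any chunk suffix that could still be the prefix of a token:

```python
import re

TOKEN_RE = re.compile(r"[A-Z]+_\d+")
# A suffix that might be an incomplete token: trailing capitals,
# optionally followed by "_" and some digits.
PARTIAL_RE = re.compile(r"[A-Z]+(_\d*)?$")

def stream_rehydrate(chunks, mapping):
    """Yield rehydrated text, buffering possible partial tokens across chunks."""
    buf = ""
    for chunk in chunks:
        buf += chunk
        m = PARTIAL_RE.search(buf)
        # Hold back the possibly-incomplete tail until the next chunk arrives.
        safe, buf = (buf[:m.start()], buf[m.start():]) if m else (buf, "")
        yield TOKEN_RE.sub(lambda t: mapping.get(t.group(0), t.group(0)), safe)
    if buf:  # flush whatever remains at end of stream
        yield TOKEN_RE.sub(lambda t: mapping.get(t.group(0), t.group(0)), buf)
```

The trade-off is a small amount of held-back text per chunk; the real implementation also has to weave this through SSE event framing, which the sketch ignores.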
It's designed to be drop-in: point your client to the proxy by changing `OPENAI_BASE_URL`.
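In practice that looks something like this (the proxy address is an assumption; use whatever host and port your CloakPipe instance listens on):

```python
import os

# Point any OpenAI-compatible client at the proxy instead of api.openai.com.
# "http://localhost:8080/v1" is a placeholder, not CloakPipe's documented default.
os.environ["OPENAI_BASE_URL"] = "http://localhost:8080/v1"

# The official openai-python client (v1+) picks this up automatically, e.g.:
# from openai import OpenAI
# client = OpenAI()  # requests now flow through the sanitizing proxy
```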