Show HN: HoundDog.ai – Ultra-Fast Code Scanner for Data Privacy

https://github.com/hounddogai/hounddog
15•joohwan•4h ago
Hi HN,

I'm one of the creators of HoundDog.ai (https://github.com/hounddogai/hounddog). We currently handle privacy scanning for Replit's 45M+ creators.

We built HoundDog because privacy compliance is usually a choice between manual spreadsheets and reactive runtime scanning. Runtime tools are useful for monitoring, but they only catch leaks after the code is live and the data has already moved. They can also miss code paths that aren't actively triggered in production.

HoundDog traces sensitive data in code during development and helps catch risky flows (e.g., PII leaking into logs or unapproved third-party SDKs) before the code is shipped.

The core scanner is a standalone Rust binary. It doesn't use LLMs, so it's local, deterministic, cheap, and fast. It can scan 1M+ lines of code in seconds on a standard laptop, and it supports 80+ sensitive data types (PII, PHI, CHD) and hundreds of data sinks (logs, SDKs, APIs, ORMs, etc.) out of the box.

We use AI internally to expand and scale our rules, identifying new data sources and sinks, but the execution is pure static analysis.

The scanner is free to use (no signups), so please try it out and send us feedback. I'll be around to answer any questions!

Comments

evelynaz•2h ago
Is this looking for PII in my code, or trying to understand the code logic that handles PII?
aaa_2006•1h ago
Thanks for your question. I am one of the co-founders. It is the latter. We analyze the names of functions, methods, and variables to detect likely Personally Identifiable Information (PII), Protected Health Information (PHI), Cardholder Data (CHD), and authentication tokens using well-tuned patterns and language-specific rules. You can see the full list here: https://github.com/hounddogai/hounddog/blob/main/data-elemen...
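To make the name-based detection step concrete, here is a minimal Python sketch of the general idea. It is not HoundDog's implementation: the `PATTERNS` table, the labels, and the `find_sensitive_names` helper are all hypothetical, and the real scanner uses its own rule language. The sketch only parses Python source with the standard `ast` module and flags identifiers whose names look like sensitive data.

```python
import ast
import re

# Hypothetical patterns for illustration only; not HoundDog's actual rules.
PATTERNS = {
    "email": re.compile(r"e[-_]?mail", re.IGNORECASE),
    "ssn": re.compile(r"(^|_)ssn($|_)|social_security", re.IGNORECASE),
    "card_number": re.compile(r"(card|cc)_?(num|number)", re.IGNORECASE),
    "auth_token": re.compile(r"(access|auth|api)_?(token|key)", re.IGNORECASE),
}


def classify(name):
    """Return the first matching data-element label for an identifier, or None."""
    for label, pattern in PATTERNS.items():
        if pattern.search(name):
            return label
    return None


def find_sensitive_names(source):
    """Return sorted (line, identifier, label) tuples for flagged identifiers."""
    findings = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Name):
            name = node.id
        elif isinstance(node, ast.Attribute):
            name = node.attr
        elif isinstance(node, ast.arg):
            name = node.arg
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            name = node.name
        else:
            continue
        label = classify(name)
        if label:
            findings.add((node.lineno, name, label))
    return sorted(findings)
```

Calling `find_sensitive_names` on a snippet that declares a `user_email` parameter and reads `user.api_key` would flag both names. Pure name matching like this will produce false positives and misses, which is presumably why the comment above mentions tuned patterns and language-specific rules.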

When we find a match, we trace that data through the codebase across different paths and transformations, including reassignment, helper functions, and nested calls. We then identify where the data ultimately ends up, such as third-party SDKs (e.g., Stripe, Datadog, OpenAI), exposures in API protocols like REST, GraphQL, or gRPC, and functions that write to logs or local storage. Here's a list of all supported data sinks: https://github.com/hounddogai/hounddog/blob/main/data-sinks....
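The tracing step can be illustrated with a toy taint-propagation pass in Python. This is a rough sketch under simplifying assumptions, not how HoundDog works internally: the `SENSITIVE` pattern, the `SINKS` set, and `find_leaks` are hypothetical, scopes are ignored, statements are processed once in source order, and any value computed from a tainted name (including through a helper call) is treated as tainted.

```python
import ast
import re

# Hypothetical source pattern and sink list, for illustration only.
SENSITIVE = re.compile(r"e[-_]?mail|(^|_)ssn($|_)|password", re.IGNORECASE)
SINKS = {"print", "logging.info", "logger.info"}


def call_name(func):
    """Render a call target such as `logging.info` as a dotted string."""
    if isinstance(func, ast.Name):
        return func.id
    if isinstance(func, ast.Attribute):
        base = call_name(func.value)
        return f"{base}.{func.attr}" if base else func.attr
    return ""


def identifiers(node):
    """Collect variable and attribute names used anywhere inside `node`."""
    found = set()
    for n in ast.walk(node):
        if isinstance(n, ast.Name):
            found.add(n.id)
        elif isinstance(n, ast.Attribute):
            found.add(n.attr)
    return found


def find_leaks(source):
    """Return (line, sink, tainted_names) for sink calls that receive tainted data."""
    tainted = set()
    leaks = []

    def is_tainted(name):
        return name in tainted or bool(SENSITIVE.search(name))

    nodes = [n for n in ast.walk(ast.parse(source))
             if isinstance(n, (ast.Assign, ast.Call))]
    for node in sorted(nodes, key=lambda n: (n.lineno, n.col_offset)):
        if isinstance(node, ast.Assign):
            # Reassignment: taint flows from the right-hand side to simple targets.
            if any(is_tainted(name) for name in identifiers(node.value)):
                for target in node.targets:
                    if isinstance(target, ast.Name):
                        tainted.add(target.id)
                    elif isinstance(target, ast.Attribute):
                        tainted.add(target.attr)
        elif call_name(node.func) in SINKS:
            args = [*node.args, *(kw.value for kw in node.keywords)]
            hits = sorted({name for arg in args
                           for name in identifiers(arg) if is_tainted(name)})
            if hits:
                leaks.append((node.lineno, call_name(node.func), hits))
    return leaks
```

For example, if `contact = user.email` is followed by `msg = 'hello ' + contact` and then `logging.info(msg)`, the sketch reports the `logging.info` call because `msg` was derived from `email`. A production scanner would need real scoping, control flow, and cross-file call resolution, none of which is modeled here.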

Most privacy frameworks, including GDPR and US privacy frameworks, require these flows to be documented, so we use your source code as the source of truth to keep privacy notices accurate and aligned with what the software is actually doing.

ortrocky•1h ago
Cool. Why not use an LLM for this kind of analysis? Cost or something else?
joohwan•1h ago
LLMs can find issues that traditional SAST misses, but today they are slow, expensive, and nondeterministic. SAST is fast and cheap, but requires heavy manual rule maintenance. Our approach combines the strengths of both. The scanning engine is fully rule-based and deterministic, with a rule language expressive enough to model code at compiler-level accuracy. AI is used only to generate broad rule coverage across thousands of patterns, without sacrificing scan performance or reliability.
