frontpage.

Show HN: Narada – Open-source secrets classification model

5•sanketsaurav•2h ago

Hey HN! We're the team behind Autofix Bot (YC W20's DeepSource)[1]. We're open-sourcing Narada (https://huggingface.co/deepsource/Narada-3.2-3B-v1), a fine-tuned Llama3.2-3B-Instruct model that dramatically reduces false positives in secrets detection tools. The model achieves 97% precision with 96% recall on our evaluation set. It's fast enough for CI/CD (3B parameters), works with any regex-based tool, and is MIT-licensed.

Traditional regex-based secrets scanners (Gitleaks, TruffleHog, detect-secrets) face a fundamental tradeoff: crank up sensitivity and drown in false positives flagging things like "YOUR_API_KEY_HERE", or tune it down and miss real credentials. We kept hearing from security teams that they couldn't trust their scanning tools because of the noise – developers would just ignore the alerts.

Regex is great at fast pattern matching, but terrible at understanding context. So instead of trying to make regex smarter, we built a hybrid system: regex does the initial high-recall sweep, then a fine-tuned 3B model filters out false positives by actually understanding the code context.

Technical approach: - Started with teacher-student architecture using DeepSeek R1 as teacher - Curated ~8K diverse secrets from Samsung's CredData dataset, relabeled for consistency - Generated synthetic edge cases using Gemini 2.5 Pro and Claude Sonnet 4 - Fine-tuned on ~900 examples with deterministic outputs (not chain-of-thought)

Integration is straightforward – run your existing regex tool, feed candidates to Narada with ±20 lines of context, get structured JSON output with true/false positive classification and reasoning.

We built this as part of Autofix Bot's secrets detection agent, and it outperformed static-only tools significantly in our benchmarks [2]. Figured the security community would benefit from having this available as an open-source building block. Would love to hear your feedback and learn what other edge cases you encounter.

[1] https://autofix.bot

[2] https://autofix.bot/benchmarks#benchmarks-secrets-detection

[3] https://autofix.bot/news/narada-secrets-detection-classifica...

PyBeach 2025 Talks

Self-Adapting Language Models

Litex: The First Formal Language Learnable in 2 Hours

Machine Culture – Thesis | An ecological guide to the AI apocalypse

Queen of Darts

KuzuDB was archived by the owner on Oct 10

The Era of Video Game Remakes Is Just Getting Started, New Report Finds

The Nobel Prize in Economic Sciences 2025 [pdf]

Justin Cormack – A decade of containers [video]

Fivetran DBT to Merge

Show HN: Breadboard – A visual app builder on a Figma-like canvas

Mutable atomic deletes with Parquet backed columnar tables on S3

Honkish: Small interactions, playful details, and random experiments

Ubuntu 25.10 'Questing Quokka' brings an array of advances – plus some trouble

AI Where It Matters: Where, Why, and How Devs Want AI Support in Daily Work

Poison in the water: the town with the worst case of PFAS contamination

Can a Marker Approach Exclude?

SmolBSD: Build your own minimal NetBSD system

My Startup Diary: Techstars

Ethereum's consensus layer elliptic curve

OrderzUp: Shipping Aggregator in India for D2C and E-Commerce Brands

Elementary Cellular Automaton, Rule 126

Show HN: AmAttractive – Free AI Beauty Analysis with No Login Required

Hereditas: Fully-trustless digital legacy boxes

Richard Sutton: The Fundamental Problem with LLMs

Why Write Blog Posts?

DevRel Is -Unbelievably- Back

Why Nix Will Win (and What's Stopping It)

Jamie Dimon: Our Investments for National Security

Vite+