Show HN: Built a free moderation API after failing to find one

https://the-profanity-api.com/
1•Olehype•1h ago
The problem with existing content moderation:

Every developer who's built profanity filtering knows the frustration. Traditional filters either flag "assistant" because it contains "ass," or miss obviously toxic content because they're just matching keywords. They can't tell the difference between "I'll destroy you in this game" (friendly trash talk) and an actual threat.
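The "assistant" false positive above comes from substring matching. A minimal sketch of the difference between that and whole-word matching, where the function names and the one-word banned list are illustrative and not taken from the API:

```python
import re

BANNED = {"ass"}  # illustrative one-word list, not the API's real dictionary


def naive_flag(text: str, banned: set[str]) -> bool:
    """Substring matching: flags any text that merely contains a banned string."""
    lowered = text.lower()
    return any(word in lowered for word in banned)


def whole_word_flag(text: str, banned: set[str]) -> bool:
    """Token matching: flags only when a banned word appears as its own word."""
    tokens = set(re.findall(r"[a-z']+", text.lower()))
    return any(word in tokens for word in banned)


print(naive_flag("Ask my assistant", BANNED))       # substring match fires
print(whole_word_flag("Ask my assistant", BANNED))  # whole-word match does not
```

Whole-word matching removes this class of false positive, but it is still keyword matching and cannot separate trash talk from a threat, which is the gap the later layers are meant to cover.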

How The Profanity API solves this:

I built a 5-layer detection pipeline that analyzes content progressively:

Blocklist (L0): Your custom blocklist for instant flagging

Static (L1): Exact match against known profanity

Dictionary (L2): Semantic word matching that catches meaning, not just spelling

Semantic (L3): Phrase-level understanding for context-aware detection

LLM (L4): Human-like judgment for edge cases

The key insight: layer disagreement

When a word gets flagged by keyword matching but scores low on semantic analysis, that disagreement signals potentially innocent usage. Instead of defaulting to "block," we escalate to LLM analysis. This is how "I'll kill this presentation" passes as safe while subtle threats still get flagged.
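The escalation rule can be sketched as a small decision function. This is an illustration of the idea only: the score range, the two thresholds, and the names `LayerResult` and `decide` are assumptions, not the service's actual internals.

```python
from dataclasses import dataclass


@dataclass
class LayerResult:
    keyword_hit: bool      # did the blocklist/static layers match?
    semantic_score: float  # assumed range: 0.0 (benign) to 1.0 (toxic)


def decide(result: LayerResult, low: float = 0.3, high: float = 0.7) -> str:
    """Return 'allow', 'flag', or 'escalate_to_llm' (thresholds are hypothetical)."""
    # Both layers agree the text is clean.
    if not result.keyword_hit and result.semantic_score < low:
        return "allow"
    # Both layers agree the text is toxic.
    if result.keyword_hit and result.semantic_score >= high:
        return "flag"
    # Any other combination is disagreement or uncertainty: ask the LLM layer.
    return "escalate_to_llm"


# A keyword hit ("kill") with a low semantic score is escalated, not blocked.
print(decide(LayerResult(keyword_hit=True, semantic_score=0.1)))
```

Under these assumptions the LLM is consulted only for the ambiguous middle, which matches the description of smart mode running the LLM only when layers disagree.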

5 analysis modes to match your needs:

instant - Blocklist + exact matching only, sub-5ms for high-volume chat

fast - Catches obfuscation attempts (f*ck, a$$hole)

balanced - Semantic + context awareness (default)

strict - Full pipeline with LLM, maximum accuracy

smart - Adaptive routing that only uses LLM when needed
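For the obfuscation cases that fast mode mentions (f*ck, a$$hole), one common approach is to normalize look-alike symbols and treat `*` as a wildcard. The sketch below assumes that approach; the substitution table and the `matches_obfuscated` helper are hypothetical and may differ from what the API does.

```python
# Hypothetical look-alike table; a real one would be much larger.
SUBSTITUTIONS = str.maketrans({"$": "s", "@": "a", "0": "o", "1": "i", "3": "e"})


def matches_obfuscated(token: str, known_word: str) -> bool:
    """Check whether a token is a disguised spelling of a known word."""
    normalized = token.lower().translate(SUBSTITUTIONS)
    if len(normalized) != len(known_word):
        return False
    # Refuse tokens that are mostly asterisks, which would match any word.
    if normalized.count("*") * 2 > len(normalized):
        return False
    # '*' stands in for any single character; everything else must match.
    return all(c == "*" or c == k for c, k in zip(normalized, known_word))


print(matches_obfuscated("a$$hole", "asshole"))
print(matches_obfuscated("f*ck", "fuck"))
```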

12 context types for environment-aware detection:

Different environments need different standards:

Lenient contexts (higher threshold, more forgiving):

gaming - "I'll destroy you" is friendly banter

creative - Fiction, roleplay, storytelling

educational - Academic discussion of sensitive topics

medical - Clinical terminology without false positives

legal - Court documents, policy discussions

Standard contexts:

chat - Real-time messaging

comment - Comment sections

review - Product/service reviews

bio - User profiles

Strict contexts (lower threshold, less forgiving):

professional - Workplace communications

child_safe - Zero tolerance for children's platforms

username - Display names (strictest on slurs)
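One way to read the three groups above is as a per-context score threshold. The context names below are the twelve listed; the numeric thresholds and the `is_flagged` helper are made up for illustration, since the real values are not stated.

```python
# Context names come from the list above; grouping follows the post.
CONTEXT_GROUPS = {
    "lenient": ["gaming", "creative", "educational", "medical", "legal"],
    "standard": ["chat", "comment", "review", "bio"],
    "strict": ["professional", "child_safe", "username"],
}

# Hypothetical thresholds: a higher value means more forgiving.
GROUP_THRESHOLDS = {"lenient": 0.8, "standard": 0.6, "strict": 0.4}

THRESHOLD_BY_CONTEXT = {
    context: GROUP_THRESHOLDS[group]
    for group, contexts in CONTEXT_GROUPS.items()
    for context in contexts
}


def is_flagged(score: float, context: str = "chat") -> bool:
    """Flag when the toxicity score reaches the context's threshold."""
    return score >= THRESHOLD_BY_CONTEXT[context]


# The same score can pass in gaming and be flagged in chat.
print(is_flagged(0.7, "gaming"), is_flagged(0.7, "chat"))
```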

12 intent categories for nuanced decisions:

We don't just say "profane or not." We classify what the person is actually doing:

safe - Clean content

joking - Friendly banter, humor, gaming trash talk

venting_situational - Frustration at things, not people ("fuck this traffic!")

venting_targeted - Frustration directed at someone

passive_aggressive - Indirect hostility, backhanded compliments

discriminatory - Bias against groups (race, gender, religion, etc.)

abusive - Direct harassment, slurs, insults

threatening - Explicit or veiled threats

sexual - Sexual content, innuendo, unwanted advances

quoting - Referencing others' words or media

educational - Academic discussion of sensitive topics

professional - Medical, legal, or technical language in context
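On the client side, the twelve intents lend themselves to an enum plus a policy table. The intent values are the ones listed above; the allow/review/block policy and the `action_for` helper are an example of how a consumer might act on them, not something the API prescribes.

```python
from enum import Enum


class Intent(str, Enum):
    SAFE = "safe"
    JOKING = "joking"
    VENTING_SITUATIONAL = "venting_situational"
    VENTING_TARGETED = "venting_targeted"
    PASSIVE_AGGRESSIVE = "passive_aggressive"
    DISCRIMINATORY = "discriminatory"
    ABUSIVE = "abusive"
    THREATENING = "threatening"
    SEXUAL = "sexual"
    QUOTING = "quoting"
    EDUCATIONAL = "educational"
    PROFESSIONAL = "professional"


# Hypothetical moderation policy; each platform would choose its own.
BLOCK = {Intent.DISCRIMINATORY, Intent.ABUSIVE, Intent.THREATENING}
REVIEW = {Intent.VENTING_TARGETED, Intent.PASSIVE_AGGRESSIVE, Intent.SEXUAL}


def action_for(intent: Intent) -> str:
    """Map a classified intent to 'block', 'review', or 'allow'."""
    if intent in BLOCK:
        return "block"
    if intent in REVIEW:
        return "review"
    return "allow"


print(action_for(Intent("threatening")), action_for(Intent.JOKING))
```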

Built for developers:

RESTful API with comprehensive docs

Custom allowlist for domain-specific terms (e.g., "weed" in gardening forums)

Custom blocklist for terms you always want flagged

Optional contextDescription for LLM (like "Medical triage log from ER")

includeLayersDetails flag for debugging and transparency

Smart pricing: pay the base rate for standard calls, with an LLM surcharge only when the LLM is actually used

Key Features:

5-layer detection pipeline: blocklist → static → dictionary → semantic → LLM

12 context types and 12 intent categories

Smart mode only runs LLM when layers disagree

Custom allowlist/blocklist per request

Detailed layer-by-layer breakdown with match positions

Sub-15ms response time in instant mode

Requests accept between 3 and 1,250 characters
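Since each request is limited to 3-1250 characters, a client can check length before sending. This sketch assumes the limit counts Unicode characters rather than bytes, which is not specified; `fits_limit` is a hypothetical helper.

```python
MIN_CHARS = 3
MAX_CHARS = 1250


def fits_limit(text: str) -> bool:
    """Return True when the text length is within the stated request limit.

    Assumes the limit is measured in characters (len), not encoded bytes.
    """
    return MIN_CHARS <= len(text) <= MAX_CHARS


print(fits_limit("hi"), fits_limit("hey"), fits_limit("x" * 1251))
```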
