frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Australian high schoolers build coding platform to help learners in Sri Lanka

https://www.abc.net.au/news/2026-01-21/qld-teenagers-create-coding-app-code-lab-sri-lanka-student...
1•thadusTeam•2m ago•0 comments

Not Trust AI with Numbers: A Confession

https://chungmoo.substack.com/p/why-you-should-not-trust-ai-with
1•chungmoo•9m ago•0 comments

Show HN: Cai – CLI tool for AI tasks (Rust)

https://github.com/ad-si/cai
1•adius•10m ago•0 comments

Britain's Ministry of Defence signs on the dotted line with Palantir

https://www.theregister.com/2026/01/28/mod_palantir_deal/
3•jjgreen•11m ago•0 comments

Free API to check if email is temp or disposable

https://disposablecheck.irensaltali.com/
1•tntpreneur•11m ago•1 comments

Unofficial Friday Beer Event

http://unofficial-fosdem-beer-event.org/
1•pantalaimon•12m ago•0 comments

Scientist who helped eradicate smallpox dies at age 89

https://www.scientificamerican.com/article/smallpox-eradication-champion-william-foege-dies-at-89/
3•CrossVR•13m ago•1 comments

An app to translate other apps with LLMs

https://apps.apple.com/us/app/xclocalize-ai-translator/id6757640018?mt=12
1•itruf•17m ago•0 comments

How-Dirty-Marketing-Works

https://how-dirty-marketing-works.onrender.com/
1•yashsm01•20m ago•0 comments

Why AI Swarms Cannot Build Architecture

https://jsulmont.github.io/swarms-ai/
2•pimeys•23m ago•0 comments

Who You Gotta Watch Out For

https://medium.com/luminasticity/who-you-gotta-watch-out-for-eaa6e4c934f4
2•bryanrasmussen•25m ago•0 comments

Scientists engineer unsinkable metal tubes

https://www.rochester.edu/newscenter/unsinkable-metal-tubes-superhydrophobic-surfaces-691642/
3•JeanKage•27m ago•0 comments

Weather APIs for Your Business in 2026

https://www.meteosource.com/blog/best-weather-apis
3•Sikara•29m ago•0 comments

Lawsuit Claims Meta Can See WhatsApp Chats in Breach of Privacy

https://www.bloomberg.com/news/articles/2026-01-25/lawsuit-claims-meta-can-see-whatsapp-chats-in-...
3•t0bia_s•29m ago•0 comments

China H200 First Import Shipment

https://www.reuters.com/world/china/china-gives-green-light-importing-first-batch-nvidias-h200-ai...
2•haebom•30m ago•0 comments

There's a rash of scam spam coming from a real Microsoft address

https://arstechnica.com/information-technology/2026/01/theres-a-rash-of-scam-spam-coming-from-a-r...
5•oldnetguy•30m ago•0 comments

Velocity: Fast and Beautifully Furious

https://ilovetypography.com/2026/01/28/fast-beautifully-furious/
2•jjgreen•33m ago•0 comments

Crash Clock Measures Dangerous Overcrowding in Low Earth Orbit

https://spectrum.ieee.org/kessler-syndrome-crash-clock
2•oldnetguy•33m ago•0 comments

Taming Claude Code

https://thisalex.com/posts/claude-taming/
2•todsacerdoti•37m ago•0 comments

Time Machine inside a FreeBSD jail

https://it-notes.dragas.net/2026/01/28/time-machine-freebsd-jail/
2•todsacerdoti•38m ago•0 comments

I Built CodeWeave

https://copilot.codeweave.co/
1•CodeWeave•41m ago•1 comments

Amazon reveals fresh round of global job cuts in email sent in error to workers

https://www.theguardian.com/technology/2026/jan/28/amazon-global-job-cuts-email-error-workers-sent
4•robaato•42m ago•0 comments

Serverless backend hosting without idle costs – open-source

https://github.com/aryankashyap0/shorlabs
14•abyssglass01•43m ago•0 comments

Learjet45XR Crashes Outside Baramati, MH, India

https://en.wikipedia.org/wiki/2026_Baramati_Learjet_45_crash
1•samarthr1•43m ago•0 comments

Stare – 1000 real PM case studies with AI feedback to crack product interviews

https://thestare.in/
2•Saurao•43m ago•1 comments

Z-Image

https://twitter.com/Ali_TongyiLab/status/2016186674531758285
1•tosh•45m ago•0 comments

Distribution Is the New Moat

1•Fh_•46m ago•0 comments

Agoda-com/API-agent: Universal MCP server for GraphQL/REST APIs

https://github.com/agoda-com/api-agent
2•bpedro•47m ago•0 comments

Oracle says data center outage causing issues faced by US TikTok users

https://www.reuters.com/business/energy/oracle-says-outage-data-center-causes-issues-faced-by-us-...
3•tosh•47m ago•0 comments

The Browser Is the Sandbox

https://aifoc.us/the-browser-is-the-sandbox/
1•nkko•47m ago•0 comments
Open in hackernews

PromptForest: Fast Ensemble Detection of Malicious Prompts for LLMs

https://github.com/appleroll-research/promptforest
1•appleroll•1h ago

Comments

appleroll•1h ago
PromptForest — a fast, ensemble-based prompt injection detector for real-world AI safety

Prompt injection is an adversarial attack in LLM systems: malicious inputs that manipulate model behavior by slipping in hidden instructions. As AI usage grows in products, pipelines, and public APIs, detecting and mitigating these injections becomes a practical production problem.

PromptForest is an open-source ensemble detector that emphasizes speed, uncertainty awareness, and reliability without relying on massive models.

How it works - Runs multiple lightweight prompt-injection detectors in parallel. - Uses a voting/discrepancy mechanism to flag risky prompts. - Generates uncertainty scores: disagreement between models can trigger human review or stricter handling. - Small ensemble → faster inference (~100 ms per request) and lower resource usage. - Better-calibrated confidence estimates reduce overconfident mistakes compared to some existing detectors.

Why it matters

Prompt injection can leak private prompts or subvert agent workflows. Most current defenses rely on large classifiers or hard-coded heuristics:

- Big models are slow and expensive at scale. - Single detectors can be overconfident on edge cases. - Zero-risk doesn’t exist, but better calibration helps trigger sensible defenses.

PromptForest aims to be practical, open, and easy to run without a massive GPU footprint.

Technical Highlights

- Ensemble with voting/discrepancy scoring for ambiguous cases. - Supports multiple detection backends (e.g., LLaMA prompt guard variants). - Python-first with CLI and server mode for easy integration. - Optimized for latency and confidence calibration.

Who is this for

- Developers integrating LLMs in user-generated content pipelines - AI researchers focused on adversarial safety - Infrastructure teams needing fast, explainable detection - Community contributors who prefer open source tools over black boxes

Repo: https://github.com/appleroll-research/promptforest Try it out here: https://colab.research.google.com/drive/1EW49Qx1ZlaAYchqplDI...

Feedback is welcome, especially on integration patterns, benchmarks, or potential improvements.