Show HN: Nebark – Simple A/B Testing for system prompts using steganography

https://app.nebark.com/
1•nicolasmery•2h ago
New project!

I just built the first version of Nebark, an A/B testing platform for LLM system prompts. It aims to solve a very specific pain point: tracking prompt performance without forcing developers to wire trace IDs all the way through their backend to their frontend.

The Problem

If you want to know which system prompt variant generates better user feedback (upvotes, downvotes, or copy-to-clipboard events), the standard approach is intrusive. You have to generate a trace ID in your backend, pass it down to your client, attach it to your UI components, and send it back to your analytics DB. It creates friction and litters your API responses with telemetry metadata.

The Solution: Context Hashing

We decoupled the telemetry entirely using what we call "Context Hashing" to bridge the backend and frontend asynchronously.

Here is how the architecture works:

The Proxy (Backend): You point your OpenAI baseURL to our gateway. We intercept the request, inject Variant A or B of your system prompt, and stream the response back. Once the stream closes, our proxy calculates a unique cryptographic hash based on the interaction's content and stores it as a blind trace.

The SDK (Frontend): A lightweight vanilla JS script watches the DOM. It smartly waits for the AI's response to finish streaming and rendering on the screen. It then extracts the visible text and calculates the exact same unique hash locally, without intercepting any network traffic.

The Match: The SDK injects the feedback UI (upvote/downvote). When a user clicks, the frontend sends this calculated hash and a local Session ID to our DB. We match this hash against the Proxy's traces to attribute the vote to the correct prompt variant.
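
To make the three steps concrete, here is a minimal sketch of the flow in Python. It is not Nebark's actual code: the names context_hash and TraceStore, the choice of SHA-256, and the whitespace-only normalization are all assumptions made for illustration, and the real SDK runs as JavaScript in the browser.

```python
import hashlib
import unicodedata
from typing import Dict, Optional


def context_hash(text: str) -> str:
    """Hash response text after a simple normalization (assumed rules)."""
    # Assumption: NFKC plus whitespace collapsing stands in for the
    # real normalization engine, whose rules are not published.
    normalized = " ".join(unicodedata.normalize("NFKC", text).split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class TraceStore:
    """Hypothetical in-memory stand-in for the proxy's blind-trace DB."""

    def __init__(self) -> None:
        self._variant_by_hash: Dict[str, str] = {}

    def record_trace(self, response_text: str, variant: str) -> str:
        # Proxy side: called once the upstream stream has closed.
        digest = context_hash(response_text)
        self._variant_by_hash[digest] = variant
        return digest

    def attribute_vote(self, digest: str) -> Optional[str]:
        # Match step: returns the variant, or None for an unknown hash.
        return self._variant_by_hash.get(digest)


store = TraceStore()
store.record_trace("Paris is the capital of France.", variant="B")

# Frontend side: the same text as rendered, with different whitespace.
rendered = "Paris is the   capital of France.\n"
print(store.attribute_vote(context_hash(rendered)))  # B
```

Both sides only agree if they hash byte-identical normalized text, which is why the normalization step carries most of the weight in this design.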

Why it’s interesting

Zero Backend Config: You only change the base URL. The backend remains completely unaware of the A/B test or the telemetry.

Semantic Caching Immunity: If your backend uses Redis to serve a cached response and skips our Proxy, the frontend will generate a Hash that doesn't exist in our DB. It naturally prevents skewed A/B data from cached hits.
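
A small sketch of how that filtering could look when tallying votes. The data and the tally function are hypothetical, and the example assumes the cached text was never seen by the proxy in the first place.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Tuple


def sha256_hex(text: str) -> str:
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


# Traces recorded by the proxy: hash -> prompt variant.
traces = {
    sha256_hex("Answer from variant A"): "A",
    sha256_hex("Answer from variant B"): "B",
}

# Votes sent by the frontend: (hash, score).
votes = [
    (sha256_hex("Answer from variant A"), +1),
    (sha256_hex("Answer from variant B"), -1),
    # Served from a cache, so the proxy never recorded this hash.
    (sha256_hex("Answer served from cache"), +1),
]


def tally(
    traces: Dict[str, str], votes: List[Tuple[str, int]]
) -> Tuple[Dict[str, int], int]:
    """Sum scores per variant and count votes with no matching trace."""
    totals: Counter = Counter()
    unmatched = 0
    for digest, score in votes:
        variant = traces.get(digest)
        if variant is None:
            unmatched += 1  # ignored instead of skewing the A/B data
            continue
        totals[variant] += score
    return dict(totals), unmatched


totals, unmatched = tally(traces, votes)
print(totals, unmatched)  # {'A': 1, 'B': -1} 1
```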

The Edge Cases (Where I need your feedback)

The biggest risk with DOM hashing is hydration/rendering discrepancies. If a client's frontend uses an aggressive Markdown parser that strips out specific characters before rendering the text, the frontend hash won't match the proxy hash. We built a strict internal normalization engine on both ends to mitigate this, but it is an ongoing challenge.
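
As an illustration of the kind of normalization this requires, here is a rough sketch. The rules below are assumptions chosen for the example, not the actual engine: the idea is to remove anything a Markdown renderer might consume, on both sides, before hashing.

```python
import hashlib
import re
import unicodedata


def normalize(text: str) -> str:
    """Reduce raw Markdown and rendered text to a common form."""
    text = unicodedata.normalize("NFKC", text)
    # Links: keep the label, drop the target. [label](url) -> label
    text = re.sub(r"\[([^\]]*)\]\([^)]*\)", r"\1", text)
    # Drop characters that common Markdown syntax uses as markup.
    text = re.sub(r"[*_`#>~]", "", text)
    # Collapse all runs of whitespace, including newlines.
    return " ".join(text.split())


def context_hash(text: str) -> str:
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()


# What the proxy saw in the stream (raw Markdown).
raw = "## Result\n\nUse **bold** and `code`, see [docs](https://example.com)."
# What the SDK might read from the DOM after rendering.
rendered = "Result\nUse bold and code, see docs."

print(context_hash(raw) == context_hash(rendered))  # True
```

The stripping is deliberately lossy: a literal underscore in snake_case is removed too, but it is removed on both ends, so the hashes still agree. List markers, tables, and HTML entities are not handled here and would need their own rules.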

I’d love to hear your thoughts on this architecture. Is there a glaring edge case with DOM extraction or SSE proxying that I’m missing? It's free for now. Tear it apart.

Show HN: The Sanguine Box – A 2026 vision for solo-produced comics

https://sanguinebox.com/comics/sanguine/
1•Balvarez•3m ago•0 comments

Low-Cost Microscope to Study Living Cells in Zero Gravity

https://www.biophysics.org/news-room/scientists-build-low-cost-microscope-to-study-living-cells-i...
1•geox•3m ago•0 comments

Show HN: Natural language semiconductor geometry generator powered by LLMs

https://github.com/rusrushal13/geoforge
1•rusrushal13•4m ago•0 comments

Whale Fall

https://nesbitt.io/2026/02/21/whale-fall.html
1•chmaynard•4m ago•0 comments

HN Client for iOS

https://github.com/dk-a-dev/harmonica
1•dev345•5m ago•0 comments

Free Will: A 3-Minute Impromptu Speech Challenge

https://randomtopicgenerator.net
1•czmilo•6m ago•0 comments

Show HN: Blindspot – a userscript to block tab-switch detection

https://github.com/gsekulski/blindspot
1•gsekulski•7m ago•0 comments

Is It All over for Filmmakers?

https://www.shokunin.studio/blog/2026/2/18/is-it-all-over-for-filmmakers
1•YeGoblynQueenne•7m ago•0 comments

The Human Root of Trust – public domain framework for agent accountability

2•3du4rd0v3g4•10m ago•0 comments

You can now play Prey (2006), with multiplayer, in an open source engine

https://bsky.app/profile/krispygoat.bsky.social/post/3mekahum6dc23
1•klaussilveira•10m ago•0 comments

I fell asleep on my solo AI agent project and woke up to it running my WhatsApp

https://github.com/fredabila/orcbot
2•abilafredkb•10m ago•1 comments

The CIA and MI6 got hold of Putin's Ukraine plans but nobody believed them

https://www.theguardian.com/world/ng-interactive/2026/feb/20/a-war-foretold-cia-mi6-putin-ukraine...
2•Betelbuddy•13m ago•0 comments

A 'Doom Loop' at the Heart of the Global Economy

https://www.bloomberg.com/news/articles/2026-02-20/how-the-us-china-rivalry-is-creating-an-econom...
1•xqcgrek2•14m ago•0 comments

Denmark Is Buying US Missiles to Defend Greenland from the United States

https://www.military.com/feature/2026/02/11/denmark-buying-us-missiles-defend-greenland-united-st...
2•Betelbuddy•15m ago•1 comments

America now imports more from Taiwan than from China

https://twitter.com/JosephPolitano/status/2024511090856493371
3•JumpCrisscross•15m ago•0 comments

Why Are Chinese EVs So Cheap?

https://rhg.com/research/why-are-chinese-evs-so-cheap/
1•JumpCrisscross•16m ago•0 comments

The Rise and Fall of Scala: A Love Letter to the Language That Broke My Heart

https://medium.com/@naveensky/the-language-that-made-me-feel-like-a-genius-until-it-didnt-my-brea...
1•signa11•16m ago•0 comments

Does Congress Have to Approve Trump's $10B Gaza Pledge?

https://www.military.com/daily-news/2026/02/20/does-congress-have-approve-trumps-10-billion-gaza-...
1•Betelbuddy•16m ago•0 comments

Meshcore Companion v1.13.0+BC – BitChat Bridge Support

https://github.com/eddieoz/MeshCore-BC/releases/tag/companion-v1.13.0%2Bbc
1•eddieoz•17m ago•0 comments

Show HN: 3mins.news – AI daily news briefing in 17 languages, designed to end

https://3mins.news/en
1•ethan_zhao•17m ago•1 comments

Unaffordable Housing Impacts How Americans Consume, Work and Invest

https://substack.com/redirect/bb979fba-ad5c-4d9b-9b89-4b30402446fd
1•JumpCrisscross•18m ago•0 comments

Nobel laureate invents machine that harvests water from dry air

https://www.theguardian.com/environment/2026/feb/21/nobel-laureate-omar-yaghi-invents-machine-tha...
2•schrototo•20m ago•0 comments

Should scientific publishing adapt to AI-authored research?

1•luvic•23m ago•0 comments

Show HN: A macOS toolbar app that resolves issues in your GitHub repos

https://www.insomnidev.com
1•offByOme•23m ago•0 comments

Ask HN: How to find a Sales cofounder for a B2B SaaS fintech compliance product?

1•swayamsiddhad•23m ago•0 comments

Hackers Expose Age-Verification Software Powering Surveillance Web

https://www.therage.co/persona-age-verification/
1•SanjayMehta•24m ago•0 comments

My Apple Watch-only app had no App Store analytics for 18 months

https://www.rshankar.com/watch-app-analytics-invisible/
1•RShankar31•26m ago•1 comments

Bare-metal LLM execution without the Python/Node runtime tax

https://www.ryiuk.pro/
1•ryiuk•28m ago•1 comments

Where are the most endangered languages in the world?

https://www.aljazeera.com/news/2026/2/21/where-are-the-most-endangered-languages-in-the-world
1•Tomte•29m ago•0 comments

eBPF on Hard Mode

https://feyor.sh/blog/ebpf-on-hard-mode/
1•todsacerdoti•29m ago•0 comments