frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: A MitM proxy to see what your LLM tools are sending

https://github.com/jmuncor/sherlock
47•jmuncor•4h ago
I built this out of curiosity about what Claude Code was actually sending to the API. Turns out, watching your tokens tick up in real-time is oddly satisfying.

Sherlock sits between your LLM tools and the API, showing you every request with a live dashboard, and auto-saved copies of every prompt as markdown and json.

Comments

david_shaw•2h ago
Nice work! I'm sure the data gleaned here is illuminating for many users.

I'm surprised that there isn't a stronger demand for enterprise-wide tools like this. Yes, there are a few solutions, but when you contrast the new standard of "give everyone at the company agentic AI capabilities" with the prior paradigm of strong data governance (at least at larger orgs), it's a stark difference.

I think we're not far from the pendulum swinging back a bit. Not just because AI can't be used for everything, but because the governance on widespread AI use (without severely limiting what tools can actually do) is a difficult and ongoing problem.

LudwigNagasena•1h ago
I had to vibe code a proxy to hide tokens from agents (https://github.com/vladimirkras/prxlocal) because I haven’t found any good solution either. I planned to add genai otel stuff that could be piped into some tool to view dialogues and tool calls and so on, but I haven’t found any good setup that doesn’t require lots of manual coding yet. It’s really weird that there are no solutions in that space.
EMM_386•2h ago
This is great.

When I work with AI on large, tricky code bases I try to do a collaboration where it hands off things to me that may result in large number of tokens (excess tool calls, unprecise searches, verbose output, reading large files without a range specified, etc.).

This will help narrow down exactly which to still handle manually to best keep within token budgets.

Note: "yourusername" in install git clone instructions should be replaced.

kej•1h ago
Would you mind sharing more details about how you do this? What do you add to your AI prompts to make it hand those tasks off to you?
jmuncor•1h ago
Hahahah just fixed it, thank you so much!!!! Think of extending this to a prompt admin, Im sure there is a lot of trash that the system sends on every query, I think we can improve this.
mrbluecoat•2h ago
So is it just a wrapper around MitM Proxy?
guessmyname•1h ago
> So is it just a wrapper around MitM Proxy?

Yes.

I created something similar months ago [*] but using Envoy Proxy [1], mkcert [2], my own Go (golang) server, and Little Snitch [3]. It works quite well. I was the first person to notice that Codex CLI now sends telemetry to ab.chatgpt.com and other curiosities like that, but I never bothered to open-source my implementation because I know that anyone genuinely interested could easily replicate it in an afternoon with their favourite Agent CLI.

[1] https://www.envoyproxy.io/

[2] https://github.com/FiloSottile/mkcert

[3] https://www.obdev.at/products/littlesnitch/

[*] In reality, I created this something like 6 years ago, before LLMs were popular, originally as a way to inspect all outgoing HTTP(s) traffic from all the apps installed in my macOS system. Then, a few months ago, when I started using Codex CLI, I made some modifications to inspect Agent CLI calls too.

tkp-415•1h ago
Curious to see how you can get Gemini fully intercepted.

I've been intercepting its HTTP requests by running it inside a docker container with:

-e HTTP_PROXY=http://127.0.0.1:8080 -e HTTPS_PROXY=http://host.docker.internal:8080 -e NO_PROXY=localhost,127.0.0.1

It was working with mitmproxy for a very brief period, then the TLS handshake started failing and it kept requesting for re-authentication when proxied.

You can get the whole auth flow and initial conversation starters using Burp Suite and its certificate, but the Gemini chat responses fail in the CLI, which I understand is due to how Burp handles HTTP2 (you can see the valid responses inside Burp Suite).

jmuncor•1h ago
Tried with gemini and gave more headaches than anything else, would love if you can help me adding it to sherlock... I use claude and gemini, claude mainly for coding, so wanted to set it up first. With gemini, ran into the same problem that you did...
paulirish•25m ago
Gemini CLI is open source. Don't need to intercept at the network when you can just add inspectGeminiApiRequest() in the source. (I suggest it because I've been maintaining a personal branch with exactly that :)
jmuncor•1h ago
Kind of yes... But with a nice cli so that you don't have to set it up just run "sherlock claude" and "sherlock start" on two terminals and everything that claude sends in that session then it will be stored. So no proxy set up or anything, just simple terminal commands. :)
alickkk•1h ago
Nice work! Do i need to update Claude Code config after start this proxy service?
jmuncor•1h ago
Nope... You just run "sherlock claude" and that sets up the proxy for you. So you dont have to think about it... And just use claude normally, every prompt you send in that session will be stored in the files.
FEELmyAGI•1h ago
Dang how will Tailscale make any money on its latest vibe coded feature [0] when others can vibe code it themselves? I guess your SaaS really is someones weekend vibe prompt.

[0]https://news.ycombinator.com/item?id=46782091

3abiton•1h ago
That's what LLMs enabled. Faster prototyping. Also lots of exposed servers and apps. It's never been more fun to be a cyber security researcher.
jmuncor•51m ago
I think it just has been more fun being into computers overall!
pixl97•33m ago
It's interesting because if you're into computers it's more accessible than ever and there are more things you can mess with more cheaply than ever. I mean we have some real science fiction stuff going on. At the same time it's probably different for the newer generations. Computers were magical to me and a lot of that was because they were rare. Now they are everywhere, they are just a backdrop to everything else going on.
jmuncor•28m ago
I agree, I remember when the feed forward NN were the shit! And now the LLMs are owning, I think this adoption pattern will start pulling a lot of innovations on other computer science fields. Networking, for example. But the ability to have that peer programer next to you makes it so much more fun to build, when before you had to spend a whole day debugging something, Claude now just helps you out and gives you time to build. Feels like long roadtrips with cruise control and lane keeping assist!
andrewstuart•42m ago
What about SSL/certificates ?
Havoc•38m ago
You don't need to mess with certificates - you can point CC at a HTTP endpoint and it'll happily play along.

If you build a DIY proxy you can also mess with the prompt on the wire. Cut out portions of the system prompt etc. Or redirect it to a different endpoint based on specific conditions etc.

jmuncor•27m ago
Have you tried this with Gemini? or Codex?
elphard•6m ago
This is fantastic. Claude doesn't make it easy to inspect what it's sending - which would actually be really useful for refining the project-specific prompts.

Show HN: A MitM proxy to see what your LLM tools are sending

https://github.com/jmuncor/sherlock
47•jmuncor•4h ago•22 comments

Show HN: The HN Arcade

https://andrewgy8.github.io/hnarcade/
292•yuppiepuppie•12h ago•76 comments

Show HN: Cursor for Userscripts

https://github.com/chebykinn/browser-code
27•mifydev•4h ago•10 comments

Show HN: SHDL – A minimal hardware description language built from logic gates

https://github.com/rafa-rrayes/SHDL
25•rafa_rrayes•11h ago•10 comments

Show HN: Frame – Managing projects, tasks, and context for Claude Code

2•kozhan•39m ago•0 comments

Show HN: Pinecone Explorer – Desktop GUI for the Pinecone vector database

https://www.pinecone-explorer.com
7•arsentjev•22h ago•0 comments

Show HN: Dwm.tmux – a dwm-inspired window manager for tmux

https://github.com/saysjonathan/dwm.tmux
85•saysjonathan•4d ago•16 comments

Show HN: Lendy – Keep track of books you have lended

https://lendy.viraat.dev/
7•viraatdas•22h ago•3 comments

Show HN: I built a small browser engine from scratch in C++

https://github.com/beginner-jhj/mini_browser
117•crediblejhj•9h ago•38 comments

Show HN: Config manager for Claude Code (and others) – rules, MCPs, permissions

https://github.com/regression-io/coder-config
9•jtr101•7h ago•0 comments

Show HN: Sandbox Agent SDK – unified API for automating coding agents

https://github.com/rivet-dev/sandbox-agent
17•NathanFlurry•9h ago•0 comments

Show HN: WordRE, Wordle for Real Estate

https://reidsherman.com/wordre/
6•reidjs•18h ago•0 comments

Show HN: Cua-Bench – a benchmark for AI agents in GUI environments

https://github.com/trycua/cua
34•someguy101010•2d ago•6 comments

Show HN: Build Web Automations via Demonstration

https://www.notte.cc/launch-week-i/demonstrate-mode
27•ogandreakiro•1d ago•10 comments

Show HN: I'm building an AI-proof writing tool. How would you defeat it?

https://auth-auth.vercel.app/
7•callmeed•5h ago•8 comments

Show HN: Spar – Built a tool to help improve store conversion rates

https://spar.cuped.ai
2•6farer•2h ago•0 comments

Show HN: Extracting React apps from Figma Make's undocumented binary format

https://albertsikkema.com/ai/development/tools/reverse-engineering/2026/01/23/reverse-engineering...
50•albertsikkema•5d ago•13 comments

Show HN: Record and share your coding sessions with CodeMic

https://codemic.io/#
10•seansh•9h ago•2 comments

Show HN: LemonSlice – Upgrade your voice agents to real-time video

114•lcolucci•1d ago•123 comments

Show HN: pqry – A fast, lightweight CLI tool to diagnose Parquet datasets

https://github.com/symblic/pqry
4•setzeno•4h ago•0 comments

Show HN: One Human + One Agent = One Browser From Scratch in 20K LOC

https://emsh.cat/one-human-one-agent-one-browser/
305•embedding-shape•1d ago•146 comments

Show HN: Fuzzy Studio – Apply live effects to videos/camera

https://fuzzy.ulyssepence.com/
52•ulyssepence•1d ago•19 comments

Show HN: SharpAPI – Real-time sports odds API with +EV and arbitrage detection

https://sharpapi.io
3•MykLaz•4h ago•0 comments

Show HN: We Built the 1. EU-Sovereignty Audit for Websites

https://lightwaves.io/en/eu-audit/
101•cmkr•1d ago•78 comments

Show HN: I wrapped the Zorks with an LLM

https://infocom.tambo.co/
104•alecf•1d ago•57 comments

Show HN: Ghostly: The Ultimate Platform for Ghosting Candidates (Satire)

https://staticfile-25978.wasmer.app/
2•dw1014•5h ago•0 comments

Show HN: PNANA - A TUI Text Editor

https://github.com/Cyxuan0311/PNANA
7•Frameser•10h ago•7 comments

Show HN: A header-only C++20 compile-time assembler for x86/x64 instructions

https://github.com/mahmoudimus/static_asm
2•mahmoudimus•7h ago•0 comments

Show HN: Is this the perfect 404 page? [CSS only]

https://github.com/AntiKippi/errorpages
3•AntiKippi•8h ago•0 comments

Show HN: We built a type-safe Python ORM for RedisGraph/FalkorDB

5•hello-tmst•8h ago•3 comments