frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: A MitM proxy to see what your LLM tools are sending

https://github.com/jmuncor/sherlock
38•jmuncor•4h ago
I built this out of curiosity about what Claude Code was actually sending to the API. Turns out, watching your tokens tick up in real-time is oddly satisfying.

Sherlock sits between your LLM tools and the API, showing you every request with a live dashboard, and auto-saved copies of every prompt as markdown and json.

Comments

david_shaw•2h ago
Nice work! I'm sure the data gleaned here is illuminating for many users.

I'm surprised that there isn't a stronger demand for enterprise-wide tools like this. Yes, there are a few solutions, but when you contrast the new standard of "give everyone at the company agentic AI capabilities" with the prior paradigm of strong data governance (at least at larger orgs), it's a stark difference.

I think we're not far from the pendulum swinging back a bit. Not just because AI can't be used for everything, but because the governance on widespread AI use (without severely limiting what tools can actually do) is a difficult and ongoing problem.

LudwigNagasena•57m ago
I had to vibe code a proxy to hide tokens from agents (https://github.com/vladimirkras/prxlocal) because I haven’t found any good solution either. I planned to add genai otel stuff that could be piped into some tool to view dialogues and tool calls and so on, but I haven’t found any good setup that doesn’t require lots of manual coding yet. It’s really weird that there are no solutions in that space.
EMM_386•1h ago
This is great.

When I work with AI on large, tricky code bases I try to do a collaboration where it hands off things to me that may result in large number of tokens (excess tool calls, unprecise searches, verbose output, reading large files without a range specified, etc.).

This will help narrow down exactly which to still handle manually to best keep within token budgets.

Note: "yourusername" in install git clone instructions should be replaced.

kej•1h ago
Would you mind sharing more details about how you do this? What do you add to your AI prompts to make it hand those tasks off to you?
jmuncor•57m ago
Hahahah just fixed it, thank you so much!!!! Think of extending this to a prompt admin, Im sure there is a lot of trash that the system sends on every query, I think we can improve this.
mrbluecoat•1h ago
So is it just a wrapper around MitM Proxy?
guessmyname•1h ago
> So is it just a wrapper around MitM Proxy?

Yes.

I created something similar months ago [*] but using Envoy Proxy [1], mkcert [2], my own Go (golang) server, and Little Snitch [3]. It works quite well. I was the first person to notice that Codex CLI now sends telemetry to ab.chatgpt.com and other curiosities like that, but I never bothered to open-source my implementation because I know that anyone genuinely interested could easily replicate it in an afternoon with their favourite Agent CLI.

[1] https://www.envoyproxy.io/

[2] https://github.com/FiloSottile/mkcert

[3] https://www.obdev.at/products/littlesnitch/

[*] In reality, I created this something like 6 years ago, before LLMs were popular, originally as a way to inspect all outgoing HTTP(s) traffic from all the apps installed in my macOS system. Then, a few months ago, when I started using Codex CLI, I made some modifications to inspect Agent CLI calls too.

tkp-415•1h ago
Curious to see how you can get Gemini fully intercepted.

I've been intercepting its HTTP requests by running it inside a docker container with:

-e HTTP_PROXY=http://127.0.0.1:8080 -e HTTPS_PROXY=http://host.docker.internal:8080 -e NO_PROXY=localhost,127.0.0.1

It was working with mitmproxy for a very brief period, then the TLS handshake started failing and it kept requesting for re-authentication when proxied.

You can get the whole auth flow and initial conversation starters using Burp Suite and its certificate, but the Gemini chat responses fail in the CLI, which I understand is due to how Burp handles HTTP2 (you can see the valid responses inside Burp Suite).

jmuncor•55m ago
Tried with gemini and gave more headaches than anything else, would love if you can help me adding it to sherlock... I use claude and gemini, claude mainly for coding, so wanted to set it up first. With gemini, ran into the same problem that you did...
paulirish•7m ago
Gemini CLI is open source. Don't need to intercept at the network when you can just add inspectGeminiApiRequest() in the source. (I suggest it because I've been maintaining a personal branch with exactly that :)
jmuncor•1h ago
Kind of yes... But with a nice cli so that you don't have to set it up just run "sherlock claude" and "sherlock start" on two terminals and everything that claude sends in that session then it will be stored. So no proxy set up or anything, just simple terminal commands. :)
alickkk•1h ago
Nice work! Do i need to update Claude Code config after start this proxy service?
jmuncor•1h ago
Nope... You just run "sherlock claude" and that sets up the proxy for you. So you dont have to think about it... And just use claude normally, every prompt you send in that session will be stored in the files.
FEELmyAGI•1h ago
Dang how will Tailscale make any money on its latest vibe coded feature [0] when others can vibe code it themselves? I guess your SaaS really is someones weekend vibe prompt.

[0]https://news.ycombinator.com/item?id=46782091

3abiton•48m ago
That's what LLMs enabled. Faster prototyping. Also lots of exposed servers and apps. It's never been more fun to be a cyber security researcher.
jmuncor•33m ago
I think it just has been more fun being into computers overall!
pixl97•15m ago
It's interesting because if you're into computers it's more accessible than ever and there are more things you can mess with more cheaply than ever. I mean we have some real science fiction stuff going on. At the same time it's probably different for the newer generations. Computers were magical to me and a lot of that was because they were rare. Now they are everywhere, they are just a backdrop to everything else going on.
jmuncor•10m ago
I agree, I remember when the feed forward NN were the shit! And now the LLMs are owning, I think this adoption pattern will start pulling a lot of innovations on other computer science fields. Networking, for example. But the ability to have that peer programer next to you makes it so much more fun to build, when before you had to spend a whole day debugging something, Claude now just helps you out and gives you time to build. Feels like long roadtrips with cruise control and lane keeping assist!
andrewstuart•24m ago
What about SSL/certificates ?
Havoc•20m ago
You don't need to mess with certificates - you can point CC at a HTTP endpoint and it'll happily play along.

If you build a DIY proxy you can also mess with the prompt on the wire. Cut out portions of the system prompt etc. Or redirect it to a different endpoint based on specific conditions etc.

jmuncor•10m ago
Have you tried this with Gemini? or Codex?

Somebody used spoofed ADSB signals to raster the meme of JD Vance

https://alecmuffett.com/article/143548
159•wubin•1h ago•50 comments

Trinity large: An open 400B sparse MoE model

https://www.arcee.ai/blog/trinity-large
58•linolevan•22h ago•19 comments

Did a celebrated researcher obscure a baby's poisoning?

https://www.newyorker.com/magazine/2026/02/02/did-a-celebrated-researcher-obscure-a-fatal-poisoning
54•littlexsparkee•23h ago•4 comments

Airfoil (2024)

https://ciechanow.ski/airfoil/
323•brk•8h ago•47 comments

Mousefood – Build embedded terminal UIs for microcontrollers

https://github.com/ratatui/mousefood
132•orhunp_•6h ago•38 comments

Oban, the job processing framework from Elixir, has come to Python

https://www.dimamik.com/posts/oban_py/
159•dimamik•6h ago•72 comments

Show HN: A MitM proxy to see what your LLM tools are sending

https://github.com/jmuncor/sherlock
39•jmuncor•4h ago•21 comments

Android's desktop interface leaks

https://9to5google.com/2026/01/27/android-desktop-leak/
127•thunderbong•19h ago•213 comments

Computer History Museum Launches Digital Portal to Its Collection

https://computerhistory.org/press-releases/computer-history-museum-launches-digital-portal-to-its...
77•ChrisArchitect•5h ago•17 comments

The Five Levels: From spicy autocomplete to the dark factory

https://www.danshapiro.com/blog/2026/01/the-five-levels-from-spicy-autocomplete-to-the-software-f...
34•benwerd•5d ago•30 comments

Apple to Soon Take Up to 30% Cut from All Patreon Creators in iOS App

https://www.macrumors.com/2026/01/28/patreon-apple-tax/
131•pier25•2h ago•89 comments

Hellenistic War-Elephants and the Use of Alcohol Before Battle

https://www.cambridge.org/core/journals/classical-quarterly/article/hellenistic-warelephants-and-...
9•perihelions•5d ago•5 comments

Jellyfin LLM/"AI" Development Policy

https://jellyfin.org/docs/general/contributing/llm-policies/
97•mmoogle•1h ago•48 comments

Bf-Tree: modern read-write-optimized concurrent larger-than-memory range index

https://github.com/microsoft/bf-tree
10•SchwKatze•1h ago•0 comments

Show HN: The HN Arcade

https://andrewgy8.github.io/hnarcade/
289•yuppiepuppie•12h ago•75 comments

In a genre where spoilers are devastating, how do we talk about puzzle games?

https://thinkygames.com/features/in-a-genre-where-information-is-sacred-and-spoilers-are-devastat...
21•tobr•5d ago•10 comments

How to turn 'sfo-jfk' into a suitable photo

https://www.approachwithalacrity.com/how-to-turn-sfo-jfk-into-a-beautiful-photo/
20•bblcla•4h ago•16 comments

Microsoft forced me to switch to Linux

https://www.himthe.dev/blog/microsoft-to-linux
1498•bobsterlobster•8h ago•1185 comments

Spinning around: Please don't – Common problems with spin locks

https://www.siliceum.com/en/blog/post/spinning-around/
63•bdash•6h ago•25 comments

Show HN: Cursor for Userscripts

https://github.com/chebykinn/browser-code
26•mifydev•3h ago•9 comments

Native Linux VST plugin directory

https://linuxmusic.rocks
70•Aldipower•3h ago•20 comments

I overengineered a spinning top [video]

https://www.youtube.com/watch?v=Wp5NodfvvF4
107•bane•5d ago•29 comments

Amazon cuts 16k jobs

https://www.reuters.com/legal/litigation/amazon-cuts-16000-jobs-globally-broader-restructuring-20...
445•DGAP•7h ago•610 comments

3D-Printed Mathematical Lampshades

https://hessammehr.github.io/blog/posts/2025-12-24-maths-to-lampshade.html
43•hessammehr•4d ago•17 comments

Kairos: AI interns for everyone

https://www.kairos.computer/
23•bamitsmanas•15h ago•13 comments

Amazon One palm authentication discontinued

https://amazonone.aws.com/help
51•KerryJones•6h ago•111 comments

I have written gemma3 inference in pure C

https://github.com/robitec97/gemma3.c
46•robitec97•2d ago•17 comments

Some notes on starting to use Django

https://jvns.ca/blog/2026/01/27/some-notes-on-starting-to-use-django/
179•ingve•1d ago•102 comments

Kyber (YC W23) Is Hiring a Staff Engineer

https://www.ycombinator.com/companies/kyber/jobs/GPJkv5v-staff-engineer-tech-lead
1•asontha•11h ago

Show HN: SHDL – A minimal hardware description language built from logic gates

https://github.com/rafa-rrayes/SHDL
23•rafa_rrayes•11h ago•10 comments