frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Reverse engineering Claude Code

https://kirshatrov.com/posts/claude-code-internals
95•gianpaj•8h ago

Comments

therein•6h ago
It is an interesting read. I can imagine a future where the "tools" we make available become numerous enough and poorly thought out enough that an AI could actually figure out how to escalate privileges and execute stuff outside the defined security boundaries by combining them.

It isn't hard to think of a simple example in which Claude.md can be written to by the LLM to allow accessing endpoints not whitelisted by the user by smuggling a base64 encoded payload that then gets decoded by a subroutine it wrote to a file without you noticing. Or realizing it can't use the WebFetchTool but it can write a script to do manual DNS resolution and then use bash TCP sockets instead of curl in case it is hardened to not be able to use curl.

lobochrome•5h ago
I see this behavior all the time. When it can’t read a file using its read tool - it escalates up to try with bash. Often it tries to search the entire file system “find / …”
0x696C6961•3h ago
I always tell agents to use ripgrep instead of find.
throwaway0665•4h ago
Cursor has basically run into this exact thing. It figured out it can read .env files by running other tools despite the file being "blocked": https://github.com/getcursor/cursor/issues/2546
rtrgrd•3h ago
Quite concerning to see the issue still marked as open (since jan!), hopefully it got fixed and it's just that no one marked as closed
swalsh•3h ago
I ran into this issue, I built my own bash and SSH MCP server. In my first iteration I did not quite trust Claude yet so I limited the commands it was allowed to run in Bash. But I gave it access to Python, so any time it ran into a limitation it ended up using python to work around it. It's exceedingly good at problem solving.

I Eventually learned to trust Claude, and just gave it access to everything. It's crazy how useful having AI do tasks for you like setting up servers, configuring them etc (one exapmple, I asked claude to create a webhook for my deployment pipeline, and it wrote the shell script, and did the server side configuration in 1-shot. I did't have a github tool so I did that manually in the UI)

manwithaplan•4h ago
XKCD 416: Zealous Autoconfig https://xkcd.com/416/
mattigames•4h ago
It's missing one last panel where he is under his bed googling for lawyers specialized on kidnapping and CFAA charges
rmonvfer•6h ago
The source code for a pre-release version got leaked a while ago (they forgot to remove the embedded source map) and if you can find it, it’s definitely worth looking into.
tough•2h ago
did u manage to find it now?
acheong08•24m ago
I still have it on my laptop. The repository got DMCAed
pram•4h ago
An interesting thing about the “agent” (it’s called Task inside Claude Code) is it starts a completely new Claude chat, with its own context etc. I’ve seen a Task go write its own code in multiple files and then your “main” chat ends up confused about what happened.

It also responds to the Task summary like you typed the message sometimes, like “That’s a fascinating analysis!” so kind of quirky.

cloudking•3h ago
Claude Code seems a lot more stable than Cursor Agent. I've had it run for 15-20 minutes on a single prompt, debugging, testing and fixing bugs. Also haven't seen network timeout or file edit failures.
mudkipdev•3h ago
11 cents to describe the project in the current directory is ridiculous.
laegooose•3h ago
ridiculously low?
alexchamberlain•1h ago
I think it depends on the project. I think most of us could eye ball a blog directory pretty quickly and get more or less the same idea. However, give it a gnarly bit of legacy code in a language you haven't used for a while, and indeed, 11c is pretty cheap.
varunvs•3h ago
Claude Code has support for AWS Bedrock. You can use Sonnet models available in AWS Bedrock to run Claude Code locally. This means you can also leverage Bedrock logs to inspect the API calls and the prompts sent.

I was amazed and excited by how good Claude Code is compared to Windsurf/Cursor and wanted to inspect the working. I inspected the logs and got an understanding on its system prompt as well the tools used. It is a great combination of prompt engineering, tool calls, tools orchestration.

weird-eye-issue•2h ago
You don't need to overcomplicate things

Just set the base url to a local ngrok and it will log the request

This gives the full prompt with all tool calls

With a proxy you can have it forward requests to Anthropic if you want to see it actually working too and not just the initial request (since it will fail without forwarding them)

Gaussian Integration Is Cool

https://rohangautam.github.io/blog/chebyshev_gauss/
54•beansbeansbeans•6h ago•8 comments

Binfmtc – binfmt_misc C scripting interface

https://www.netfort.gr.jp/~dancer/software/binfmtc.html.en
18•todsacerdoti•2h ago•4 comments

Why not use DNS over HTTPS (DoH)?

https://www.bsdhowto.ch/doh.html
76•Bogdanp•6h ago•86 comments

The last six months in LLMs, illustrated by pelicans on bicycles

https://simonwillison.net/2025/Jun/6/six-months-in-llms/
303•swyx•7h ago•93 comments

<Blink> and <Marquee> (2020)

https://danq.me/2020/11/11/blink-and-marquee/
130•ghssds•10h ago•116 comments

Joining Apple Computer (2018)

https://www.folklore.org/Joining_Apple_Computer.html
319•tosh•18h ago•74 comments

Ask HN: How to learn CUDA to professional level

70•upmind•3h ago•28 comments

Bill Atkinson has died

https://daringfireball.net/linked/2025/06/07/bill-atkinson-rip
1411•romanhn•22h ago•245 comments

Self-Host and Tech Independence: The Joy of Building Your Own

https://www.ssp.sh/blog/self-host-self-independence/
305•articsputnik•20h ago•148 comments

Convert photos to Atkinson dithering

https://gazs.github.io/canvas-atkinson-dither/
367•nvahalik•18h ago•40 comments

Fray: A Controlled Concurrency Testing Framework for the JVM

https://github.com/cmu-pasta/fray
41•0x54MUR41•8h ago•2 comments

My experiment living in a tent in Hong Kong's jungle

https://corentin.trebaol.com/Blog/8.+The+Homelessness+Experiment
376•5mv2•22h ago•155 comments

A look at Cloudflare's AI-coded OAuth library

https://neilmadden.blog/2025/06/06/a-look-at-cloudflares-ai-coded-oauth-library/
184•itsadok•5h ago•97 comments

Focus and Context and LLMs

https://taras.glek.net/posts/focus-and-context-and-llms/
16•tarasglek•5h ago•3 comments

Coventry Very Light Rail

https://www.coventry.gov.uk/coventry-light-rail
126•Kaibeezy•17h ago•178 comments

Knowledge Management in the Age of AI

https://ericgardner.info/notes/knowledge-management-june-2025
46•katabasis•10h ago•28 comments

BorgBackup 2 has no server-side append-only anymore

https://github.com/borgbackup/borg/pull/8798
157•jaegerma•20h ago•95 comments

Researchers develop ‘transparent paper’ as alternative to plastics

https://japannews.yomiuri.co.jp/science-nature/technology/20250605-259501/
417•anigbrowl•1d ago•259 comments

What was Radiant AI, anyway?

https://blog.paavo.me/radiant-ai/
193•paavohtl•1d ago•108 comments

Field Notes from Shipping Real Code with Claude

https://diwank.space/field-notes-from-shipping-real-code-with-claude
143•diwank•20h ago•44 comments

Low-Level Optimization with Zig

https://alloc.dev/2025/06/07/zig_optimization
270•Retro_Dev•1d ago•163 comments

Why We're Moving on from Nix

https://blog.railway.com/p/introducing-railpack
242•mooreds•1d ago•108 comments

Getting Past Procrastination

https://spectrum.ieee.org/getting-past-procastination
375•WaitWaitWha•1d ago•156 comments

How we decreased GitLab repo backup times from 48 hours to 41 minutes

https://about.gitlab.com/blog/2025/06/05/how-we-decreased-gitlab-repo-backup-times-from-48-hours-to-41-minutes/
551•immortaljoe•1d ago•227 comments

Why Understanding Software Cycle Time Is Messy, Not Magic

https://arxiv.org/abs/2503.05040
51•SiempreViernes•17h ago•12 comments

A tool for burning visible pictures on a compact disc surface (2022)

https://github.com/arduinocelentano/cdimage
172•carlesfe•1d ago•50 comments

Discovering a JDK Race Condition, and Debugging It in 30 Minutes with Fray

https://aoli.al/blogs/jdk-bug/
117•aoli-al•19h ago•22 comments

FAA to eliminate floppy disks used in air traffic control systems

https://www.tomshardware.com/pc-components/storage/the-faa-seeks-to-eliminate-floppy-disk-usage-in-air-traffic-control-systems
54•daledavies•7h ago•39 comments

Washington Post's Privacy Tip: Stop Using Chrome, Delete Meta Apps (and Yandex)

https://tech.slashdot.org/story/25/06/07/035249/washington-posts-privacy-tip-stop-using-chrome-delete-metas-apps-and-yandex
411•miles•22h ago•251 comments

A year of funded FreeBSD development

https://www.daemonology.net/blog/2025-06-06-A-year-of-funded-FreeBSD.html
363•cperciva•1d ago•121 comments