I have been running a bunch of stuff in there with a custom environment that allows "*"
Sandboxing the agent hardly seems like a sufficient defense here.
From there it splits out each phase into three parts: implementation, code review, and iteration.
After each part, I do a code review and iteration.
If asked, the proposal is broken down into small, logical chunks, so code review is pretty quick and it can only stray so far off track.
I treat it like a strong mid-level engineer who is learning to ship iteratively.
Codex is pretty good at finding complex bugs in the code, but Claude is better at getting stuff working.
You merely watched the tools do the work.
Compiled output can change between versions, heck, can even change during runtime (JIT compilation).
If you're barely doing anything, neither of these things can possibly be true, even with current technology.
Not even a scrap of self-preservation?
I don’t see my customers being able to one-shot their way to the full package of what I provide them anytime soon either. And as they gain that capability, I gain the ability to deliver even more value to them, faster.
If automation reduces the labor needed by capital, I don’t think automation is the cause of your inability to feed and house yourself. That’s a social and political issue.
Edit: I have competitors already cloning them with CC regularly, and they spend more than 24h dedicated to it too
If the capability does arrive, that’s why I’m using what I can today to get a bag before it’s too late.
I can’t stop development of automation. But I can help workers organize, that’s more practical.
What if they are, or worse? Are you prepared for that?
If you point me towards your products, someone can try to replicate them in 24 hours. Sound good?
Edit: I found it, but your website is broken on mobile. Needs work before it's ready to be put into the replication machine. If you'd like I can do this for you for a small fee at my consulting rate (wink emoji).
All the more reason to not hand-code it in a week.
I reckon something like Qubes could work fairly well.
Create a new Qube and have control over network connectivity, and do everything there, at the end copy the work out and destroy it.
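That lifecycle is a handful of dom0 commands on Qubes. A rough sketch (runs only from dom0; the qube name `scratch`, the fedora template, and `run-agent.sh` are assumptions for illustration):

```shell
# Create a throwaway AppVM (template name is an assumption)
qvm-create --class AppVM --template fedora-40 --label red scratch

# Optionally cut networking entirely before doing anything risky
qvm-prefs scratch netvm ''

# Do the work inside the qube (run-agent.sh is hypothetical)
qvm-run --pass-io scratch 'bash -lc "cd /home/user && ./run-agent.sh"'

# Stream the results out over stdin/stdout, then destroy the qube
qvm-run --pass-io scratch 'tar -C /home/user -cf - work' > work.tar
qvm-remove -f scratch
```

Because the qube is destroyed at the end, anything the agent left behind goes with it.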
When I’m satisfied with the spec, I turn on “allow all edits” mode and just come back later to review the diff at the end.
I find this works a lot better than hoping I can one shot my original prompt or having to babysit the implementation the whole way.
These days I often use https://gitingest.com - it can grab any full repo on GitHub and turn it into something you can copy and paste, e.g. https://gitingest.com/simonw/llm
[client]
root = "~/repo/client"
include = [
  "src/**/*.ts",
  "src/**/*.vue",
  "package.json",
  "tsconfig*.json",
  "*.ts",
]
exclude = [
  "src/types/*",
  "src/scss/*",
]
output = "bundle-client.txt"
$ bundle -p client
What do you do when you repeatedly need to bundle the same thing? Bash history?

A massive productivity boost I get is using it to do server maintenance.
"Using gcloud compute ssh, log into all gh runners and run docker system prune, in parallel for speed, and give me a summary report of the disk usage after."
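That prompt boils down to a fan-out-and-wait pattern. A minimal sketch of the shape, where `run_on_host` is a hypothetical stand-in for the real `gcloud compute ssh "$host" --command 'docker system prune -af; df -h /'` call:

```shell
#!/bin/sh
# Stand-in for the real per-runner call, e.g.:
#   gcloud compute ssh "$1" --zone "$ZONE" --command 'docker system prune -af; df -h /'
run_on_host() {
  echo "pruned $1"
}

for host in runner-1 runner-2 runner-3; do
  run_on_host "$host" > "/tmp/prune-$host.log" 2>&1 &   # one background job per runner
done
wait                      # block until every job finishes
cat /tmp/prune-*.log      # crude per-runner "summary report"
```

Swapping the stub for the actual `gcloud` invocation gives you the parallel prune; the agent just writes this loop for you.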
This is an undocumented and underused feature of basic agentic abilities. It doesn't have to JUST write code.
AI can still be helpful here if you're new to scheduling a simple shell command, but I'd be asking the AI how to automate the task away, not manually asking it to do the thing every time, or setting up my runners in a way that means I never have to think about scheduled prune calls at all.
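The "automate it away" version is a one-line cron entry on each runner, something like this (the schedule, log path, and `until=24h` filter are assumptions, not a recommendation):

```
# Nightly at 03:00: prune unused Docker data not touched in the last 24h
0 3 * * * /usr/bin/docker system prune -af --filter "until=24h" >> /var/log/docker-prune.log 2>&1
```

After that, the only remaining question is whether you still want the summary report.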
AI said “I got this” :)
I barely understand what I just said, and I’m sure it would have taken me a whole day to track this down myself.
Obviously I did NOT turn on auto-approve for the aws command during this process! But now I’m making a restricted role for CC to use in this situation, because I feel like I’ll certainly be doing something like this again. It’s like the AWS Q button, except it actually works.
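A restricted role along those lines might start from a read-only IAM policy like this sketch (the specific actions are examples; the right set depends on what CC actually needs to inspect):

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "ReadOnlyDiagnostics",
      "Effect": "Allow",
      "Action": [
        "ec2:Describe*",
        "cloudwatch:GetMetricData",
        "logs:FilterLogEvents"
      ],
      "Resource": "*"
    }
  ]
}
```

With no mutating actions allowed, auto-approving the aws command becomes a lot less scary.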
The reason they don't do that is because some popular and necessary apps use it. Like Chrome.
However, I tried this approach too and it's the wrong way to go IMHO, quite beyond the use of undocumented APIs. What you actually want to do is virtualize, not sandbox.
Setting up "permissions.allow" in `.claude/settings.local.json` takes minimal time. Claude even lets you configure this while approving code, and you can use wildcards like "Bash(timeout:*)". This is far safer than risking disasters like dropping a staging database or deleting all unstaged code, which Claude would have done last week if I had been running it in YOLO mode.
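For reference, a `.claude/settings.local.json` along those lines looks something like this (the specific allow/deny rules beyond "Bash(timeout:*)" are just illustrative):

```json
{
  "permissions": {
    "allow": [
      "Bash(timeout:*)",
      "Bash(git diff:*)",
      "Bash(npm run lint)"
    ],
    "deny": [
      "Bash(rm -rf:*)"
    ]
  }
}
```

Every command pattern you approve interactively gets appended here, so the file grows with your actual workflow.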
The worst part is seeing READMEs in popular GitHub repos telling people to run YOLO mode without explaining the tradeoffs. They just say, "Run with these parameters, and you're all good, bruh," without any warning about the risks.
I wish they would rename the parameter to signal how scary it can be, just like React did with React.__SECRET_INTERNALS_DO_NOT_USE_OR_YOU_WILL_BE_FIRED (https://github.com/reactjs/react.dev/issues/3896)
It's a never ending game of whitelisting.
That would mean that their, undoubtedly extremely interesting, emails actually get met with more than a "450 4.1.8 Unable to find valid MX record for sender domain" rejection.
I'm sure this is just an oversight being caused by obsolete carbon lifeforms still being in charge of parts of their infrastructure, but still...
The cost estimate came out to 63 cents - details here: https://gistpreview.github.io/?27215c3c02f414db0e415d3dbf978...