frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you run LLM Agents safely?

5•rsyring•2h ago
A lot of recent HN stories and comments lately are on the topic if LLM agent safety (or lack thereof).

Let's assume that giving a non-deterministic and easily fooled program full access to run anything it wants on your dev machine is a bad idea. Let's also assume that any "handshake promises" that the LLM won't do that, or that it will get your permission before running commands, are null and void. That is, we want confidence the agent is sandboxed, not a promise from the agent it will sandbox itself.

I'm currently aware of three possible solutions but have not tried any of them yet:

- https://imbue.com/sculptor/: container based Claude, unknown post-beta pricing model

- https://docs.augmentcode.com/using-augment/remote-agent

- Run the agent in Docker with a mounted volume for the code. Seems like it would be workable but not a great DX.

What are the current best practices for sandboxing LLM agents that still give a reasonable DX for the developers using them?

AI Circular Invetment Bubble Illustrated

https://pbs.twimg.com/media/G3S6WeSXYAAJEL8?format=png&name=small
1•donsupreme•1m ago•0 comments

State of AI Report 2025

https://www.stateof.ai/
1•iamflimflam1•1m ago•0 comments

Navan founders cashing out, while company losing 100M

https://www.teamblind.com/post/founders-cashing-out-25m-while-our-company-is-losing-100m-lol-x3ko...
1•another_twist•2m ago•0 comments

Chemical Telescopes and the Process of Science

https://brianschrader.com/archive/chemical-telescopes-and-the-process-of-science/
1•sonicrocketman•3m ago•0 comments

Ancient Shipwrecks Rewrite the Story of Iron Age Trade

https://today.ucsd.edu/story/ancient-shipwrecks-rewrite-the-story-of-iron-age-trade
1•gmays•3m ago•0 comments

Porting from Perl to Go: Simplifying for Platform Engineering

https://phoenixtrap.com/2025/10/05/brew-patch-upgrade-go-port/
2•todsacerdoti•5m ago•0 comments

WTF: Ad Context Protocol?

https://digiday.com/media-buying/wtf-ad-context-protocol/
1•thm•6m ago•0 comments

Caveat Emptor: Sanctions Against Lawyers Citing Fictitious, AI-Generated Cases

https://lians.ca/news/lianswers/caveat-emptor-sanctions-against-lawyers-citing-ficticious-ai-gene...
1•nomilk•7m ago•0 comments

Cloudflare Sandboxes

https://sandbox.cloudflare.com/
2•bauerpl•10m ago•0 comments

A Bright HDR Image

https://walzr.com/HDR2.jpg
9•walz•10m ago•1 comments

Kardigan fashions $254M Series B to spin late-stage cardio assets through clinic

https://www.fiercebiotech.com/biotech/kardigan-fashions-254m-series-b-spin-late-stage-cardio-asse...
1•randycupertino•12m ago•1 comments

Are OpenAI/Nvidia/Oracle/AMD Round-Tripping?

https://www.youtube.com/watch?v=CBCujAQtdfQ
2•indigodaddy•13m ago•0 comments

Calling All Libraries: Celebrate 1T Web Pages in the Internet Archive

https://blog.archive.org/2025/10/07/calling-all-libraries-celebrate-1-trillion-web-pages-archived...
2•pieter_mj•13m ago•1 comments

Cory Doctorow: Reverse Centaurs

https://locusmag.com/feature/commentary-cory-doctorow-reverse-centaurs/
2•signa11•15m ago•0 comments

Show HN: Dirt-cheap custom ranking layer for your vector search

https://coderswap.ai/
1•vtaya•16m ago•0 comments

F5 Networks breach by "highly sophisticated nation-state" based on SEC filing

https://www.sec.gov/ix?doc=/Archives/edgar/data/1048695/000104869525000149/ffiv-20251015.htm
3•rjzzleep•17m ago•0 comments

Climate services bundles preferences of smallholder farmers in West Africa

https://www.frontiersin.org/journals/climate/articles/10.3389/fclim.2025.1581001/full
1•PaulHoule•19m ago•0 comments

Title Arbitrage as Status Engineering

https://www.humaninvariant.com/blog/titles
2•el_benhameen•19m ago•0 comments

My Top Favourite Features in Python 3.14

https://blog.codingconfessions.com/p/python-3-14-whats-new
1•rbanffy•19m ago•0 comments

Engineering a Better Java Build Tool Experience [video]

https://www.youtube.com/watch?v=-DTYm1qhQ6U
1•lihaoyi•19m ago•0 comments

LLM Structure Outputs: The Silent Hero of Production AI

https://www.decodingai.com/p/llm-structured-outputs-the-only-way
2•rbanffy•20m ago•0 comments

Ask HN: What are you vibe coding?

1•phendrenad2•20m ago•0 comments

Hazardous States and Accidents

https://entropicthoughts.com/hazardous-states-and-accidents
1•ibobev•21m ago•0 comments

Chinese interests purchased the data hub used by Whitehall departments

https://twitter.com/Steven_Swinford/status/1978469953717080190
1•beejiu•21m ago•0 comments

Certifying almost all quantum states with few single-qubit measurements

https://arxiv.org/abs/2404.07281
1•rbanffy•21m ago•0 comments

API design principle: Don't tempt people to divide by zero

https://devblogs.microsoft.com/oldnewthing/20251013-00/?p=111677
1•ibobev•22m ago•0 comments

Claude Haiku 4.5

https://www.anthropic.com/news/claude-haiku-4-5
36•adocomplete•22m ago•5 comments

Client-Side Path Traversal: Exploiting CSRF in Header-Based Auth Scenarios

https://blog.kulkan.com/client-side-path-traversal-exploiting-csrf-in-header-based-auth-scenarios...
1•laserspeed•22m ago•1 comments

ClickHouse table engines & CDC data (MergeTree, Replacing, Collapsing +)

https://www.fiveonefour.com/blog/clickhouse-table-engines-and-cdc
1•oatsandsugar•23m ago•0 comments

Optimizing Rust Enum Debugging with Perfect Hashing (2023)

https://swatinem.de/blog/optimizing-enums/
1•Rendello•23m ago•0 comments