frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting

https://botaoye.github.io/yonosplat/
1•smusamashah•54s ago•0 comments

Toyota may ban drivers from switching off safety tech

https://www.chasingcars.com.au/news/car-safety/toyota-may-ban-drivers-from-switching-off-safety-t...
1•geox•2m ago•0 comments

Show HN: FluxAPI – 13-rule API performance auditor with network-adjusted scoring

https://github.com/aswinsasi/fluxapi
1•aswinsasi123•12m ago•0 comments

AI helped me through burnout (but not how you think)

https://keygen.sh/blog/ai-helped-me-through-burnout/
2•saeedesmaili•17m ago•0 comments

Over 80% of 16 to 24-year-olds would vote to rejoin the EU

https://www.itv.com/news/2026-02-19/over-80-of-16-to-24-year-olds-would-vote-to-rejoin-the-eu-itv...
3•saubeidl•18m ago•2 comments

Show HN: CheckAPI – open-source API monitoring built with FastAPI and Next.js

https://www.checkapi.io/
2•JEONSEWON•19m ago•0 comments

The Current State of RDAP

https://www.ietf.org/blog/current-state-of-rdap/
2•gslin•19m ago•0 comments

Show HN: EV424 – Evidence Definition (Don't Trust, Verify)

2•lws9262•22m ago•1 comments

A managed disk adapter storage and routing layer for LoRA adapters on vLLM

https://github.com/shayonj/loraplex
2•shayonj•29m ago•0 comments

Don't leave assumptions up to AI

https://aklodhi.com/articles/dont-leave-assumptions-up-to-ai
2•capex•32m ago•0 comments

Ask HN: How do we protect tech workers fast

3•orangecoffee•35m ago•1 comments

Memed-in: Meme-fy your LinkedIn feed

https://github.com/taman-islam/memed-in
3•hedayet•40m ago•0 comments

Over 4 days of Google AI Studio outages

https://aistudio.google.com/status
2•idoxer•46m ago•0 comments

Decades-old programming languages still power critical modern systems

https://www.dongascience.com/en/news/76423
2•benewton•46m ago•0 comments

As We Can't Slow AI, We Must Accelerate Responsibility

https://www.michaelagreiler.com/if-we-cant-slow-ai-we-must-accelerate-responsibility/
3•madamdo•51m ago•0 comments

Show HN: Claude Code Open – AI Coding Platform with Web IDE and Agents

https://github.com/kill136/claude-code-open
2•694623326•54m ago•0 comments

24 Simultaneous Claude Code agents on local hardware

https://github.com/Mattbusel/tokio-prompt-orchestrator
2•Shmungus•55m ago•1 comments

Taalas Etches AI Models onto Transistors to Rocket Boost Inference

https://www.nextplatform.com/2026/02/19/taalas-etches-ai-models-onto-transistors-to-rocket-boost-...
1•hochmartinez•56m ago•0 comments

Elixir's Original Readme (2011)

https://github.com/elixir-lang/elixir/tree/ae3bdb72eb5a3d3bc80b4335a47579b533b0537b
2•sergiomattei•56m ago•0 comments

Armatron

https://armatron.vercel.app/
1•thomasfromcdnjs•1h ago•0 comments

Sanders warns US has no clue about speed and scale of coming AI revolution

https://www.theguardian.com/us-news/2026/feb/21/ai-revolution-bernie-sanders-warning
3•thunderbong•1h ago•0 comments

PostHog's 404 Page

https://posthog.com/skdjghdjkfhgkjhdfg
2•howToTestFE•1h ago•1 comments

Perplexity Pro promo subscription suspended without explanation?

1•aanno•1h ago•0 comments

Indian food delivery giant leak location metadata,food preferences to strangers

https://medium.com/@jatin.b.rx3/how-a-zomato-feature-enables-stalking-which-they-call-working-as-...
2•jatin-dot-py•1h ago•1 comments

Building a language that people want

https://blog.merigoux.fr/en/2026/02/19/building-proper-pl.html
1•art-w•1h ago•0 comments

Palantir Captured the UK Ministry of Defence

https://www.ft.com/content/5207928a-13e8-4832-8c6f-2e78740c16c9
6•macleginn•1h ago•0 comments

Code has always been the easy part

https://laughingmeme.org/2026/02/09/code-has-always-been-the-easy-part.html
2•Ozzie_osman•1h ago•0 comments

What Happened to Software Is Happening to Finance and Accounting

https://doempke.com/what-happened-to-software-is-happening-to-finance-and-accounting/
2•robk•1h ago•0 comments

Rare Blood Clots After Certain Covid Vaccines

https://www.mcgill.ca/oss/article/covid-19-medical-health-and-nutrition-technology/rare-blood-clo...
4•cyrc•1h ago•1 comments

Show HN: Shellspec – DSL to Test CLIs

https://github.com/itsfarseen/shellspec
2•itsfarseen-1•1h ago•0 comments
Open in hackernews

Andrej Karpathy talks about "Claws"

https://simonwillison.net/2026/Feb/21/claws/
85•helloplanets•2h ago

Comments

bjackman•1h ago
The actual content: https://xcancel.com/karpathy/status/2024987174077432126
fxj•42m ago
He also talks about picoclaw (a IoT solution) and nanoclaw (running on your phone in termux) and has a tiny code base.
7777777phil•1h ago
Karpathy has a good ear for naming things.

"Claw" captures what the existing terminology missed, these aren't agents with more tools (maybe even the opposite), they're persistent processes with scheduling and inter-agent communication that happen to use LLMs for reasoning.

arrowsmith•1h ago
He didn't name it though, Peter Steinberger did. (Kinda.)
9dev•1h ago
Why do we always have to come up with the stupidest names for things. Claw was a play on Claude, is all. Granted, I don’t have a better one at hand, but that it has to be Claw of all things…
keiferski•1h ago
The real-world cyberpunk dystopia won’t come with cool company names like Arasaka, Sense/Net, or Ono-Sendai. Instead we get childlike names with lots of vowels and alliteration.
m4rtink•1h ago
The name still kinda reminds me of the self replicating murder drones from Screemers that would leep out from the ground and chop your head off. ;-)
anewhnaccount2•10m ago
Except Phillip K Dick calls the murder bots in Second Variety claws already so there's prior art right from the master of cyberpunk.
JumpCrisscross•33m ago
> I don’t have a better one at hand

Perfect is the enemy of good. Claw is good enough. And perhaps there is utility to neologisms being silly. It conveys that the namespace is vacant.

sunaookami•23m ago
The name fits since it will claw all your personal data and files and send them somewhere else.
jcgrillo•6m ago
Much like we now say somebody has been "one-shotted", might we now say they have been "clawed"?
jcgrillo•18m ago
I've been hoping one of them will be called Clod
dakolli•29m ago
He's basically just a marketing guy now for the AI industry.
TowerTall•1h ago
Who is Andrej Karpathy?
Aeolun•1h ago
The person that made the svmjs library I used for a blue monday.
onion2k•1h ago
https://karpathy.ai/

PHD in neural networks under Fei-Fei Li, founder of OpenAI, director of AI at Tesla, etc. He knows what he's talking about.

ahoka•1h ago
Ex cathedra.
password54321•1h ago
>He knows what he's talking about.

https://en.wikipedia.org/wiki/Argument_from_authority

onion2k•1h ago
While I appreciate an appeal to authority is a logical fallacy, you can't really use that to ignore everyone's experience and expertise. Sometimes people who have a huge amount of experience and knowledge on a subject do actually make a valid point, and their authority on the subject is enough to make them worth listening to.
avaer•52m ago
But we're talking about authority of naming things being justified by a tech resume.

It's as irrelevant as George Foreman naming the grill.

onion2k•45m ago
Naming things in the context of AI, by someone who is already responsible for naming other things in the context of AI, when they have a lot of valid experience in the field of AI. It's not entirely unreasonable.
wepple•56m ago
https://en.wikipedia.org/wiki/Argument_from_fallacy
password54321•49m ago
Not claiming anything to be false, just a reminder that you should question ones opinion a bit more and not claim they "know what they are talking about" because they worked with Fei-Fei Li. You are outsourcing your thinking to someone else which is lazy and a good way of getting conned.

What even happened to https://eurekalabs.ai/?

Der_Einzige•37m ago
At one point he did. Cognitive atrophy has led him to decline just like everyone else.
William_BB•10m ago
Oh, like the LLM OS?
jb1991•1h ago
A quick Google might’ve saved you from the embarrassment of not knowing who one of the most significant AI pioneers in history is, and in a thread about AI too.
bravetraveler•1h ago
I bet they feel so, so silly. A quick bit of reflection might reveal sarcasm.

I'll live up to my username and be terribly brave with a silly rhetorical question: why are we hearing about him through Simon? Don't answer, remember. Rhetorical. All the way up and down.

snayan•4m ago
Welp, would have been a more useful post if he provided some context as to why he feels contempt for Karpathy rather than a post that is likely to come across as the parent interpreted.
tokenless•1h ago
Really smart AI guy ex Tesla, cum educator now cum vibe coder (he coined the term vibe coder)
password54321•1h ago
Someone who uses status to appeal to the tech masses / tech influencer / AI hype man.
rcore•16m ago
Snake oil salesman.
tomjuggler•1h ago
There's a gap in the market here - not me but somebody needs to build an e-commerce bot and call it Santa Claws
intrasight•19m ago
Well now somebody will
bjackman•1h ago
Does anyone know a Claw-like that:

- doesnt do its own sandboxing (I'll set that up myself)

- just has a web UI instead of wanting to use some weird proprietary messaging app as its interface?

tokenless•1h ago
Openclaw!

You can sandbox anything yourself. Use a VM.

It has a web ui.

bjackman•39m ago
Yeah I think this is gonna have to be the approach. But I don't like the fact that it has all the complexity of a baked in sandboxing solution and a big plugin architecture and blah blah blah.

TBH maybe I should just vibe code my own...

kzahel•13m ago
https://yepanywhere.com/ But has no Cron system. Just relay / remote web UI that's mobile first. I might add Cron system to it, but I think special purpose tool is better / more focused (I am the author of this)
ZeroGravitas•1h ago
So what is a "claw" exactly?

An ai that you let loose on your email etc?

And we run it in a container and use a local llm for "safety" but it has access to all our data and the web?

mattlondon•1h ago
I think for me it is an agent that runs on some schedule, checks some sort of inbox (or not) and does things based on that. Optionally it has all of your credentials for email, PayPal, whatever so that it can do things on your behalf.

Basically cron-for-agents.

Before we had to go prompt an agent to do something right now but this allows them to be async, with more of a YOLO-outlook on permissions to use your creds, and a more permissive SI.

Not rocket science, but interesting.

snovv_crash•1h ago
Cron would be for a polling model. You can also have an interrupts/events model that triggers it on incoming information (eg. new email, WhatsApp, incoming bank payments etc).

I still don't see a way this wouldn't end up with my bank balance being sent to somewhere I didn't want.

altmanaltman•1h ago
Definitely interesting but i mean giving it all my credentials feels not right. Is there a safe way to do so?
dlt713705•57m ago
In a VM or a separate host with access to specific credentials in a very limited purpose.

In any case, the data that will be provided to the agent must be considered compromised and/or having been leaked.

My 2 cents.

isuckatcoding•55m ago
Ideally workflow would be some kind of Oauth with token expirations and some kind of mobile notification for refresh
nnevatie•1h ago
That's it basically. I do not think running the tool in a container really solves the fundamental danger these tools pose to your personal data.
zozbot234•1h ago
You could run them in a container and put access to highly sensitive personal data behind a "function" that requires a human-in-the-loop for every subsequent interaction. E.g. the access might happen in a "subagent" whose context gets wiped out afterwards, except for a sanitized response that the human can verify.

There might be similar safeguards for posting to external services, which might require direct confirmation or be performed by fresh subagents with sanitized, human-checked prompts and contexts.

fxj•38m ago
A claw is an orchestrator for agents with its own memory, multiprocessing, job queue and access to instant messengers.
bravura•13m ago
There are a few qualitative product experiences that make claw agents unique.

One is that it relentlessly strives thoroughly to complete tasks without asking you to micromanage it.

The second is that it has personality.

The third is that it's artfully constructed so that it feels like it has infinite context.

The above may sound purely circumstantial and frivolous. But together it's the first agent that many people who usually avoid AI simply LOVE.

trippyballs•1h ago
lemme guess there is going to be inter claw protocol now
tokenless•1h ago
i am thinking 2 steps (48 hours in ai land) ahead and conclude we need a linkedin and fiverr for these claws.
zkmon•1h ago
AI pollution is "clawing" into every corner of human life. Big guys boast it as catching up with the trend, but not really thinking about where this is all going.
_pdp_•1h ago
You can take any AI agent (Codex, Gemini, Claude Code, ollama), run it on a loop with some delay and connect to a messaging platform using Pantalk (https://github.com/pantalk/pantalk). In fact, you can use Pantalk buffer to automatically start your agent. You don't need OpenClaw for that.

What OpenClaw did is to show the messages that this is in fact possible to do. IMHO nobody is using it yet for meaningful things, but the direction is right.

ksynwa•1h ago
Why mac mini instead of something like a raspberry pi? Aren't thede claw things delegating inference to OpenAI, Antropic etc.?
djfergus•1h ago
A Mac allows it to send iMessage and access the Apple ecosystem.
ksynwa•1h ago
Really? That's it?
joshstrange•11m ago
Ehh, not “it” but it’s important if you want an agent to have access to all your “stuff”.

macOS is the only game in town if you want easy access to iMessage, Photos, Reminders, Notes, etc and while Macs are not cheap, the baseline Mac Mini is a great deal. A raspberry Pi is going to run you $100+ when all is said and done and a Mac Mini is $600. So let’s call it. $500 difference. A Mac Mini is infinitely more powerful than a Pi, can run more software, is more useful if you decide to repurpose it, has a higher resale value and is easier to resell, is just more familiar to more people, and it just looks way nicer.

So while iMessage access is very important, I don’t think it comes close to being the only reason, or “it”.

I’d also imagine that it might be easier to have an agent fake being a real person controlling a browser on a Mac verses any Linux-based platform.

Note: I don’t own a Mac Mini nor do I run any Claw-type software currently.

kator•37m ago
Some users are moving to local models, I think, because they want to avoid the agent's cost, or they think it'll be more secure (not). The mac mini has unified memory and can dynamically allocate memory to the GPU by stealing from the general RAM pool so you can run large local LLMs without buying a massive (and expensive) GPU.
bravetraveler•1h ago
I read [and comment on] two influencers maintaining their circles
mittermayr•1h ago
I wonder how long it'll take (if it hasn't already) until the messaging around this inevitably moves on to "Do not self-host this, are you crazy? This requires console commands, don't be silly! Our team of industry-veteran security professionals works on your digital safety 24/7, you would never be able to keep up with the demands of today's cybersecurity attack spectrum. Any sane person would host their claw with us!"

Next flood of (likely heavily YC-backed) Clawbase (Coinbase but for Claws) hosting startups incoming?

iugtmkbdfil834•1h ago
In a sense, self-hosting it ( and I would argue for a personal rewrite ) is the only way to limit some of the damage.
xg15•1h ago
What exactly are they self hosting here? Probably not the model, right? So just the harness?

That does sound like the worst of both worlds: You get the dependency and data protection issues of a cloud solution, but you also have to maintain a home server to keep the agent running on?

aitchnyu•30m ago
There are lots of results for "host openclaw", some from VPS SEO spam, some from dedicated CaaS, some from PaaS. Many of them may be profitable.
hizanberg•1h ago
Why is this linking to a blog post of what someone said, instead of directly linking to what they said?

[1] https://x.com/karpathy/status/2024987174077432126

handfuloflight•1h ago
Because Simon says.
rvz•1h ago
Because the author of the blog is paid to post daily about nothing but AI and needs to link farm for clicks and engagement on a daily basis.

Most of the time, users (or the author himself) submit this blog as the source, when in fact it is just content that ultimately just links to the original source for the goal of engagement. Unfortunately, this actually breaks two guidelines: "promotional spam" and "original sourcing".

From [0]

"Please don't use HN primarily for promotion. It's ok to post your own stuff part of the time, but the primary use of the site should be for curiosity."

and

"Please submit the original source. If a post reports on something found on another site, submit the latter."

The moderators won't do anything because they are allowing it [1] only for this blog.

[0] https://news.ycombinator.com/newsguidelines.html

[1] https://news.ycombinator.com/item?id=46450908

geeunits•1h ago
I've been warned for calling this out, but I'm glad others are privy to the obvious
PacificSpecific•1h ago
Yeah it's really quite annoying. Is there a way to just block his site source from showing up on here without using external tools?
bahmboo•21m ago
I find is very easy to hit the hide button. It makes reading the site much faster but there is some feeling of fomo.
PacificSpecific•15m ago
That's per-post though isn't? I can't ban a submission source can I?

Regardless thanks for the tip

nl•52m ago
Simon's work is always appreciated. He thinks through things well, and his writing is excellent.

Just because something is popular doesn't make it bad.

sunaookami•21m ago
He massively fell off, is now only in for the marketing hype and even has a sponsor now for his blog. Sad.
hizanberg•49m ago
So everyone has to waste their time to visit a link on a blog first instead of being able to go directly to the source?

and why would anyone down vote you for calling this out, like who wants to see more low effort traffic-grab posts like this?

bahmboo•43m ago
Because he didn't submit it.
bahmboo•45m ago
The author didn't submit this to HN. I read his blog but I'm not on X so I do like when he covers things there. He's submitted 10 times in last 62 days.
bakugo•22m ago
> He's submitted 10 times in last 62 days.

Now check how many times he links to his blog in comments.

Actually, here, I'll do it for you: He has made 13209 comments in total, and 1422 of those contain a link to his blog[0]. An objectively ridiculous number, and anyone else would've likely been banned or at least told off for self-promotion long before reaching that number.

[0] https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

bahmboo•9m ago
I like being able to follow tangents and related topics outside the main comment thread so generally I appreciate when people do that via a link along with some context.

But this isn't my site and I don't get to pick the rules.

odshoifsdhfs•41m ago
Hah i didn’t see who submitted it but as soon as I read your message i thought it was simonw, and behold, tada!

HN really needs a way to block or hide posts from some users.

Der_Einzige•39m ago
Thank you for calling this out. The individual in question is massively overhyped.
bakugo•28m ago
Because HN is Simon Willison's personal advertising platform and the moderators are in on the grift, so any link to his blog or comment from him gets instantly propelled to the top and stays there all day regardless of how many guidelines it breaks.
ggrab•1h ago
IMO the security pitchforking on OpenClaw is just so overdone. People without consideration for the implications will inevitably get burned, as we saw with the reddit posts "Agentic Coding tool X wiped my hard drive and apologized profusely". I work at a FAANG and every time you try something innovative the "policy people" will climb out of their holes and put random roadblocks in your way, not for the sake of actual security (that would be fine but would require actual engagement) but just to feel important, it reminds me of that.
sa-code•52m ago
> every time you try something innovative the "policy people" will climb out of their holes and put random roadblocks in your way

This is so relatable. I remember trying to set up an LLM gateway back in 2023. There were at least 3 different teams that blocked our rollout for months until they worked through their backlog. "We're blocking you, but you’ll have to chase and nag us for us to even consider unblocking you"

At the end of all that waiting, nothing changed. Each of those teams wrote a document saying they had a look and were presumably just happy to be involved somehow?

embedding-shape•29m ago
To be fair, the alternative is them having to maintain and continuously check N services that various devs deployed because it felt appropriate in the moment, and then there is a 50/50 chance the service will just sit there unused and introduce new vulnerability vectors.

I do know the feeling you're talking about though, and probably a better balance is somewhere in the middle. Just wanted to add that the solution probably isn't "Let devs deploy their own services without review", just as the solution probably also isn't "Stop devs for 6 months to deploy services they need".

miki123211•10m ago
I think you should read "the Phoenix project."

One of the lessons in that book is that the main reasons things in IT are slow isn't because tickets take a long time to complete, but that they spend a long time waiting in a queue. The busier a resource is, the longer the queue gets, eventually leading to ~2% of the ticket's time spent with somebody doing actual work on it. The rest is just the ticket waiting for somebody to get through the backlog, do their part and then push the rest into somebody else's backlog, which is just as long.

I'm surprised FAANGs don't have that part figured out yet.

0x3f•37m ago
Work expands to fill the allocated resources in literally everything. This same effect can be seen in software engineering complexity more generally, but also government regulators, etc. No department ever downsizes its own influence or budget.
H8crilA•37m ago
This may be a good place to exchange some security ideas. I've configured my OpenClaw in a Proxmox VM, firewalled it off of my home network so that it can only talk to the open Internet, and don't store any credentials that aren't necessary. Pretty much only the needed API keys and Signal linked device credentials. The models that can run locally do run locally, for example Whisper for voice messages or embeddings models for semantic search.
embedding-shape•36m ago
I think the security worries are less about the particular sandbox or where it runs, and more about that if you give it access to your Telegram account, it can exfiltrate data and cause other issues. But if you never hand it access to anything, obviously it won't be able to do any damage, unless you instruct it to.
dakolli•31m ago
Genuinely curious, what are you doing with OpenClaw that genuinely improves your life?

The security concerns are valid, I can get anyone running one of these agents on their email inbox to dump a bunch of privileged information with a single email..

aaronrobinson•28m ago
It’s not to feel important, it’s to make others feel they’re important. This is the definition of corporate.
throwaway_z0om•18m ago
> the "policy people" will climb out of their holes

I am one of those people and I work at a FANG.

And while I know it seems annoying, these teams are overwhelmed with not only innovators but lawyers asking so many variations of the same question it's pretty hard to get back to the innovators with a thumbs up or guidance.

Also there is a real threat here. The "wiped my hard drive" story is annoying but it's a toy problem. An agent with database access exfiltrating customer PII to a model endpoint is a horrific outcome for impacted customers and everyone in the blast radius.

That's the kind of thing keeping us up at night, not blocking people for fun.

I'm actively trying to find a way we can unlock innovators to move quickly at scale, but it's a bit of a slow down to go fast moment. The goal isn't roadblocks, it's guardrails that let you move without the policy team being a bottleneck on every request.

whyoh•15m ago
>IMO the security pitchforking on OpenClaw is just so overdone.

Isn't the whole selling point of OpenClaw that you give it valuable (personal) data to work on, which would typically also be processed by 3rd party LLMs?

The security and privacy implications are massive. The only way to use it "safely" is by not giving it much of value. Maybe for some open source project where everything is public anyway, but then why does it need to take over the whole device?

pvtmert•5m ago
I am also ex-FAANG (recently departed), while I partially agree the "policy-people" pop-up fairly often, my experience is more on the inadequate checks side.

Though with the recent layoffs and stuff, the security in Amazon was getting better. Even the best-practices for IAM policies that was the norm in 2018, is just getting enforced by 2025.

Since I had a background of infosec, it always confused me how normal it was to give/grant overly permissive policies to basically anything. Even opening ports to worldwide (0.0.0.0/0) had just been a significant issue in 2024, still, you can easily get away with by the time the scanner finds your host/policy/configuration...

Although nearly all AWS accounts managed by Conduit (internal AWS Account Creation and Management Service), the "magic-team" had many "account-containers" to make all these child/service accounts joining into a parent "organization-account". By the time I left, the "organization-account" had no restrictive policies set, it is up to the developers to secure their resources. (like S3 buckets & their policies)

So, I don't think the policy folks are overall wrong. In the best case scenario, they do not need to exist in the first place! As the enforcement should be done to ensure security. But that always has an exception somewhere in someone's workflow.

the_real_cher•57m ago
What is the benefit of a Mac mini for something like this?
gostsamo•52m ago
Apple fans paying apple tax to have an isolated device accessing their profile.
intrasight•16m ago
It works and is plug and play. And can also work as a Mac. But getting in short supply since Apple hadn't planned for this new demand.
joshstrange•10m ago
Just commented in reply to someone else about this:

https://news.ycombinator.com/item?id=47099886

fxj•44m ago
He also talks about picoclaw which even runs on $10 hardware and is a fork by sipeed, a chinese company who does IoT.

https://github.com/sipeed/picoclaw

another chinese coompany m5stack provides local LLMs like Qwen2.5-1.5B running on a local IoT device.

https://shop.m5stack.com/products/m5stack-llm-large-language...

Imagine the possibilities. Soon we will see claw-in-a-box for less than $50.

tovej•30m ago
Ah yes, let's create an autonomic actor out of a nondeterministic system which can literally be hacked by giving it plaintext to read. Let's give that system access to important credentials letting it poop all over the internet.

Completely safe and normal software engineering practice.

Artoooooor•26m ago
So now the official name of the LLM agent orchestrator is claw? Interesting.
Artoooooor•22m ago
So now I will be able to tell OpenClaw to speedrun Captain Claw. Yeah.
lysecret•21m ago
Im honestly not that much worried there are some obvious problems (exfiltrate data labeled as sensitive, take actions that are costly, delete/change sensitive resources) if you have a properly compliant infrastructure all these actions need confirmations logging etc. for humans this seemed more like a neusance but now it seems essential. And all these systems are actually much much easier to setup.
dainiusse•17m ago
I don't understand the mac mini hype. Why can it not be a vm?
Aditya_Garg•13m ago
It absolutely can be a vm. Someone even got it running on a 2 dollar esp32. Its just making api calls
borplk•12m ago
I don't know but I'm guessing that it's because it makes it easy to give access to it to Mac desktop apps? Not sure what's the VM story with Mac but usually cloud VM stuff is linux so it may be inconvenient for some users to hook it up to their apps/tools.
mhher•15m ago
The current hype around agentic workflows completely glosses over the fundamental security flaw in their architecture: unconstrained execution boundaries. Tools that eagerly load context and grant monolithic LLMs unrestricted shell access are trivial to compromise via indirect prompt injection.

If an agent is curling untrusted data while holding access to sensitive data or already has sensitive data loaded into its context window, arbitrary code execution isn't a theoretical risk; it's an inevitability.

As recent research on context pollution has shown, stuffing the context window with monolithic system prompts and tool schemas actively degrades the model's baseline reasoning capabilities, making it exponentially more vulnerable to these exact exploits.

dgellow•13m ago
could you share that study?
mhher•3m ago
https://arxiv.org/abs/2512.13914

Among many more of them with similar results. This one gives a 39% drop in performance.

https://arxiv.org/abs/2506.18403

This one gives 60-80% after multiple turns.

kzahel•9m ago
I think this is basically obvious to anyone using one of these but they're just they like the utility trade off like sure it may leak and exfiltrate everything somewhere but the utility of these tools is enough where they just deal with that risk.
pvtmert•14m ago
Does one really need to _buy_ a completely new desktop hardware (ie. mac mini) to _run_ a simple request/response program?

Excluding the fact that you can run LLMs via ollama or similar directly on the device, but that will not have a very good token/s speed as far as I can guess...

Dilettante_•8m ago
I still haven't really been able to wrap my head around the usecase for these. Also fingers crossed the name doesn't stick. Something about it rubs my brain the wrong way.