frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: Broken "Public Research Articles" Link: Any Support for Google Scholar?

2•yann-gael•4h ago•1 comments

How are Iranian drones getting their guidance?

5•dottenad•4h ago•2 comments

Ask HN: The new wave of AI agent sandboxes?

8•ATechGuy•8h ago•2 comments

Spotify playing ads for paid subscribers

136•IncandescentGas•1d ago•121 comments

Tell HN: If working with agents means this, robots, please take my job

9•tiredagent•7h ago•9 comments

Ask HN: Are we ready for vulnerabilities to be words instead of code?

3•lielcohen•7h ago•8 comments

Ask HN: What do you look for in your first 10 hires?

24•neilk17•21h ago•29 comments

European municipalities leak citizen data to US companies

7•sam_lowry_•9h ago•0 comments

Ask HN: How to Find a Job in the UK

6•0x3444ac53•9h ago•1 comments

Ask HN: How do you deal with people who trust LLMs?

145•basilikum•1d ago•189 comments

TTal – CLI that turns Claude Code into a multi-agent software factory

4•neilbb•23h ago•2 comments

Ask HN: What is it like being in a CS major program these days?

210•tathagatadg•3d ago•201 comments

Ask HN: Who is still using Windsurf and why?

9•mak8•21h ago•5 comments

I built a game where you argue consumer rights against AI bots

5•dragonmann•15h ago•1 comments

Ask HN: Have you cancelled any software subscriptions because AI replaced them?

7•maxim_bg•19h ago•10 comments

Ask HN: Is vibe coding a new mandatory job requirement?

33•newswangerd•2d ago•71 comments

Tell HN: AI tools are making me lose interest in CS fundamentals

98•Tim25659•4d ago•92 comments

Ask HN: Is Claude down Again?

10•rreyes1979•1d ago•6 comments

Ask HN: Why is everyone on HN obsessed with Rust?

14•goldkey•1d ago•10 comments

Open AI is actively censoring information about voting today in the US

11•resters•2d ago•16 comments

Ask HN: How are you protecting yourself from skill atrophy?

5•xpnsec•1d ago•12 comments

Ask HN: Which router makers do you trust most?

9•general_reveal•1d ago•12 comments

Claude Code 500s

16•bavarianbob•2d ago•5 comments

Skills Manager – manage AI agent skills across Claude, Cursor, Copilot

3•evergreenxx•1d ago•8 comments

Knowledge workers managing AI show collapsed productivity, not just a plateau

6•dfordp11•1d ago•2 comments

Ask HN: Did GitHub remove Opus and Sonnet from their Copilot Pro subscription?

14•lgl•3d ago•7 comments

Ask HN: How to Learn C++ in 2026?

15•creatorcoder•4d ago•14 comments

WFH is becoming a benefit again

61•sharemywin•15h ago•72 comments

Ask HN: We need to learn algorithm when there are Claude Code etc.

11•JasonHEIN•2d ago•3 comments

Ask HN: How do you handle payments for AI agents?

2•bahaghazghazi•2d ago•3 comments
Open in hackernews

Ask HN: Are we ready for vulnerabilities to be words instead of code?

3•lielcohen•7h ago
Until now, security has been math. Buffer overflows, SQL injections, crypto flaws — deterministic, testable, formally verifiable.

But we're giving agents terminal access and API keys now. The attack vector is becoming natural language. An agent gets "socially engineered" by a prompt; another hallucinates fake data and passes it down the chain.

Trying to secure these systems feels like trying to write a regex that catches every possible lie. We've shifted the foundation of security from numbers to words, and I don't think we've figured out what that means yet.

Is anyone thinking about actual architectural solutions to this? Not just "use another LLM to guard the LLM" — that feels like circular logic. Something fundamentally different.

(Not a native English speaker, used AI to clean up the grammar.)

Comments

nine_k•7h ago
Scams and "social engineering", as known for a long time, could be a good approximation.
lielcohen•7h ago
Right, but with scams you trick a human into doing something. With agents, you give them the keys upfront - terminal, file system, API keys - because otherwise what's the point? You can't have an agent that asks permission for every action, you'd just be babysitting it all day. So the question isn't "how do we stop someone from being tricked." It's "how do we secure something that already has root access and runs on vibes instead of logic."
codingdave•6h ago
Don't give it root access.

That answer hasn't changed since day one of LLMs, despite some of the thing people are attempting to build these days: If you don't want to get in trouble, don't give LLMs access to anything that can cause actual harm, nor give them autonomy.

lielcohen•6h ago
Sure, that works today. But Meta is cutting 20% of its workforce. So is everyone else. The whole bet is that agents replace human work - and that only works if they can actually do things. Deploy, access databases, call APIs.

"Don't give it access" is like saying "don't connect to the internet" in 1995. The question isn't whether agents get these permissions. They will. The question is what happens when they do.

nine_k•5h ago
Let's see how well it works for them. Apparently Salesforce had been a bit overly enthusiastic about layoffs, and recently had to backtrack.
nine_k•5h ago
How do we expect that everything goes all right if we give prod access to a pack of very smart dogs that know some key tricks? Now the same, when humans actually leave the room?

My answer is simple: it just won't be all right this way. The problems will cost the management who drank too much kool-aid; maybe they already do (check out what was happening at Cloudflare recently). Sanity will return, now as a hard-won lesson.

lielcohen•6h ago
To be clear - I'm not really talking about my personal laptop. I'm thinking about where this is going at scale. When companies start replacing entire teams with agents (and looking at the layoffs, that's clearly the direction), those agents will need real access to production systems. That's the scenario where "just don't give it access" stops being an answer.
raw_anon_1111•4h ago
It’s really not that hard to secure agents. Just give them tightly scope API Keys, put them in front of your API and treat it like you would a user instead of behind your API.

If I were to ever use Claude in a production environment for an AWS account for instance, you best believe the role it was running with with temporary access keys would be the bare minimum.