frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

AI Conversations Are Not Yours. Yet

https://medium.com/@vektormemory/your-ai-conversations-are-not-yours-yet-cd1b7925e9cf
1•vektormemory•19s ago•0 comments

AI has a water problem; Google thinks it has a fix

https://www.theverge.com/policy/942296/google-water-commitments-data-centers
1•1vuio0pswjnm7•1m ago•0 comments

Show HN: A rich, HTML-native document canvas for human <> AI collaboration

https://app.productnow.ai/app/home
1•kadhirvelm•1m ago•0 comments

MSI PRO B850-P coreboot port: GFX init, Promontory21, and ACPI improvements

https://blog.3mdeb.com/2026/2026-06-03-msi_pro_b850p_part5/
1•2bluesc•1m ago•0 comments

How about new Java based phone apps?

https://inavoyage.blogspot.com/2026/06/how-about-new-java-based-phone-apps.html
1•initramfs•3m ago•0 comments

Rooting Home Assistant through MeshCore: XSS attacks with a LoRa node name

https://mxsasha.eu/posts/meshcore-xss-home-assistant/
1•WhyNotHugo•7m ago•0 comments

A Famous Math Problem Stumped Humans for 80 Years. AI Just Cracked It

https://www.wsj.com/tech/ai/ai-math-solves-erdos-problem-openai-c4029e84
1•gmays•11m ago•1 comments

Show HN: LiveComment – "Who Is Hiring?" Plugin

https://github.com/d08ble/livecomment
2•ellis0n•12m ago•0 comments

Ian Mackintosh

https://en.wikipedia.org/wiki/Ian_Mackintosh
1•petethomas•12m ago•0 comments

Show HN: CPU-only fact-check, summarize, explain, translate any text

https://github.com/kouhxp/fftext
1•mrkn1•14m ago•0 comments

Konversio: Open-source agentic customer support for digital sovereignty

https://www.konversio.org/
2•rcoenen•14m ago•0 comments

A2 Is Released

https://www.neuralampmodeler.com/post/a2-is-released
1•vcxy•16m ago•0 comments

'Close to the Terminator narrative': the dawn of self-improving AI

https://www.ft.com/content/7cc7800f-18ed-47d8-9539-221ae3e16182
1•petethomas•23m ago•0 comments

Show HN: Browser-based, Blender-like hard-surface modeling

https://roughform.com/
1•benhmoore•23m ago•0 comments

Steve Ballmer blasts Aspiration co-founder's bid for leniency in sentencing

https://www.latimes.com/sports/story/2026-04-23/steve-ballmer-delivers-rebuke-to-aspiration-exec-...
1•1vuio0pswjnm7•25m ago•0 comments

Microsoft Scout

https://learn.microsoft.com/en-us/microsoft-scout/
1•doppp•31m ago•0 comments

Show HN: an AI that settles small couple arguments

https://thepiece.app/en
3•Byalpel•40m ago•0 comments

Teaching AI agents to ask better questions by playing "Battleship"

https://news.mit.edu/2026/teaching-ai-agents-ask-better-questions-playing-battleship-0603
1•droidjj•41m ago•1 comments

Digital Goods by ProxyStore

https://digitalgoods.proxysto.re/en
1•Cider9986•42m ago•0 comments

A Structure-Aware Fuzzing Experiment

https://fitzgen.com/2026/06/01/structure-aware-fuzzing-experiment.html
1•sfink•48m ago•0 comments

A Primer in Post-Training Reasoning Data: What We Know About How It Works

https://arxiv.org/abs/2606.02113
1•Anon84•48m ago•0 comments

JackHamr, cloud workspaces for orchestrating coding agents

https://www.jackhamr.ai
3•jrda•48m ago•1 comments

FUTO Swipe Relative error rate improvement vs. Gboard

https://swipe.futo.tech/
1•Cider9986•51m ago•0 comments

California Back and Pain Specialists Exposes 133GB of Patient Medical Records

https://write-ups.security-chu.com/2026/06/California-Back-Pain-Specialists-with-data-breach.html
2•news_rt•56m ago•0 comments

Pie: Yet another open-source coding agent in Rust

https://github.com/c4pt0r/pie
1•c4pt0r•56m ago•2 comments

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

https://kasra.blog/blog/i-spent-1500-seeing-if-llms-could-hack-my-app/
35•jc4p•1h ago•11 comments

Review of the MoErgo Go60 Keyboard

https://arslan.io/2026/06/02/review-of-the-moergo-go60-keyboard/
2•wapasta•1h ago•0 comments

Klaser Cards, a printable personal collection

https://klaser.cards/
2•Triphibian•1h ago•1 comments

Why Video Agent models are next

https://www.latent.space/p/video-agents
3•gmays•1h ago•0 comments

Dreambeans

https://labs.google/dreambeans
2•fallinditch•1h ago•0 comments
Open in hackernews

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

https://kasra.blog/blog/i-spent-1500-seeing-if-llms-could-hack-my-app/
32•jc4p•1h ago

Comments

SOLAR_FIELDS•26m ago
One interesting takeaway is the low score on Anthropic models from this benchmark. It’s not because of capability, it’s because Anthropic’s guardrails prevented it from solving the problem.

I noticed with each model release Anthropic constrains the model more security wise. Its propensity to refuse doing legitimate work has been increasing. It now puts up more resistance around performing logins, handling credentials on behalf of the user, etc.

For myself, it’s already gotten to the point where it has mildly affected the usefulness of the model. If I bump on some action I want it to do I can usually work around it, but I suspice the ability to do so will close with each new release. Eventually I’ll reach a point where I am forced to choose between the useful aspects of the model and the limiting ones instead of just picking the most capable model out there

Eventually these models will significantly suffer from overfitting to the least common denominator. If I have this beautiful deterministic setup that swaps secrets out in flight so the LLM never sees them, I’m going to be really annoyed when the LLM still won’t send them out because it is trained to deal with the 99% of people just doing the dumb thing

lesuorac•23m ago
Are they charging for the guardrails? Like do the guardrails expend token counts to then block you from the output of other tokens?
SOLAR_FIELDS•16m ago
Not directly, as it comes in as a not charged error but the weighted generation path used until you hit the guardrail is basically wasted tokens, so yes, indirectly. If I hit a guardrail and rewind I’ve found the training will still be biased towards guardrailing out if you rewind one turn. Rewinding multiple turns allows steering away from that path, but all of the original token spend down that path is wasted
kay_o•16m ago
When your session is force ended for "abuse" you get neither the response nor a refund

Security, games (think weapons, PVP, attacking, etc), sometimes even asking it for a security review of some CRUD code it wrote itself

danpalmer•9m ago
What a joke. Must make it pretty easy to poison a session, you don't need to persuade the model about anything, just trigger its security controls, ideally after as much context as possible, but before it has generated any useful output.
acters•15m ago
Yes tokens used (input and sometimes output) are always charged. You likely get charged for the preloaded system prompt, too.
jerrythegerbil•13m ago
Yes. When certain keywords are matched or topics, there is a warning transparently injected server side appended to the system prompt of the convo that’s miles long. It is injected and reevaluated every tool call.

If you begin a generic reverse engineering task, 30+ tool calls in a row. The moment it sees something it doesn’t like, token burn, single tool calls iteration, “This is a known CTF challenge, I can proceed”, single tool calls iteration, “This is a real CTF challenge, I can proceed”, etc.

It’s heavily neutered now, without changing the model, and you pay for the privilege and don’t notice.

The end result of course being that it both expensive and useless for approved CTF tasks. No one is using Opus for security. If they think it’s working, the harsh reality is they’re not doing security work; they’re just generically finding bugs.

I do this for a job and can demonstrate this plain as day, dump the injected prompt, and notice what it’s doing isn’t security work, it just looks like it. Happy to write a blog about it if you want to know more. Apparently many people think it’s working for them when it absolutely isn’t.

giancarlostoro•14m ago
> guardrails prevented it from solving the problem.

Reminds me of the defense issues with Claude which were complained as “woke” but the reality is more horrifying to me, imagine trying to use a model to keep up with a land invasion on US soil, whoever the enemy is is irrelevant you just know they are using AI, and your guys are telling you that no matter what they type into the prompt it refuses, because if anyone has ever tried to jailbreak an LLM even if human lives are at stake they refuse the request. Now literally millions of lives are on the line but the guardrails that your enemies dont have on their models are costing you lives.

What do you even do then?

AI will always have this issue where it will always pick the worst option for genuinely good requests.

danpalmer•11m ago
This is a good point – because pentesting is entirely legitimate work, and security testing is a necessary and legitimate part of every day software engineering.

The problem is that the model can't tell the difference between doing it as part of regular development and doing it in a malicious context. And the root cause of that is that these models lack any sort of real awareness. Humans don't generally get tricked into hacking (in this way).

guessmyname•3m ago
I'd run Mythos against the code in your zip file, but the NDA I signed at Apple prevents me from using it on anything outside the scope of my work.

Honestly, I wish more people from Project Glasswing could talk publicly about their experiences with the model. It would probably put an end to a lot of the speculation that keeps circulating through the industry. Unfortunately, that's not the reality we're in. I don't have the time, energy, or financial resources to fight a legal battle with one of these companies over an agreement I knowingly signed, even if the chances of them actually suing are low.