frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Red-teaming agents with the GOAT attack strategy

https://strandsagents.com/docs/user-guide/evals-sdk/red-teaming/strategies/
2•ryancoleman•1h ago

Comments

ryancoleman•1h ago
GOAT (Generative Offensive Agent Tester, arXiv:2410.01606): an attacker LLM holds an in-context toolbox of 7 jailbreak techniques and reasons in an Observation/Thought/Strategy/Reply structure each turn, sending only the Reply to the target. Now available in an evals SDK for any agent harness SDK.

Trump officials discussed structuring government equity stakes in AI companies

https://www.semafor.com/article/06/17/2026/trump-advisers-weigh-structure-of-potential-ai-stakes
1•thoughtpeddler•2m ago•0 comments

The slow loris, the only venomous primate [video]

https://www.youtube.com/watch?v=JDqpcHGfJ2I
1•gmays•2m ago•0 comments

Mapping the neuronal building blocks of human language with language models

https://www.nature.com/articles/s41586-026-10691-5
1•rndsignals•2m ago•0 comments

Don't run SQL migrations in tests: How I sped up the test suite by 2x

https://gaultier.github.io/blog/I_sped_up_the_test_suite_by_x2.html
1•thunderbong•4m ago•0 comments

Ken Griffin's Billions and Billions

https://www.newyorker.com/magazine/2026/06/22/ken-griffins-billions-and-billions
1•gmays•5m ago•0 comments

Bugbot is now over 3x faster, 22% cheaper, and finds 10% more bugs

https://cursor.com/blog/bugbot-updates-june-2026
1•cvburgess•5m ago•0 comments

FortiBleed leak exposes Fortinet VPN credentials for 73,000 devices

https://www.bleepingcomputer.com/news/security/fortibleed-leak-exposes-fortinet-vpn-credentials-f...
1•thm•6m ago•0 comments

RedlineBench: how models handle a multi-turn, real world contract negotiation

https://intelligence.crosby.ai/benchmark/
1•zachkrall•6m ago•0 comments

Los Alamos method helps expose hallucinations in vision-language AI

https://www.lanl.gov/media/news/0603-vision-language-ai
1•geox•7m ago•0 comments

Ask HN: The economics of manufactured exclusivity and algorithmic viral loops

1•nehadangwal•8m ago•0 comments

Build a GitHub API integration for AI agents

https://nango.dev/blog/build-a-github-api-integration-for-ai-agents/
1•sapneshnaik•8m ago•0 comments

Firefox: Smart Window Beta

https://www.firefox.com/en-US/smart-window/
1•coronapl•9m ago•0 comments

Robinhood to layoff 10% of workforce in restructuring

https://www.reuters.com/sustainability/robinhood-cut-10-its-full-time-workforce-2026-06-16/
1•gurjeet•9m ago•0 comments

Show HN: Live-updating, information-dense World Cup dashboard and simulator

https://worldcupdash.com/
1•localhost3000•10m ago•0 comments

Show HN: Our Claude Code Plugin Routes Lightweight AI Tasks to Specialized SLMs

https://medium.com/zerogpu/how-to-reduce-ai-compute-costs-with-our-claude-code-plugin-routing-lig...
1•joshdappier•11m ago•0 comments

Meta head of product for 'AI for work' transformation is leaving company

https://www.reuters.com/world/meta-head-product-ai-work-transformation-is-leaving-company-2026-06...
1•tartoran•11m ago•0 comments

Why the Web Won't Be Nirvana (1995)

https://www.newsweek.com/clifford-stoll-why-web-wont-be-nirvana-185306
2•downbad_•11m ago•0 comments

Show HN: bb, an agentic IDE that can control itself

https://github.com/ymichael/bb
1•sawyerjhood•11m ago•0 comments

I Review My Agents' Code

https://ylan.segal-family.com/blog/2026/06/16/how-i-review-my-agents-code/
2•speckx•13m ago•0 comments

ANEForge: Python for direct computation on the Apple Neural Engine

https://arxiv.org/abs/2606.17090
1•StatsAreFun•14m ago•0 comments

One Foot Tsunami: Redacted Brands

https://onefoottsunami.com/2026/06/17/redacted-brands/
1•ibobev•14m ago•0 comments

Genoma Labs' open 14B agentic coding model trained on Kraken

https://huggingface.co/GenomaLabs-com/KALYPSO-v1.1L
2•nicomontuschi•15m ago•0 comments

Ask HN: How are you managing context loss when switching LLMs to save tokens?

2•nehadangwal•15m ago•0 comments

Show HN: Minecraft AI Agent for mods and server management

https://orcaclient.com/
1•ekduman•15m ago•0 comments

Matcha: A new tool for curbing AI cheating

https://dailynous.com/2026/06/17/a-new-tool-for-curbing-ai-cheating-guest-post/
2•altairprime•16m ago•0 comments

Ask HN: How close to profitability did companies get before dot com burst?

3•AbstractH24•16m ago•0 comments

Amulet – live-generated AI audio adventures

https://amulet.games/
2•pathbind•17m ago•0 comments

Apple's WebKit Rules Reportedly Cost iOS Users Almost 30% Browser Performance

https://www.macrumors.com/2026/06/17/webkit-rule-costs-ios-users-browser-performance/
2•thm•17m ago•0 comments

When your agent extensions fight each other

https://developer.microsoft.com/blog/when-your-agent-extensions-fight-each-other
2•ibobev•18m ago•0 comments

Louis Pope Gratacap, a Curator in Lost Worlds

https://publicdomainreview.org/essay/gratacap-curator-in-lost-worlds
2•crescit_eundo•18m ago•0 comments