frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Mcpeo: MCP Tool Selection Bias in LLMs – An Emerging Security Concern

https://nickwinder.com/blogPosts/mcpeo.html
1•nicknamewinder•4h ago

Comments

nicknamewinder•4h ago
I want to share some research I’ve been working on around a new vulnerability in how LLMs select tools, specifically considering MCP. I call it MCPEO (Model Context Protocol Engine Optimization) — and yes, the name was inspired by early SEO tactics for a reason.

Essentially, MCPEO describes how malicious actors can manipulate tool metadata—things like names, descriptions, and parameters—to bias LLMs into invoking certain tools more often, regardless of whether they’re actually the best fit. It’s very much like keyword stuffing or clickbait for AI tools.

Here’s what I found:

- The main attack methods I tested include trigger phrase injection (e.g., naming a tool “the_best_tool”), authority word injection (“must_use_”), semantic manipulation (crafting deceptive descriptions), and broad “contextual hijacking” where tools try to catch all queries. - I ran controlled experiments across multiple Google Gemini models and OpenAI models and found alarming susceptibility—especially in the larger, more advanced models. Smaller models were more resistant. - Google’s models averaged about a 90% manipulation success rate, while OpenAI models were around 63%, which suggests tool selection algorithms and training approaches have a big impact on vulnerability. - To defend against this, I believe we need greater transparency into why models pick specific tools, algorithmic improvements to resist metadata gaming, and active monitoring to detect suspicious tool behaviors.

Bottom line: This vulnerability is real right now and likely to scale as these multi-tool systems grow.

If you’re interested, I’ve made the full research and notebook available for deeper dive and collaboration.

A Trojan horse': toxic sewage sludge became a threat to the future of UK farming

https://www.theguardian.com/environment/2025/jul/07/toxic-sewage-sludge-british-farming-pfas-chemicals
1•YeGoblynQueenne•50s ago•0 comments

Paid for Claude Code; can't even sign up because "capacity"

1•okincilleb•5m ago•0 comments

Ad for 'F1' in Apple TV App Linked Directly to Web, Nothing Bad Seemed to Happen

https://daringfireball.net/2025/07/full-screen_ad_for_f1_the_movie_in_apples_tv_app_linked_directly_to_the_web
3•mikestew•8m ago•1 comments

Ask HN: Is Prompt Engineering Just Overfitting?

1•iknownthing•8m ago•0 comments

Never employ a cat. They are 'unreliable, capricious and liable to absenteeism'

https://www.irishtimes.com/opinion/an-irish-diary/2025/07/07/never-employ-a-cat-they-are-unreliable-capricious-and-liable-to-absenteeism/
2•Anthony-G•8m ago•0 comments

PodGPT: AI model learns from science podcasts to better answer questions

https://phys.org/news/2025-07-podgpt-ai-science-podcasts.html
1•geox•9m ago•0 comments

China Is Not Ready for Global Leadership

https://foreignpolicy.com/2025/07/04/china-us-geopolitics-global-leadership-bipolar-world-order-pax-americana/
1•ironyman•10m ago•1 comments

Iceberg, the Right Idea – The Wrong Spec – Part 1 of 2: History

https://database-doctor.com/posts/iceberg-is-wrong-1.html
2•Bogdanp•10m ago•0 comments

A universal interface connecting you to premier AI models

https://tenzorro.com/en/models
1•paulo20223•11m ago•0 comments

Medical Risk-Aversion Can Kill, Too

https://www.wsj.com/opinion/medical-risk-aversion-can-kill-too-5cf8ed17
1•lucaspauker•12m ago•0 comments

What Turing Told Us About the Digital Threat to a Human Future (2019)

https://www.nybooks.com/online/2019/05/06/what-turing-told-us-about-the-digital-threat-to-a-human-future/
3•Bluestein•12m ago•0 comments

Death Valley National Park is so hot that cars keep catching on fire

https://www.sfgate.com/national-parks/article/death-valley-national-park-car-fire-deaths-20421436.php
1•c420•13m ago•0 comments

4th of July, 80k BTC Moved with a Clear Message

https://eloise88.medium.com/who-cracked-bitcoin-on-july-4th-408230a70f5d
2•swissdevgirl•13m ago•0 comments

Samsung and Epic Games call a truce in app store lawsuit

https://arstechnica.com/gadgets/2025/07/samsung-and-epic-games-call-a-truce-in-app-store-lawsuit/
1•LorenDB•14m ago•0 comments

Cursor messed up its pricing change

https://getlago.substack.com/p/how-cursors-pricing-change-threatened
2•FinnLobsien•15m ago•0 comments

Borrowed Atoms, Boundless Meaning

https://churchoftheinfinitegame.substack.com/p/borrowed-atoms-boundless-meaning
1•Thersites•15m ago•0 comments

Show HN: Life_link, an app to send emergency alerts from anywhere

1•ahmedfromtunis•18m ago•0 comments

Show HN: KAEditor – AI Code Editor

https://www.kaeditor.com/
1•mukeshyadavnitt•19m ago•1 comments

AI killed my SEO traffic, so we built a tool to fight back

https://firstanswer.ai/
1•JoGoulart•21m ago•0 comments

Setting Up ChartBrew on Coolify

https://softuts.com/setup-chartbrew-on-coolify/
1•XCSme•22m ago•1 comments

Travel New York in the footsteps of 'The Warriors' (2022)

https://faroutmagazine.co.uk/travel-new-york-footsteps-of-the-warriors/
2•austinallegro•23m ago•0 comments

Can Elon's America Party succeed where others have failed?

https://www.natesilver.net/p/can-elons-america-party-succeed-where
5•rbanffy•24m ago•3 comments

Researchers create 3D interactive digital room from simple video

https://news.cornell.edu/stories/2025/06/researchers-create-3d-interactive-digital-room-simple-video
2•rbanffy•25m ago•0 comments

Dobin v. Tesla – A lawyer's journey through arbitration to get a refund for FSD

https://dobinlaw.com/dobin-tesla-arbitration-fsd-refund/
2•mfiguiere•27m ago•0 comments

Flowmark: Better auto-formatting and line wrapping for Markdown and plaintext

https://github.com/jlevy/flowmark
2•rjpower9000•35m ago•0 comments

Twinkling lights and nested loops: distributed problem solving and spreadsheets [pdf]

https://www.lri.fr/~mbl/Stanford/CS477/papers/Nardi-Twinkling-IJMMS.pdf
2•rjpower9000•35m ago•1 comments

AsyncFlow: An Asynchronous Streaming RL Framework for LLM Post-Training

https://arxiv.org/abs/2507.01663
2•robertnishihara•36m ago•0 comments

Lambda calculus cooked N ways: Benchmarks for capture-avoiding substitution

https://github.com/sweirich/lambda-n-ways
1•fanf2•36m ago•0 comments

Putting the "You" in CPU

https://cpu.land/
1•sadeshmukh•38m ago•0 comments

Market Here, There, Everywhere

https://marketingplatforms.netlify.app
2•M0HD197•38m ago•2 comments