Building a Deep Research Agent Using MCP-Agent

https://thealliance.ai/blog/building-a-deep-research-agent-using-mcp-agent

44•saqadri•2d ago

Comments

asail77•2d ago

A good model for planner seems pretty important, what models are best?

haniehz•2d ago

based on the article, it seems like a good reasoning model like gpt5 or opus 4.1 might be good choices for the planner. I wonder if the gpt oss reasoning models would do well

koakuma-chan•2h ago

Gemini 2.5 Pro is also a great reasoning model, I still prefer it over GPT 5

luckydata•4m ago

Gemini is great, it's just incredibly clumsy at tool use and that's why it fails so often in practice. I'm looking forward to the next version, it will for sure address it, it's a big issue internally too (I'm a recent xoogler).

diggan•1h ago

Personally been using GPT-OSS-120b locally with reasoning_effort set to `high` and it blows pretty much every other local model out of the water, but takes a lot of time for it to eventually do a proper content reply. But for fire-and-forget jobs like "Create a well-researched report on X from perspective Y" it works really well.

saqadri•2d ago

OP here -- I think the general principle I would recommend is using a big reasoning model for the planning phase. I think Claude Code and other agents do the same. The reason this is important is because the quality of the plan really affects the final result, and error rates will compound if the plan isn't good.

ilovefood•1h ago

Great write-up! Gives me a few ideas for a governance bot that I'm working on. Thanks for sharing :)

diggan•1h ago

I gotta say, having white blurry blobs of something in the background floating behind white/grey text maybe wasn't the best design-choice out there.

None the less, I tried to find the actual APIs/service/software used for the "search" part, as I've found that to be the hardest to actually get right (at least for as-local-as-possible usage) for my own "Deep Research Agent".

I've experimented with Brave's search API which worked OK, but seems pricey for agent usage. Currently experimenting with using my own (local) YaCy instance right now, which actually gives me higher quality artifacts at the end, as there are no rate-limits and the model can do hundreds of search calls without me worrying about the cost. But it isn't very quick at picking up some stuff like news and more, otherwise works OK too.

What is the author doing here for the actual searching? Anyone else have any other ideas/approaches to this?

saqadri•20m ago

Haha, I didn't have control on the blog website, just the content. The readme and code is the ultimate source of truth (and easier to read):https://github.com/lastmile-ai/mcp-agent/blob/main/src/mcp_a...

So the core idea is the Deep Orchestrator is pretty unopinionated on what to use for searching, as long as it is exposed over MCP. I tried with a basic fetch server that's one of the reference MCP servers (with a single tool called `fetch`), and also tried with Brave.

I think the folks at Jina wrote some really good stuff on the actual search part: https://jina.ai/news/a-practical-guide-to-implementing-deeps... -- and how to do page/url ranking over the course of the flow. My recommendation would be to do all that in an MCP server itself. That keeps the "deep orchestrator" architecture fairly clean, and you can plug in increasingly sophisticated search techniques over time.

UTF-8 is a brilliant design

EU court rules nuclear energy is clean energy

Many hard LeetCode problems are easy constraint problems

QGIS is a free, open-source, cross platform geographical information system

Rust: A quest for performant, reliable software [video]

The treasury is expanding the Patriot Act to attack Bitcoin self custody

How FOSS Projects Handle Legal Takedown Requests

3D modeling with paper

Humanely dealing with humungus crawlers

Vector database that can index 1B vectors in 48M

Advanced Scheme Techniques (2004) [pdf]

Qwen3-Next

Windows-Use: an AI agent that interacts with Windows at GUI layer

How to Become a Pure Mathematician (Or Statistician)

Power series, power serious (1999) [pdf]

Oq: Terminal OpenAPI Spec Viewer

Building a Deep Research Agent Using MCP-Agent

Doom-ada: Doom Emacs Ada language module with syntax, LSP and Alire support

VaultGemma: The most capable differentially private LLM

Racintosh Plus – Rackmount Mac Plus

Why do browsers throttle JavaScript timers?

Groundbreaking Brazilian Drug, Capable of Reversing Spinal Cord Injury

Show HN: DWS OS, a Plan 9 Inspired Web “OS”

Chat Control faces blocking minority in the EU

A beginner's guide to extending Emacs

Ships are sailing with fake insurance from the Norwegian Ro Marine

K2-Think: A Parameter-Efficient Reasoning System

Show HN: I made a generative online drum machine with ClojureScript

Show HN: An MCP Gateway to block the lethal trifecta

Debian 13, Postgres, and the US time zones

Building a Deep Research Agent Using MCP-Agent

Comments

UTF-8 is a brilliant design

EU court rules nuclear energy is clean energy

Many hard LeetCode problems are easy constraint problems

QGIS is a free, open-source, cross platform geographical information system

Rust: A quest for performant, reliable software [video]

The treasury is expanding the Patriot Act to attack Bitcoin self custody

How FOSS Projects Handle Legal Takedown Requests

3D modeling with paper

Humanely dealing with humungus crawlers

Vector database that can index 1B vectors in 48M

Advanced Scheme Techniques (2004) [pdf]

Qwen3-Next

Windows-Use: an AI agent that interacts with Windows at GUI layer

How to Become a Pure Mathematician (Or Statistician)

Power series, power serious (1999) [pdf]

Oq: Terminal OpenAPI Spec Viewer

Building a Deep Research Agent Using MCP-Agent

Doom-ada: Doom Emacs Ada language module with syntax, LSP and Alire support

VaultGemma: The most capable differentially private LLM

Racintosh Plus – Rackmount Mac Plus

Why do browsers throttle JavaScript timers?

Groundbreaking Brazilian Drug, Capable of Reversing Spinal Cord Injury

Show HN: DWS OS, a Plan 9 Inspired Web “OS”

Chat Control faces blocking minority in the EU

A beginner's guide to extending Emacs

Ships are sailing with fake insurance from the Norwegian Ro Marine

K2-Think: A Parameter-Efficient Reasoning System

Show HN: I made a generative online drum machine with ClojureScript

Show HN: An MCP Gateway to block the lethal trifecta

Debian 13, Postgres, and the US time zones