news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Open Source SLM Trained for MCP

https://osmosis.ai/blog/applying-rl-mcp

8•KaseyZhang•4h ago

Comments

KaseyZhang•4h ago

Hey HN! We’re Kasey and Andy from Osmosis (https://osmosis.ai/). We’ve been playing with MCP recently and wanted to share a lightweight open source model that can connect any MCP client* to any MCP server! Check it out here: https://huggingface.co/osmosis-ai/osmosis-mcp-4b

Right now, the only models that consistently work well for MCP are large and closed-source (e.g. 3.7 Sonnet, Gemini 2.5). Other models struggle with tool-calling consistency, and in particular, there’s a lack of options that can run locally.

We used Dr. GRPO to train Qwen3-4B (w/ VeRL + SGLang for multi-turn tool-calling training) for this purpose, and we were able to get performance parity with Gemini 2.5 Pro on relevant benchmarks like GSM8K (https://i.imgur.com/4RXq2Pm.png).

And since the model is open source and lightweight, that means you can also do further fine-tuning / training that’s fully local and customized to your specific needs.

Let us know what you think, or if there’s anything we can answer!

* Any client that supports Qwen3 models (i.e. it works with OpenRouter, local deployments with Ollama, etc.)

Nuclear arsenals in Pakistan and India portend regional catastrophe (2019)

https://www.science.org/doi/10.1126/sciadv.aay5478

1•Jimmc414•1m ago•0 comments

Connecticut Fire Officials Warn Against TikTok Challenge That Can Result in Fire

https://portal.ct.gov/das/press-room/press-releases/2025/connecticut-fire-officials-warn-against-tiktok-laptop-challenge-which-can-result-in-fire

1•josephcsible•3m ago•0 comments

Chinese Celebrate J-10C Fighters' "Success" over Rafale Jets During In-Pak Clash

https://www.eurasiantimes.com/as-claims-about-pakistan-shooting-down-rafale/

1•inverted_flag•5m ago•0 comments

Ask HN: My company is forcing 1 week sprints. What should I do?

1•mcsolid•5m ago•0 comments

Anchor links copied from project READMEs now add a query parameter

https://github.com/orgs/community/discussions/70577

1•mooreds•5m ago•0 comments

State-Tracer – Visualize Recoil and Jotai State Dependencies

1•apple-yagi•5m ago•0 comments

Creating a Search Engine for Fun

https://vincents.dev/blog/creating-a-search-engine/

1•mooreds•6m ago•0 comments

Creating Products People Want with Brian Pontarelli [video]

https://www.youtube.com/watch?v=sjbrojPqpYo

1•mooreds•7m ago•0 comments

The Future of Programming

1•victor_js•10m ago•0 comments

Ask HN: Are you using AI coding assistance?

1•cloudking•11m ago•2 comments

Simulating high-speed solar wind streams from coronal holes

https://www.nature.com/articles/s41598-025-97246-2

1•PaulHoule•14m ago•0 comments

A Lightweight Merge Queue Using GitHub Actions

https://sketch.dev/blog/lightweight-merge-queue

1•caust1c•15m ago•0 comments

D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning

https://dllm-reasoning.github.io/

2•t55•16m ago•0 comments

Reverse Engineering Granola to Pull Notes into Obsidian

https://josephthacker.com/hacking/2025/05/08/reverse-engineering-granola-notes.html

1•rez0123•17m ago•0 comments

The reason why I don't use AI or even code completion

https://unixdigest.com/articles/the-reason-why-i-dont-use-ai-or-even-code-completion.html

2•bitbasher•17m ago•0 comments

Are LLMs more than autocomplete? AI Debate

https://rehearsal.so/character/98f9ac23-9b48-49a0-99dc-d72d9b88d230

1•t55•17m ago•0 comments

My Excellent Conversation with Jack Clark

https://marginalrevolution.com/marginalrevolution/2025/05/my-excellent-conversation-with-jack-clark.html

1•paulpauper•23m ago•0 comments

kit - Code Intelligence Toolkit

https://github.com/cased/kit

1•helloericsf•30m ago•0 comments

Show HN: Built a site for tech comparisons backed by battle scars, not AI

https://convinceme.tech

1•Hmerac•33m ago•0 comments

Gender characteristics of service robots can influence customer decisions

https://www.psu.edu/news/health-and-human-development/story/gender-characteristics-service-robots-can-influence-customer

2•gnabgib•34m ago•0 comments

Belgian teens arrested with 5k smuggled ants, Kenya warns new trafficking trends

https://apnews.com/article/garden-ants-kenya-smugglers-belgians-vietnamese-624f12ae80e0d66a87f03966079efbc1

2•bookofjoe•36m ago•0 comments

Enforcement of Copilot premium request limits moved to June 4, 2025

https://github.blog/changelog/2025-05-07-enforcement-of-copilot-premium-request-limits-moved-to-june-4-2025/

2•kjhughes•39m ago•0 comments

Attacked by Thugs (2004)

https://idlewords.com/2004/05/attacked_by_thugs.htm

2•impish9208•40m ago•0 comments

The Molecular Bond That Helps Secure Your Memories

https://www.quantamagazine.org/the-molecular-bond-that-helps-secure-your-memories-20250507/

1•pseudolus•41m ago•0 comments

VMware perpetual license holders receive cease-and-desist letters from Broadcom

https://arstechnica.com/gadgets/2025/05/broadcom-sends-cease-and-desist-letters-to-subscription-less-vmware-users/

8•turtlegrids•41m ago•0 comments

Reasons to Write More in an Age When Writing Means Less

https://miloandthecalf.substack.com/p/three-reasons-to-write-more-in-an

2•paulpauper•43m ago•1 comments

Globalization did not hollow out the U.S. middle class

https://marginalrevolution.com/marginalrevolution/2025/05/globalization-did-not-hollow-out-the-u-s-middle-class.html

1•paulpauper•44m ago•0 comments

How eggs break and the role of strength versus toughness

https://www.nature.com/articles/s42005-025-02087-0

1•sohkamyung•46m ago•0 comments

Radiation Tolerant Software Framework for Space Applications

https://github.com/r0nlt/Space-Radiation-Tolerant

6•r0nlt•47m ago•0 comments

Speeding Up Graph Learning Models with PyG and Torch.compile

https://kumo.ai/research/speeding-up-graph-learning-models-with-pyg-and-torch-compile/

1•gk1•52m ago•0 comments