RubyLLM: A Ruby framework for all major AI providers

147•doener•1h ago

Comments

mosselman•1h ago

It is quite nice, but not as nice as you'd want. You still have to set platform specifics when running completions when you want to tune things like temperature, effort, max tokens, etc.

earcar•1h ago

RubyLLM author here.

I'm not sure where you got that.

`chat.with_temperature(0.2)`

https://rubyllm.com/chat/#controlling-response-behavior

`chat.with_thinking(effort: :high, budget: 8000)`

https://rubyllm.com/thinking/#controlling-extended-thinking

Max tokens is the only one of your list that require provider specific params:

https://rubyllm.com/chat/#provider-specific-parameters

I'm one guy doing it for free. Happy to see your contribution!

techscruggs•1h ago

And thank you! It is absolutely awesome and a true joy to work with.

mosselman•51m ago

Hi! Valid challenge, I am probably misremembering. We were playing with various 'one-interface to all providers' solutions and I might have mixed up RubyLLM there. Sorry for that.

I will have a deep dive into which things I felt we needed to adapt per provider.

I didn't mean to imply that you have to solve all of our wants of course.

One thing we did do was monkey-patch the spot where tool_calls are performed by RubyLLM. We had our own mechanism for that and were able to skip RubyLLM's and still extract the tool calls and run them through our own tool harness. That all worked beautifully. I don't know if that type of stuff is something you want PRs on or that you want to keep steering towards the route that does everything within RubyLLM classes. Happy to contribute some of that.

earcar•40m ago

Interesting! What were you guys trying to achieve by running them in your own tool harness?

swe_dima•1h ago

I found Ruby LLM to be surprisingly good - in terms of usability it's close to Vercel's AI framework.

It tries to strike a balance between working out of the box and being flexible... which has its challenges, still nice overall.

One big real-life pain I experienced is that caches don't always work, e.g. for xAI, since it only supports completions API and thought signatures are returned wrong.

earcar•1h ago

Thank you!

Responses API is now implemented and it's coming in RubyLLM 2.0

https://github.com/crmne/ruby_llm/blob/main/lib/ruby_llm/pro...

techscruggs•1h ago

Do you have any details published around 2.0? Would love to learn more.

earcar•1h ago

Not yet. I'll do a series of blog posts and tweets in the next weeks.

zhisme•1h ago

thank you for bringing ruby into AI community and your open-source work. Great language must be explored and get more attention :)

earcar•1h ago

Thank you!

I love how MINASWAN Hacker News is when talking about Ruby!

fragkakis•1h ago

I have created an open source chatgpt clone with rubyllm, check it out here: https://www.railschat.org/

EGreg•1h ago

In case you're using PHP or Node.js, we've made a similar toolkit free and open source on github: https://github.com/Qbix/AI/tree/main/classes/AI

Finbarr•54m ago

RubyLLM is very easy to use. Made extensive use of it for a project last year. Drawbacks are it was difficult to instrument for true trace observability and it has a pattern where retries will delete the underlying models so the history you see is clean but not necessarily great for seeing exactly what the sequence of API calls was.

earcar•52m ago

Glad you like it.

Rails-style instrumentation landed in 1.16.0.

https://rubyllm.com/instrumentation/

bitedeck•53m ago

Thank you

themcgruff•35m ago

I built a similar Ruby based agent development kit that has a different focus and feature set:

https://github.com/tweibley/legate

obiefernandez•19m ago

I have an open source gem called Raix that builds on top of RubyLLM's abstractions and is quite popular. https://github.com/OlympiaAI/raix

notpachet•15m ago

Why would anyone still build in dynamically typed languages in 2026? Why relinquish the crystal clear signals that static typing is able to provide to the LLM?

jimbokun•13m ago

This is not a tool for using LLMs to write Ruby code.

taylorlapeyre•10m ago

Well, LLMs have an obscene amount of context built into their weights about Ruby on Rails, and can work within it extremely quickly.

The AI Data Centre Legal Case That Could Eradicate Civil Rights

Why big AI labs are hiring so many philosophers

What does your eval measure?

Show HN: Tuip – CLI / TUI for checking SaaS vendors' statuses

Loops Burn Tokens

Show HN: Gifhub, bug hunter that shows instead of tells

The Bargain. Or what America forgot and Europe still keeps

The Xteink X4 E-Ink Reader

Sentrup – AI Customer Support Platform

Exploiting vulnerabilities in Johnson and Johnson web apps

Show HN: Cutlistor – Instant cut list optimizer with 3D Model and PDF Import

I crawled 827 employers' career sites to measure ATS market share

Germany's Kai Havertz: 'I make runs that look pointless but I'm creating space'

Ask HN: How much coding should beginners learn in the AI era?

Show HN: Empowering codex/Claude Code with Aswath Damodaran valuation thinking

Building a LoFi Radio

Show HN: Metaspec: The DpANS3R Common Lisp Spec in S-Expr and HTML Format

Show HN: Browser based tool for programming ch57x macro-pads

Create cross-platform mobile apps with Ruby

Show HN: (Spotlight/Raycast for Web Search not local) && (compare AI responses)

How to Measure the ROI of FDE

Show HN: LinkedIn Remote jobs by technology and country Map. Joint effort.

Seoul: AWS and Google Cloud Kept Failing the Same Network Path?

Human Dignity – On the Perils of Indifference

Claude Agents in Notion

Fable – Is it ever coming back?

Retracted: Paper claiming immunochemotherapy more effective in morning

Agentic Design Patterns

ModelFit – find the cheapest LLM that can back up your main coding model

Predicting AI Job Exposure

RubyLLM: A Ruby framework for all major AI providers

Comments

The AI Data Centre Legal Case That Could Eradicate Civil Rights

Why big AI labs are hiring so many philosophers

What does your eval measure?

Show HN: Tuip – CLI / TUI for checking SaaS vendors' statuses

Loops Burn Tokens

Show HN: Gifhub, bug hunter that shows instead of tells

The Bargain. Or what America forgot and Europe still keeps

The Xteink X4 E-Ink Reader

Sentrup – AI Customer Support Platform

Exploiting vulnerabilities in Johnson and Johnson web apps

Show HN: Cutlistor – Instant cut list optimizer with 3D Model and PDF Import

I crawled 827 employers' career sites to measure ATS market share

Germany's Kai Havertz: 'I make runs that look pointless but I'm creating space'

Ask HN: How much coding should beginners learn in the AI era?

Show HN: Empowering codex/Claude Code with Aswath Damodaran valuation thinking

Building a LoFi Radio

Show HN: Metaspec: The DpANS3R Common Lisp Spec in S-Expr and HTML Format

Show HN: Browser based tool for programming ch57x macro-pads

Create cross-platform mobile apps with Ruby

Show HN: (Spotlight/Raycast for Web Search not local) && (compare AI responses)

How to Measure the ROI of FDE

Show HN: LinkedIn Remote jobs by technology and country Map. Joint effort.

Seoul: AWS and Google Cloud Kept Failing the Same Network Path?

Human Dignity – On the Perils of Indifference

Claude Agents in Notion

Fable – Is it ever coming back?

Retracted: Paper claiming immunochemotherapy more effective in morning

Agentic Design Patterns

ModelFit – find the cheapest LLM that can back up your main coding model

Predicting AI Job Exposure