Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer

https://georgelarson.me/writing/2026-03-23-nullclaw-doorman/

44•j0rg3•1h ago

Comments

j0rg3•1h ago

The stack: two agents on separate boxes. The public one (nullclaw) is a 678 KB Zig binary using ~1 MB RAM, connected to an Ergo IRC server. Visitors talk to it via a gamja web client embedded in my site. The private one (ironclaw) handles email and scheduling, reachable only over Tailscale via Google's A2A protocol.

Tiered inference: Haiku 4.5 for conversation (sub-second, cheap), Sonnet 4.6 for tool use (only when needed). Hard cap at $2/day.

A2A passthrough: the private-side agent borrows the gateway's own inference pipeline, so there's one API key and one billing relationship regardless of who initiated the request.

You can talk to nully at https://georgelarson.me/chat/ or connect with any IRC client to irc.georgelarson.me:6697 (TLS), channel #lobby.

jgrizou•1h ago

Works very well

sbinnee•1h ago

Nice. I had some fun. Good work!

One question. Sonnet for tool use? I am just guessing here that you may have a lot of MCPs to call and for that Sonnet is more reliable. How many MCPs are you running and what kinds?

iLoveOncall•1h ago

The model used is a Claude model, not self-hosted, so I'm not sure why the infrastructure is at all relevant here, except as click bait?

petcat•1h ago

Meh it's kind of interesting. Even if it is just a ridiculously over engineered agent orchestrator for a chat box and code search

echelon•47m ago

We need more infra in the cloud instead of focusing on local RTX cards.

We need OpenRunPods to run thick open weights models.

Build in the cloud rather than bet on "at the edge" being a Renaissance.

jazzyjackson•33m ago

It’s not that deep, show HN is just that, show and tell, I seriously doubt this was built just to get engagement on social media

0xbadcafebee•1h ago

This is such a great idea. I have an idea now for a bot that might help make tech hiring less horrible. It would interview a candidate to find out more about them personally/professionally. Then it would go out and find job listings, and rate them based on candidate's choices. Then it could apply to jobs, and send a link to the candidate's profile in the job application, which a company could process with the same bot. In this way, both company and candidate could select for each other based on their personal and professional preferences and criteria. This could be entirely self-hosted open-source on both sides. It's entirely opt-in from the candidate side, but I think everyone would opt-in, because you want the company to have better signal about you than just a resume (I think resumes are a horrible way to find candidates).

eclipxe•54m ago

Working on this actually

jaggederest•38m ago

Triplebyte was a thing for a little while, maybe it's time for it to live again.

InitialPhase55•48m ago

Curious, how did you settle on Haiku/Sonnet? Because there are much cheaper models on OpenRouter that probably perform comparatively...

Consider Haiku 4.5: $1/M input tokens | $5/M output tokens vs MiniMax M2.7: $0.30/M input tokens | $1.20/M output tokens vs Kimi K2.5: $0.45/M input tokens | $2.20/M output tokens

I haven't tried so I can't say for sure, but from personal experience, I think M2.7 and K2.5 can match Haiku and probably exceed it on most tasks, for much cheaper.

eric_khun•37m ago

that's so fun ! how do you know when to call haiku or sonnet?

czhu12•6m ago

Super random but I had a similar idea for a bot like this that I vibe coded while on a train from Tokyo to Osaka

https://web-support-claw.oncanine.run/

Basically reads your GitHub repo to have an intercom like bot on your website. Answer questions to visitors so you don’t have to write knowledge bases.

Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer

Why so many control rooms were seafoam green (2025)

Chicago artist creates tourism posters for city's neighborhoods

DOOM Over DNS

Judge blocks Pentagon effort to 'punish' Anthropic with supply chain risk label

New York City hospitals drop Palantir as controversial AI firm expands in UK

Moving from GitHub to Codeberg, for lazy people

Show HN: Veil – Dark mode PDFs without destroying images, runs in the browser

My minute-by-minute response to the LiteLLM malware attack

Apple Discontinues Mac Pro

Anthropic Subprocessor Changes

CERN to host a new phase of Open Research Europe

HyperAgents: Self-referential self-improving agents

John Bradley, author of xv, has died

OpenTelemetry profiles enters public alpha

Whistler: Live eBPF Programming from the Common Lisp REPL

Order Granting Preliminary Injunction – Anthropic vs. U.S. Department of War [pdf]

Using FireWire on a Raspberry Pi

We haven't seen the worst of what gambling and prediction markets will do

Show HN: Fio: 3D World editor/game engine – inspired by Radiant and Hammer

How much precision can you squeeze out of a table?

Show HN: Turbolite – a SQLite VFS serving sub-250ms cold JOIN queries from S3

What Does a Hologram Trademark Signify When the Hologram Isn't There?

Colibri – chat platform built on the AT Protocol for communities big and small

Stripe Projects: Provision and manage services from the CLI

Running Tesla Model 3's computer on my desk using parts from crashed cars

From zero to a RAG system: successes and failures

Fast regex search: indexing text for agent tools

DeployTarot.com – Tarot card reading for deployments

My home network observes bedtime with OpenBSD and pf

Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer

Comments

Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer

Why so many control rooms were seafoam green (2025)

Chicago artist creates tourism posters for city's neighborhoods

DOOM Over DNS

Judge blocks Pentagon effort to 'punish' Anthropic with supply chain risk label

New York City hospitals drop Palantir as controversial AI firm expands in UK

Moving from GitHub to Codeberg, for lazy people

Show HN: Veil – Dark mode PDFs without destroying images, runs in the browser

My minute-by-minute response to the LiteLLM malware attack

Apple Discontinues Mac Pro

Anthropic Subprocessor Changes

CERN to host a new phase of Open Research Europe

HyperAgents: Self-referential self-improving agents

John Bradley, author of xv, has died

OpenTelemetry profiles enters public alpha

Whistler: Live eBPF Programming from the Common Lisp REPL

Order Granting Preliminary Injunction – Anthropic vs. U.S. Department of War [pdf]

Using FireWire on a Raspberry Pi

We haven't seen the worst of what gambling and prediction markets will do

Show HN: Fio: 3D World editor/game engine – inspired by Radiant and Hammer

How much precision can you squeeze out of a table?

Show HN: Turbolite – a SQLite VFS serving sub-250ms cold JOIN queries from S3

What Does a Hologram Trademark Signify When the Hologram Isn't There?

Colibri – chat platform built on the AT Protocol for communities big and small

Stripe Projects: Provision and manage services from the CLI

Running Tesla Model 3's computer on my desk using parts from crashed cars

From zero to a RAG system: successes and failures

Fast regex search: indexing text for agent tools

DeployTarot.com – Tarot card reading for deployments

My home network observes bedtime with OpenBSD and pf