frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
214•isitcontent•12h ago•25 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
319•vecti•14h ago•141 comments

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
2•vladeta•24m ago•1 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
270•eljojo•15h ago•159 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
69•phreda4•12h ago•13 comments

Show HN: Smooth CLI – Token-efficient browser for AI agents

https://docs.smooth.sh/cli/overview
90•antves•1d ago•66 comments

Show HN: ARM64 Android Dev Kit

https://github.com/denuoweb/ARM64-ADK
16•denuoweb•1d ago•2 comments

Show HN: Compile-Time Vibe Coding

https://github.com/Michael-JB/vibecode
9•michaelchicory•1h ago•1 comments

Show HN: Slack CLI for Agents

https://github.com/stablyai/agent-slack
47•nwparker•1d ago•11 comments

Show HN: Artifact Keeper – Open-Source Artifactory/Nexus Alternative in Rust

https://github.com/artifact-keeper
150•bsgeraci•1d ago•63 comments

Show HN: Gigacode – Use OpenCode's UI with Claude Code/Codex/Amp

https://github.com/rivet-dev/sandbox-agent/tree/main/gigacode
17•NathanFlurry•20h ago•7 comments

Show HN: Slop News – HN front page now, but it's all slop

https://dosaygo-studio.github.io/hn-front-page-2035/slop-news
8•keepamovin•2h ago•2 comments

Show HN: Fitspire – a simple 5-minute workout app for busy people (iOS)

https://apps.apple.com/us/app/fitspire-5-minute-workout/id6758784938
2•devavinoth12•5h ago•0 comments

Show HN: Horizons – OSS agent execution engine

https://github.com/synth-laboratories/Horizons
23•JoshPurtell•1d ago•5 comments

Show HN: I built a RAG engine to search Singaporean laws

https://github.com/adityaprasad-sudo/Explore-Singapore
4•ambitious_potat•6h ago•4 comments

Show HN: Daily-updated database of malicious browser extensions

https://github.com/toborrm9/malicious_extension_sentry
14•toborrm9•17h ago•7 comments

Show HN: Sem – Semantic diffs and patches for Git

https://ataraxy-labs.github.io/sem/
2•rs545837•7h ago•1 comments

Show HN: Micropolis/SimCity Clone in Emacs Lisp

https://github.com/vkazanov/elcity
172•vkazanov•2d ago•49 comments

Show HN: BioTradingArena – Benchmark for LLMs to predict biotech stock movements

https://www.biotradingarena.com/hn
25•dchu17•17h ago•12 comments

Show HN: Falcon's Eye (isometric NetHack) running in the browser via WebAssembly

https://rahuljaguste.github.io/Nethack_Falcons_Eye/
4•rahuljaguste•11h ago•1 comments

Show HN: FastLog: 1.4 GB/s text file analyzer with AVX2 SIMD

https://github.com/AGDNoob/FastLog
5•AGDNoob•8h ago•1 comments

Show HN: Local task classifier and dispatcher on RTX 3080

https://github.com/resilientworkflowsentinel/resilient-workflow-sentinel
25•Shubham_Amb•1d ago•2 comments

Show HN: Gohpts tproxy with arp spoofing and sniffing got a new update

https://github.com/shadowy-pycoder/go-http-proxy-to-socks
2•shadowy-pycoder•9h ago•0 comments

Show HN: I built a directory of $1M+ in free credits for startups

https://startupperks.directory
4•osmansiddique•9h ago•0 comments

Show HN: A Kubernetes Operator to Validate Jupyter Notebooks in MLOps

https://github.com/tosin2013/jupyter-notebook-validator-operator
2•takinosh•10h ago•0 comments

Show HN: A password system with no database, no sync, and nothing to breach

https://bastion-enclave.vercel.app
11•KevinChasse•17h ago•16 comments

Show HN: GitClaw – An AI assistant that runs in GitHub Actions

https://github.com/SawyerHood/gitclaw
9•sawyerjhood•18h ago•0 comments

Show HN: 33rpm – A vinyl screensaver for macOS that syncs to your music

https://33rpm.noonpacific.com/
3•kaniksu•11h ago•0 comments

Show HN: Chiptune Tracker

https://chiptunes.netlify.app
3•iamdan•11h ago•1 comments

Show HN: Craftplan – I built my wife a production management tool for her bakery

https://github.com/puemos/craftplan
567•deofoo•5d ago•166 comments
Open in hackernews

Show HN: RowboatX – open-source Claude Code for everyday automations

https://github.com/rowboatlabs/rowboat
131•segmenta•2mo ago
Claude Code is great, but it’s focused on coding. The missing piece is a native way to build and run custom background agents for non-code tasks. We built RowboatX as a CLI tool modeled after Claude Code that lets you do that. It uses the file system and unix tools to create and monitor background agents for everyday tasks, connect them to any MCP server for tools, and reason over their outputs.

Because RowboatX runs locally with shell access, the agents can install tools, execute code, and automate anything you could do in a terminal with your explicit permission. It works with any compatible LLM, including open-source ones.

Our repo is https://github.com/rowboatlabs/rowboat, and there’s a demo video here: https://youtu.be/cyPBinQzicY

For example, you can connect RowboatX to the ElevenLabs MCP server and create a background workflow that produces a NotebookLM-style podcast every day from recent AI-agent papers on arXiv. Or you can connect it to Google Calendar and Exa Search to research meeting attendees and generate briefs before each event.

You can try these with: `npx @rowboatlabs/rowboatx`

We combined three simple ideas:

1. File system as state: Each agent’s instruction, memory, logs, and data are just files on disk, grepable, diffable, and local. For instance, you can just run: grep -rl '"agent":"<agent-name>"' ~/.rowboat/runs to list every run for a particular workflow.

2. The supervisor agent: A Claude Code style agent that can create and run background agents. It predominantly uses Unix commands to monitor, update, and schedule agents. LLMs handle Unix tools better than backend APIs [1][2], so we leaned into that. It can also probe any MCP server and attach the tools to the agents.

3. Human-in-the-loop: Each background agent can emit a human_request message when needed (e.g. drafting a tricky email or installing a tool) that pauses execution and waits for input before continuing. The supervisor coordinates this.

I started my career over a decade ago building spam detection models at Twitter, spending a lot of my time in the terminal with Unix commands for data analysis [0] and Vowpal Wabbit for modeling. When Claude Code came along, it felt familiar and amazing to work with. But trying to use it beyond code always felt a bit forced. We built RowboatX to bring that same workflow to everyday tasks. It is Apache-2.0 licensed and easily extendable.

While there are many agent builders, running on the user's terminal enables unique use cases like computer and browser automation that cloud-based tools can't match. This power requires careful safety design. We implemented command-level allow/deny lists, with containerization coming next. We’ve tried to design for safety from day one, but we’d love to hear the community’s perspective on what additional safeguards or approaches you’d consider important here.

We’re excited to share RowboatX with everyone here. We’d love to hear your thoughts and welcome contributions!

—

[0] https://web.stanford.edu/class/cs124/kwc-unix-for-poets.pdf [1] https://arxiv.org/pdf/2405.06807 [2] https://arxiv.org/pdf/2501.10132

Comments

jckahn•2mo ago
Can this use local LLMs?
segmenta•2mo ago
Yes - you can use local LLMs through LiteLLM and Ollama. Would you like us to support anything else?
thedangler•2mo ago
LM Studio?
ramnique•2mo ago
Yes, because LM Studio is openai-compatible. When you run rowboatx the first time, it creates a ~/.rowboat/config/models.json. You can then configure LM Studio there. Here is an example: https://gist.github.com/ramnique/9e4b783f41cecf0fcc8d92b277d...
divan•2mo ago
One of the main reasons for me for sticking with Claude Code (also for non-coding tasks, I think the name is a misnomer) is the fixed price plan. Pretty much any other open-source alternative requires API key, which means that as soon as I start using it _for real_, I'll start overpaying and/or hitting limits too fast. At least that was my initial experience with API from OpenAI/Claude/Gemini.

Am I biased/wrong here?

segmenta•2mo ago
Yep, this is a fair take. Token usage shoots up fast when you do agentic stuff for coding. I too end up doing the same thing.

But for most background automations your might actually run, the token usage is way lower and probably an order of magnitude cheaper than agentic coding. And a lot of these tasks run well on cheaper models or even open-source ones.

So I don't think you are wrong at all. It is just that I believe the expensive token pattern mostly comes from coding-style workloads.

kej•2mo ago
I don't doubt you, but it would be interesting to see some token usage measurements for various tasks like you describe.
segmenta•2mo ago
For example, the NotebookLM-style podcast generator workflow in our demo uses around 3k tokens end to end. Using Claude Sonnet 4.5’s blended rate (about $4.5 per million tokens for typical input/output mix), you can run this every day for roughly eight months for a bit over three dollars. Most non-coding automations end up in this same low range.
giancarlostoro•2mo ago
You're not wrong, though I suspect the AI "bubble burst" begins to happen when companies like Anthropic stop giving us so much compute for 'free' the only hope is that as things get better their cheaper models get as good as their best models today and so it costs drastically less to use them.
segmenta•2mo ago
Yeah, I think when they made the bet it genuinely made sense. But in coding workflows, once models got cheaper, people did not spend less. They just started packing way more LLM calls into a single turn to handle complex agentic coding steps. That is probably where the math started to break down.
aitchnyu•2mo ago
Sonnet is 3$ per million tokens, Grok Code Fast is 0.2$. IME the latter is better for me. Wish everybody treats AI as a pay-as-you-go commodity instead of getting dependant on rugpulls. My stack is Openrouter (model marketplace) and Aider (Kilocode and Cline for user friendly alternatives).
segmenta•2mo ago
Will check out Grok Code Fast - thanks for the pointer. In my experience, coding agents can swing a lot in quality depending on the model’s reasoning power. When the model starts making small but avoidable mistakes, the overhead tends to cancel out the benefit. Curious to see how Grok performs on multi-step coding tasks.
aitchnyu•2mo ago
True. Im working with Python CRUD apps, which every model is fluent in. And I'm personally generating 100-line changes, not letting it run while I'm AFK.
giancarlostoro•2mo ago
That's what I love most about Claude. I love Django and I love React (the richness of building UIs with React is insane) and sure enough Claude Code (and other models I'm sure) is insanely good at both.
segmenta•2mo ago
Ah, that makes sense. I’ve had similar luck with UI refactoring on cheaper models, mainly because you can quickly verify whether the output is right.
giancarlostoro•2mo ago
Zed is nicely setup for this I just have not taken the time. I do like how Claude works atm. The coding agent functionality is what's nice about Claude Code. I don't know that Grok Code Fast has that?
segmenta•2mo ago
Came across this interesting article on the internals of Claude Code: https://medium.com/@outsightai/peeking-under-the-hood-of-cla... . The author redirected the LLM calls into LiteLLM to analyze how it behaves.
giancarlostoro•2mo ago
That's really neat!
jonathanleane•2mo ago
What non coding tasks do you use Claude Code for? Genuinely curious.
world2vec•2mo ago
It can also take notes, write down plans and TODO lists, update on gym records, etc, etc.
Tusharmagar•2mo ago
Curious how you're handling notes and TODO lists. Do you give Claude Code access to a local markdown file, or is it working some other way?
world2vec•2mo ago
Pretty much yeah, just a folder with .md files
brianjking•2mo ago
LOTS! Sometimes for quick file system organization, creating Claude skills, deep document analysis.

Anthropic published a doc or two about this too, here's one of them: https://www-cdn.anthropic.com/58284b19e702b49db9302d5b6f135a...

segmenta•2mo ago
Ah, that’s interesting. Are there any parts of Claude Code that you feel could work differently or get in your way for these kinds of tasks?
ntnsndr•2mo ago
I use CC regularly for editing large text files (especially turning interview transcripts into something readable) and have found it works much better than web chat interfaces because of filesystem access and ability to work with large files.
segmenta•2mo ago
That’s great to know. I’ve come to the same conclusion. I’ve found that things work best when they happen right where I’m already working. Uploading files or recreating context in a web service adds friction, especially when everything is already available locally.
nl•2mo ago
I'm increasingly seeing code-adjacent people who are using coding agents for non-coding things because the tooling support it better, and the agents work really well.

It's an interesting area, and glad to see someone working on this.

The other program in the space that I'm aware of is Block's Goose.

segmenta•2mo ago
Yep, totally agree. We actually had an earlier web version, and the big learning was that without access to code-related tools the agent feels pretty limited. That pushed us toward a CLI where it can use the full shell and behave more like a real worker.

Really appreciate the support and the Goose pointer. Would love to hear what you think of RowboatX once you try it.

RamblingCTO•2mo ago
Pretty cool! A bit of an upgrade of just letting claude write pocketflow agents for stuff. That's what I'm doing now.
segmenta•2mo ago
Thanks! Curious what kinds of workflows you are automating right now and any pain points you’ve run into.
RamblingCTO•2mo ago
I automated a bunch of business workflows like sorting documents for accounting in my cloud storage, tagging emails that are invoices, stuff like that. I do this for a living though, so I also use these as casestudies and rowboat is a hard sell for end users I guess.
segmenta•2mo ago
Thanks, that makes sense. I assume the CLI approach might be the main hurdle for end users?
RamblingCTO•2mo ago
Exactly. They want something that just works and runs somewhere. I'm talking about SME owners.
_ink_•2mo ago
> E.g. "Generate a NotebookLM-style podcast from my saved articles every morning"

How does it do that? Does it require a tool for that? Or a special model?

segmenta•2mo ago
In the demo - for the voice and music, it uses the ElevenLabs MCP server for TTS, and ffmpeg locally to stitch the audio together. In the demo we pull content from Tweets via an MCP tool, but you can swap that step for anything — for example, fetching your saved articles with curl. There’s no special model required; any LLM that can call tools will work.
segmenta•2mo ago
We’ve included the exact agents used in the demo here for reference: https://github.com/rowboatlabs/rowboat/blob/cli/apps/cli/exa....

We’re adding an easier way to run examples soon. In the meantime, if you’d like to try this one locally: (1) Copy the agent file into ~/.rowboat/agents/ (2) Add the MCP server (and your keys) to ~/.rowboat/config/mcp.json (3) Run: 'rowboatx --agent=tweet-podcast --input=go'

crimsoneer•2mo ago
I wonder how this compares with the CLine CLI?
segmenta•2mo ago
Cline is great, but it’s primarily focused on coding workflows, similar to Claude Code. RowboatX is aimed at a different category: background agents for non-coding automations (e.g. meeting prep, daily briefings, etc).

The big difference from Claude Code (and Cline) is that RowboatX can spin up persistent background agents that run on schedules, use the system shell, and call MCP tools to automate tasks outside of coding.

thepasswordapp•2mo ago
The browser automation point is key. We've been using browser-use (the Python library) to automate password changes across websites for thepassword.app - turns out this is a surprisingly hard non-coding automation task.

Every site has a different password change flow. Some require 2FA mid-flow, others throw CAPTCHAs, and many actively detect automation. Traditional selectors break constantly as sites update their DOM.

The LLM loop approach handles this much better than scripted automation because the agent adapts in real-time. But the security architecture matters a lot - we had to modify browser-use to pass credentials via a separate parameter instead of including them in the prompt. Otherwise passwords would be visible in the LLM context.

Human-in-the-loop is essential here too. When the agent encounters something unexpected (unusual 2FA, security questions), it needs to pause and let the user decide rather than guessing.