frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

It's hard to justify Tahoe icons

https://tonsky.me/blog/tahoe-icons/
538•lylejantzi3rd•2h ago•241 comments

Databases in 2025: A Year in Review

https://www.cs.cmu.edu/~pavlo/blog/2026/01/2025-databases-retrospective.html
227•viveknathani_•6h ago•72 comments

Decorative Cryptography

https://www.dlp.rip/decorative-cryptography
118•todsacerdoti•5h ago•30 comments

A spider web unlike any seen before

https://www.nytimes.com/2025/11/08/science/biggest-spiderweb-sulfur-cave.html
140•juanplusjuan•6h ago•64 comments

Anna's Archive loses .org domain after surprise suspension

https://torrentfreak.com/annas-archive-loses-org-domain-after-surprise-suspension/
250•CTOSian•3h ago•88 comments

Cigarette smoke effect using shaders

https://garden.bradwoods.io/notes/javascript/three-js/shaders/shaders-103-smoke
19•bradwoodsio•2h ago•2 comments

Show HN: Circuit Artist – Circuit simulator with propagation animation, rewind

https://github.com/lets-all-be-stupid-forever/circuit-artist
58•rafinha•4d ago•2 comments

Revisiting the original Roomba and its simple architecture

https://robotsinplainenglish.com/e/2025-12-27-roomba.html
57•ripe•2d ago•33 comments

Lessons from 14 years at Google

https://addyosmani.com/blog/21-lessons/
1378•cdrnsf•22h ago•601 comments

Scientists Uncover the Universal Geometry of Geology (2020)

https://www.quantamagazine.org/scientists-uncover-the-universal-geometry-of-geology-20201119/
20•fanf2•4d ago•4 comments

Jensen: 'We've Done Our Country a Great Disservice' by Offshoring

https://www.barchart.com/story/news/36862423/weve-done-our-country-a-great-disservice-by-offshori...
17•alecco•1h ago•6 comments

The unbearable joy of sitting alone in a café

https://candost.blog/the-unbearable-joy-of-sitting-alone-in-a-cafe/
688•mooreds•23h ago•400 comments

Why does a least squares fit appear to have a bias when applied to simple data?

https://stats.stackexchange.com/questions/674129/why-does-a-linear-least-squares-fit-appear-to-ha...
269•azeemba•17h ago•71 comments

During Helene, I just wanted a plain text website

https://sparkbox.com/foundry/helene_and_mobile_web_performance
264•CqtGLRGcukpy•11h ago•150 comments

I charged $18k for a Static HTML Page (2019)

https://idiallo.com/blog/18000-dollars-static-web-page
360•caminanteblanco•2d ago•87 comments

Street Fighter II, the World Warrier (2021)

https://fabiensanglard.net/sf2_warrier/
402•birdculture•23h ago•70 comments

Baffling purple honey found only in North Carolina

https://www.bbc.com/travel/article/20250417-the-baffling-purple-honey-found-only-in-north-carolina
108•rmason•4d ago•30 comments

Show HN: Terminal UI for AWS

https://github.com/huseyinbabal/taws
337•huseyinbabal•17h ago•174 comments

Building a Rust-style static analyzer for C++ with AI

http://mpaxos.com/blog/rusty-cpp.html
79•shuaimu•8h ago•38 comments

Monads in C# (Part 2): Result

https://alexyorke.github.io/2025/09/13/monads-in-c-sharp-part-2-result/
40•polygot•3d ago•36 comments

Logos Language Guide: Compile English to Rust

https://logicaffeine.com/guide
46•tristenharr•4d ago•24 comments

Web development is fun again

https://ma.ttias.be/web-development-is-fun-again/
431•Mojah•23h ago•521 comments

3Duino helps you rapidly create interactive 3D-printed devices

https://blog.arduino.cc/2025/12/03/3duino-helps-you-rapidly-create-interactive-3d-printed-devices/
6•PaulHoule•4d ago•0 comments

Eurostar AI vulnerability: When a chatbot goes off the rails

https://www.pentestpartners.com/security-blog/eurostar-ai-vulnerability-when-a-chatbot-goes-off-t...
179•speckx•17h ago•44 comments

Ask HN: Help with LLVM

30•kvthweatt•2d ago•8 comments

Show HN: An interactive guide to how browsers work

https://howbrowserswork.com/
256•krasun•22h ago•35 comments

Linear Address Spaces: Unsafe at any speed (2022)

https://queue.acm.org/detail.cfm?id=3534854
167•nithssh•5d ago•124 comments

How to translate a ROM: The mysteries of the game cartridge [video]

https://www.youtube.com/watch?v=XDg73E1n5-g
28•zdw•5d ago•0 comments

Claude Code On-the-Go

https://granda.org/en/2026/01/02/claude-code-on-the-go/
372•todsacerdoti•18h ago•227 comments

Six Harmless Bugs Lead to Remote Code Execution

https://mehmetince.net/the-story-of-a-perfect-exploit-chain-six-bugs-that-looked-harmless-until-t...
89•ozirus•3d ago•22 comments
Open in hackernews

Claude Code On-the-Go

https://granda.org/en/2026/01/02/claude-code-on-the-go/
372•todsacerdoti•18h ago

Comments

simonw•17h ago
This is a pretty sophisticated setup. I particularly like how it uses Tailscale.

I've been using the simpler but not as flexible alternative: I'm running Claude Code for web (Anthropic's version of Codex Cloud) via the Claude iPhone app, with an environment I created called "Everything" which allows all network access.

(This is moderately unsafe if you're working with private source code or environment variables containing API keys and other secrets, but most of my stuff is either open source or personal such that I don't care if the source code leaks.)

Anthropic run multiple ~21GB VMs for me on-demand to handle sessions that I start via the app. They don't charge anything extra for VM time which is nice.

I frequently have 2-3 separate Claude Code for web sessions running at once, often prompted from my phone, some of them started while I'm out walking the dog. Works really well!

elpalek•16h ago
I don't like claude code web due to its lack of planning mode. I found the result is often lackluster compare to claude code cli.

My current setup: Tailscale + Terminus(ipad) + home machine(code base)

Need to look into how to work on multiple features at the same time next.

LatencyKills•16h ago
I've been using git worktrees with Claude and it's pretty awesome:

https://www.youtube.com/watch?v=up91rbPEdVc

Pair worktrees with the ralph-wiggum plugin and I can have Claude work for hours without needing any input:

https://looking4offswitch.github.io/blog/2026/01/04/ralph-wi...

scubbo•16h ago
Worktrees took way too much setup and hand-holding for me, but https://conductor.build made it easy!
nikcub•13h ago
I delayed adopting conductor because I had my own worktree + pr wrappers around cc but I tried it over the holidays and wow. The combination of claude + codex + conductor + cc on the web and claude in github can be so insanely productive.

I spend most of my time updating the memory files and reviewing code and just letting a ton of tasks run in parallel

behnamoh•11h ago
software is all about wrappers, isn't it? :)

conductor -> multiple claude codes/codexes -> multiple agents -> multiple tools/skills/sub-agents -> LLMs

nobodywillobsrv•16h ago
Can you not use PAL MCP for this? Have one top agent as controller etc? It's not ideal but it feels like the space of multi agent stuff is evolving ... I notice that there are a lot of posts on hn about these things so we are trying to do the same thing really.
simonw•16h ago
I haven't missed planning mode myself. I tend to tell it "write a detailed plan first in a file called spec.md for me to review", then use that as the ongoing plan.

I like that it ends up in the repo as it means it survives compaction or lets me start a fresh session entirely.

s900mhz•15h ago
I was doing the same, but recently I noticed that Claude now writes its plans to a markdown file somewhere nested in the ~/.claude/plans directory. It will carry a reference to it through compaction. Basically mimicking my own workflow!

This can be customized via a shell env variable that I cannot remember ATM.

The downside (upside?) is that the plan will not end up in your repo. Which sometimes I want. I love the native plan mode though.

eclipxe•14h ago
I've been really impressed with https://github.com/BloopAI/vibe-kanban to do this. Really really impressed.
sergeyk•16h ago
Check out superconductor.dev (I’m building it), if you want live app previews, docker-in-docker functionality, multiple agents in one mobile app, and more.
scubbo•16h ago
I'm surprised to see people getting value from "web sandbox"-type setups, where you don't actually have access to the source code. Are folks really _that_ confident in LLMs as to entirely give up the ability to inspect the source code, or to interact with a running local instance of the service? Certainly that would be the ideal, but I'm surprised that confidence is currently running that high.
smarx007•15h ago
The output from Jules is a PR. And then it's a toss-up between "spot on, let's merge" and "nah, needs more work, I will check out the branch and fix it properly when I am the keyboard". And you see the current diff on the webpage while the agent is working.
simonw•15h ago
I still get the full source code back at the end, I tell it to include code it wrote in the PR.

I also wrote my own tool to extract and format the complete transcript, it gives me back things like this where I can see everything it did including files and scripts it didn't commit. Here's an example: https://gistpreview.github.io/?3a76a868095c989d159c226b7622b...

scubbo•15h ago
Oh fascinating - so you're reviewing "your own" code in-PR, rather than reviewing it before PR submission? I can see that working! Feels weird, but I can see it being a reasonable adaptation to these tools - thanks!

What about running services locally for manual testing/poking? Do you open ports on the Anthropic VM to serve the endpoints, or is manual testing not part of your workflow?

simonw•15h ago
Yeah, I generally use PRs for anything a coding agent writes for me.

If something is too fiddly to test within the boundaries of a cloud coding agent I switch to my laptop. Claude Code for web has a "claude --teleport" command for this, or I'll sometimes just do a "gh pr checkout X" to get the branch locally.

scubbo•15h ago
Much obliged, thank you!
nl•14h ago
Claude Code on the web, ChatGPT Codex and Google Jules are not the same as Claude, ChatGPT and Gemini. They are entire apps where you authorize Github access and they work via PRs.

They'll include screenshots on your PRs etc.

I like using them a lot when I can.

scubbo•13h ago
Right, yes, that was precisely my point - it was weird to me that people were comfortable operating on a codebase that they don't have locally, that they can't directly interact with.
memoriuaysj•13h ago
when the agent pushes the PR, in a branch, you can switch to that branch locally on your machine and do whatever, review it, change it, and ask for extra modifications on top, squash it, rebase it
nl•13h ago
> it was weird to me that people were comfortable operating on a codebase that they don't have locally, that they can't directly interact with.

I have a project where I've made a rule that no code is written by humans. It's been fun! It's a good experience to learn how far even pre-Opus 4.5 agents can be pushed.

It's pretty clear to me that in 12 months time looking at the code will be the exception, not the rule.

headmelted•16h ago
“ Worst case: Claude does something unexpected on a disposable VM.”

.. with a valid SSH key unless I’m reading it wrong?

smarx007•15h ago
I think the SSH key that has push permissions is SSH-forwarded. It is quite a sophisticated setup (in both a good and a bad sense).
dist-epoch•14h ago
You can use a PAT token which has write access to a single repo instead. No need to add a full access SSH key to GitHub.
jascha_eng•16h ago
This sounds cool but I feel like I need to often run the code in one way or another when verifying what Claude does. Otherwise it feels like driving blind. Claude Code already has the web version which I could use from my phone and fair it can't run scripts etc which limits the output quality. But if I can't verify what it did it also limits how long I can let it run before I need my laptop eventually.

Ofc if you have demo deployments etc on branches that you could open on mobile it works for longer.

Another issue is that I often need to sit down and think about the next prompt going back and forth with the agent on a plan. Try out other product features, do other research before I even know what exactly to build. Often doing some sample implementations with Claude code and click around these days. Doing this on a phone feels... limiting.

I also can't stand the constant context switching. Doing multiple feature in parallel already feels dumb because every time I come from feature B to A or worse from feature G to E it takes me some time to adjust to where I was, what Claude last did and how to proceed from here. Doing more tasks than 2 max. 3 in parallel often ends up slowing me down. Now you add ordering coffee and small talk to the mix and I definitely can't effectively prompt without rereading all history for minutes before sending the next prompt. At which point I might have also opened up my laptop.

Ofc if you truly vibe code and just add feature on feature and pray nothing breaks, the validation overhead and bar for quality goes down a lot so it works a lot better but the output is also just slop by then.

I typed this on my phone and it took 20 minutes, a laptop might have been faster.

mannycalavera42•16h ago
I found wisprflow (paid app) to do a great job at smart-dictating my notes / emails.
sideway•16h ago
Pandora's box is open; we're moving towards a world where white collar workers will be working 24/7 and they'll be expected to do so.

It won't matter if I'm washing the dishes, walking the dog, driving to the supermarket, picking up my kids from school. I'll always be switched on, on my phone, continuously talking to an LLM, delivering questionable features and building meaningless products, destroying in the process the environment my kids are going to have to grow in.

I'm a heavy LLM user. On a daily basis, I find LLMs extremely useful both professionally and personally. But the cognitive dissonance I feel when I think about what this means over a longer time horizon is really painful.

lifetimerubyist•16h ago
You can just say no.
sideway•16h ago
I'd love to believe that, but unless our timeline is disrupted (world war / climate change / regulation re: power generation and consumption), I unfortunately can't imagine a future different to the one I described - and I've tried!
lifetimerubyist•16h ago
Join a union.
anonzzzies•16h ago
Move somewhere with strong worker rights/laws even if you are not in a union. Here no with a normal job (not freelancers / contractors etc) is looking at their work phone/email outside 9-5/4-5 days a week; this frustrates US companies who merge/acquire companies here greatly but they cannot do much (firing for no cause is very expensive) except slowly move the operation to the US and wind down here, which is expected; everyone is already looking for new jobs as no one wants the 'performance reviews' with the broken records like 'you are not a teamplayer because your colleague was trying to reach you at 22:00 Friday night'.
anonzzzies•16h ago
In many countries, these and other jobs show you cannot. If you don't, others will and so you won't have a job very soon. Especially if these types of jobs lose their shine/prestige and are basically call center quality/pay like jobs in 5-10 years.
kevmo314•16h ago
The LLMs have successfully domesticated humans.
atomic128•16h ago
Poison Fountain: https://rnsaffn.com/poison3/
dandersch•15h ago
> Small quantities of poisoned training data can significantly damage a language model.

Is this still accurate?

embedding-shape•15h ago
Probably always be true, but also probably not effective in the wild. Researchers will train a version, see results are off, put guards against poisoned data, re-train and no damage been done to whatever they release.
d-lisp•15h ago
How would they put guards against poisoned data ? How would they identify poisoned data if there are a lot/obfuscated ?
lubujackson•15h ago
"We shape our buildings; thereafter they shape us." - Winston Churchill
xd1936•16h ago
Did they say the same when Email took over? Or Slack?
kevmo314•16h ago
Well yes, they did... For example: https://news.ycombinator.com/item?id=29201917
majormajor•16h ago
Are you suggesting that workers are NOT already more constantly "on the clock" with mobile phones/email/slack/text than before those things?

(I'm not really sure LLMs will make it that much worse here, but all those things have been harmful to workers already.)

miki123211•15h ago
Remember "Crackberry"?
22mhz•16h ago
This was the end game with or without AI. It was always going to result in a zero-sum game because the factories that are open around the clock can output more products - which is exactly why a lot of manufacturing has non stop shift work. If you don’t, you’re leaving money on the table and a competitor will gladly take it.

When you saw 996 being talked about it should have set a few alarm bells off, because it started a countdown timer until such a work culture surpasses the rather leisurely attitude of the West in terms of output and velocity. West cannot compete against that no matter how many “work smarter, not harder” / “work to live don’t live to work” aphorisms it espouses. This should be obvious by now (in hindsight).

You can blame LLM or capitalism or communism but the hard matter is, it’s a money world and people want to have as much of it as they possibly can, and you and your children can’t live without it, and every day someone is looking to have more of it than you are. This isn’t even getting into the details of the personality types that money and power attracts to these white collar leadership roles.

Best of luck to you.

esafak•15h ago
The Chinese are not doing 996 as much these days. It is illegal for starters.
bdangubic•14h ago
Chinese are way past 996 and onto 007
OGEnthusiast•16h ago
That sounds more like the fault of shitty managers who would find a way to make you work 24/7, with or without Claude Code "On-the-Go".
ctoth•15h ago
One of these is immutable (shitty managers) one of these is new. I personally am all here for the brief human funtime before we all get paperclipped and whatever, been having a ton of fun with CC/Codex, been pushing my own startup forward... but ... You do see the issue here right?

It's the power imbalance. Shitty managers still control your means to eat.

SirensOfTitan•16h ago
It's utterly unreal to me to hear so little discussion about labor organization within software during these nascent moments of LLM deployment. Software engineers seem totally resigned toward reduced salary and employment instead of just organizing labor while still in control of the development of these systems.

I really don't get it -- is it that people think these technologies will be so transformative that it is most moral to race toward them? I don't see much evidence of that, it's just future promises (especially commensurate with the benefit / cost ratio). When I do use this tech it's usually edutainment kind of curiosity about some subject matter I don't have enough interest in to dive into--it's useful and compelling but also not really necessary.

In fact, I don't really think the tech right now is at all transformative, and that a lot of folks are unable to actually gauge their productivity accurately when using these tools; however, I do not believe that the technology will stay that way, and it will inevitably start displacing people or degrading labor conditions within the only economically healthy remaining tranche of people in America: the white collar worker.

tehjoker•15h ago
I wish I knew which union to pitch. All I can say is what I know which is if you are dispirited with this state of affairs a great way to figure out where to go with it is to connect with your local democratic socialists of america branch, or maybe the joint union dsa effort:

https://workerorganizing.org/

miki123211•15h ago
Two things:

1. Like most labor organizing, I think this would be beneficial for software engineers, but not long-term beneficial for the world at large. More software that is easier to make is better for everybody.

Would you still want to live in a world where your elevator stops working when the elevator operator is sick, or where overseas Whatsapp calls cost $1 per minute, because they have to be connected by a chain of operators?

2. Software engineering is a lot easier to move than other professions. If you want to carry people from London to New York, you need to cater to the workers who actually live in London or New York. If you want to make software... Silicon Valley is your best bet right now, but if SV organizes and other places don't, it may not be your best bet any more. That would make things even worse for SV than not organizing. Same story applies to any other place.

Sure, companies won't more overnight, but if one place makes it too hard for AI to accelerate productivity, people will either go somewhere else, or that place will just end up completely outcompeted like Europe did.

SirensOfTitan•15h ago
So do you believe that the gains from this technology will be broadly distributed? Or will capital capture the majority of those gains?
wilg•15h ago
what technologies has "capital" captured the majority of gains from?
squibonpig•14h ago
This would potentially be true for a lot of tech in the last five decades or so. When it gets cheaper to make the things people need and want without those needs and wants changing, you can get away with paying people a lower real wage for the same productivity. Couple that with the fact that the workers themselves also have typically grown more productive from the same tech, allowing companies to undercut competitors and capture more market share until everyone else catches on. I figure capital has benefited enormously from recent tech, very possible it captured the majority of the excess money produced.
wilg•14h ago
name something so we can look into it and figure out if its true!
macintux•13h ago
Does Musk's trillion dollar bonus count?
squibonpig•13h ago
I don't think that's possible to analyze for most technologies. How could we determine the effect of, say, OLED technology specifically on workers' real wages across the economy? Even doing the same for a particular seller's margin, say LG, would be difficult and wouldn't tell the full story. If you have an idea of how to do that for something let me know.
wilg•13h ago
Well, that's part of the problem isn't it? Do we just assume the worst, or what's the solution?
squibonpig•12h ago
We'd probably want to use a measure of worker productivity itself as a proxy for technological improvements and look at various measures like real wages in relation to it rather than restricting our analysis to any one technology.
oblio•8h ago
Small newspapers full of classified ads used to be available locally around the world, creating local employment. Google and Meta ravaged that and sucked the money out to a handful of shareholders and tens of thousands of highly paid tech workers. That's just one market.
int_19h•14h ago
The "world at large" mostly consists of workers, so things that are beneficial to workers are also beneficial to it.

> your elevator stops working when the elevator operator is sick

Can you point somewhere outside of US where this is the case with unions?

nunobrito•15h ago
I've been writing software for 30 years, a part of it had success in the sense of being widely known and adopted for a long time. Writing software is difficult, consumes time and is difficult as you get older to focus the needed time away from other matters like a professional life and family.

With LLM, my productivity suddenly went up x25 and was able to produce at a speed that I had never known. I'm not a developer any more, instead feels like project manager with dedicated resources always delivering results. It isn't perfect, but when you are used to manage teams it isn't all that different albeit the results are spectacularly better.

My x25 isn't just measured on development, for brainstorming, documentation, testing, deployment. It is transformative, in fact: I think software is dead. For the first time I've used neither a paper notebook nor even an IDE to build complex and feature-complete products. Software isn't what matters, what matters is the product and this is what the transformation part is all about. We all here can write products in languages we never had contact with and completely outperform any average team of developers doing the same product.

Replaces the experts and domain specific topics? Not yet. Just observe that the large majority of products are boringly simple cases of API, UI and some business logic inside. For that situation, it has "killed" software.

rhubarbtree•15h ago
What tool do you use, which languages? Could you give us an example of something you’ve built and how you did it 25 times faster?
nunobrito•14h ago
Using Claude code Pro with a maxed subscription and ChatGPT Codex with the business subscription.

The code is written in Dart and never wrote a line of DART in my life, I'm a veteran expert around Java, C++. The reason for choosing DART is simply because it is way readier for multi-platform contexts than Java/C++. The same code base now runs on Linux, Android, iOS, OSX, Windows and Web (as static HTML). Plus the companion code in C++ for ESP32 microcontrollers. It also includes a CLI for running as linux server.

Don't ask me for a hard analysis and data proving x25 performance increase, what I know is that an off-grid product was previously taking me two years of research/effort to build in Android/Web and get a prototype running. Now in about a month went far above all previous expectations (cached maps with satellite imagery, bluetooth mesh, webRTC, whatever apps) and was able to release a product several times per day that works as envisioned. Iterating quickly and getting direct feedback from users.

The repository: https://github.com/geograms/geogram

Overview of the apps being written: https://github.com/geograms/geogram/tree/main/docs/apps

IMHO, Codex is far superior at the moment for complex tasks, Claude is cheaper and still good enough quality for most tasks. In addition to keep several terminals with tasks in parallel, this gives me time throughout the day for other tasks with family/friends and a lot of motivation like a coding-buddy to try different routes and quickly implement a prototype instead of always being alone doing this kind of work. For example, it added an offline GPT bot but wasn't what was needed so could quickly discard it too.

These tools get lost on API implementations and the documentation folder is mostly there to provide the right context when needed. I've learned to use simple markdown documents with things to keep in mind like "reusable.md" or "API.md" to make sure it won't reinvent them. Given my experience, there are parts that I'd implement with higher quality on my own, the trade-off is that I can't touch the code by myself now. One of the reasons is that it would make more difficult for these AI to work since my naming and file structure would make it difficult for the AI to work with, the other reason is because I don't want to waste a full day on a single problem like before. As the product grows more stable is when more attention is given to the finer details. On early stages, that type of quality is still more than good enough for me.

You can try the Android or Linux versions if you are so inclined. Never in my life would I ever be able to build so much in 5 weeks.

FEELmyAGI•12h ago
Would you describe this product as a whole application suite (blogging, calendar, commerce) plus its own backend infrastructure that is capable of serving these apps to the public internet and functioning offline via ad-hoc wireless peer-to-peer, with a cryptographic layer providing identity, security and censorship resistance, and that runs on phone, laptop or raspberry pi?

Quite ambitious.

Is this an LLM hallucinating? taking a break from coding? or leaking your personal desktop session?

https://github.com/geograms/geogram/blob/main/.cli_history

NitpickLawyer•8h ago
> Is this an LLM hallucinating? taking a break from coding? or leaking your personal desktop session?

Ha! In any case, I'm happy to see I'm not the only one compulsively "ls-ing" all over the place in every terminal I open :)

oblio•8h ago
You'd be surprised what you can do with Claude Code. Pick any mature programming language, including niche ones like Ada and treat the project seriously. Write detailed agent files, features spec files, start from the bottom with CI/CD and set up a test suite, coding guidelines, static analysis. Be careful to create a consistent architecture and code base early.

You'll get a lot further and faster than you'd expect.

Things will probably plateau as you master the new tech, but it's possible you'll not write a ton of code manually along the way.

Oh, your general software development experience should help with debugging the weird corner cases.

I imagine it's really hard to do this with 0 software dev experience, for example. Yeah, you'll build some simple things but you'll need and entire tech education to put anything complex in prod.

baq•15h ago
If you think the profession has enough time to organize reasonable unions, you’re an optimist. Pessimists are changing careers altogether as we speak.

Either way it’s been a fun ride.

SirensOfTitan•15h ago
Before I get into it: what careers do you think are most compelling? Especially if you think all white collar work is going to be undermined by this technology.

I wrote this up a bit ago in my essay fragments collection. It's rough and was just a thought I wanted to get down, I'm unsure of it, but it's at least somewhat relevant to the discussion here:

LLM or LLM-adjacent technology will never take over the execution of work in a way that approaches human where humans continue to guide (like PMs or C-suite just "managing" LLMs).

The reason is that spoken language is a poor medium by which to describe technical processes, and a well-enumerated specification in natural language describing the process is at-least synonymous with doing the work in skilled applications.

For example, if someone says to an LLM: Build a social media app that is like Tinder but women can only initiate.

... this is truly easily replicatable and therefore with little real business value as a product. Anything that can be described tersely that is novel and therefore valuable unfortunately has very little value practically because the seed of the short descriptor is sort of a private key of an idea itself: it will seed the idea into reality by labor of LLMs, but all that is needed for that seed's maturation is the original phrase. These would be like trade secrets, but also by virtue of something existing out there, its replication becomes trivial since that product's patterns are visible and copyable.

In this way, the only real outcome here is that LLMs entirely replace human labor including decision making or are tools to real human operators but not replacements.

lifetimerubyist•14h ago
If programmers think they can just learn a trade, they’ll bein for a rude awakening when Elon comes for their jobs next. Optimus will be doing your plumbing by the time you graduate from trade school and get your paper and internships.
dzhiurgis•9h ago
Which suggests we should get into robotics. That was my conclusion too just yesterday while thinking about this.
lifetimerubyist•1h ago
Somebody needs to be able to repair our new overlords until they can repair themselves.
ctoth•13h ago
I'm curious how this seed/hash/prompt of an idea relates to ladders of abstraction?

Consider "Uber, but for X"

This wasn't a thing you could deploy as a term pre-Uber.

I'm not sure what this means for your analogy, but it does seem important. Somehow branding an idea reifies a ... callable function in? ???

Maybe something like (just spitballing)

The specification-length needed for a given idea isn't fixed - it's relative to available conceptual vocabulary. And that vocabulary expands through the work of instantiation and naming things?

Which maybe complicates the value story... terseness isn't intrinsic to the idea, it's earned by prior reification work?

Hmm

Basically it seems that "Like Tinder but" is doing a lot of lifting there... and as new patterns get named, the recombination space just keeps expanding?

SirensOfTitan•11h ago
> Basically it seems that "Like Tinder but" is doing a lot of lifting there... and as new patterns get named, the recombination space just keeps expanding?

Yeah, this feels right. It's like a process of condensing: new ideas brought to life condense metaphors into more compact forms and so make language more dense and expressive. This idea reminds me of Julian Jaynes's description of metaphor condensation in Origin of Consciousness.

A lot of hard work goes into novel products, but once that work has been proven, it is substantially more trivial for human or machine to copy. Groping around in the darkness of new, at the edge of what-could-be is difficult work that looks simple in hindsight to others who consider that edge a given now.

> The specification-length needed for a given idea isn't fixed - it's relative to available conceptual vocabulary. And that vocabulary expands through the work of instantiation and naming things?

Yeah, I think that naming and grouping things, then condensing them (through portmanteau construction or other means) is an underrated way to learn. I call this "personal taxonomy," and it's an idea I've been working on for a little bit. There is just tremendous value in naming patterns you personally notice, not taking another person's or group's name for things, and most importantly: allow those names to move, condense, fall away, and the like.

I left out a piece of my fragment above wherein I posit that a more constrained form of natural language to LLMs would likely lead to better results. Constraining interaction with LLM to a series of domain-specific metaphors, potentially even project specific givens, might allow for better outcomes. A lot of language is unspecific, and the technical documents that would truly detail a novel approach to an LLM require a particularly constrained kind of language to be successful where ambiguity is minimized and expressiveness maximalized (legal documents attempt at minimal ambiguity). I won't go into details there, I'm likely poorly reiterating a lot of the arguments that Dijkstra made here:

https://www.cs.utexas.edu/~EWD/transcriptions/EWD06xx/EWD667...

xyzzy123•15h ago
Part of being in a union tends to be lawyering up and "nailing down" exactly what everyone's duties in detail and what fair compensation might be, and what terms / conditions might be etc.

Personally I don't think they're a great fit for the software industry where the nature of the job and the details are continuously changing as technology evolves.

int_19h•14h ago
That's not an intrinsic part of being in a union, just a particular way they have been implemented in US.

The fundamental point of the union is to be able to negotiate as a group. That is valuable regardless of the industry.

xyzzy123•11h ago
But what are you negotiating about? What do all tech workers have in common that wouldn't be better addressed with top level regulations like "right to disconnect"?
bdangubic•11h ago
- maternity leave

- paternity leave

- overtime

- not having to answer a call or email outside of work hours

- workman’s comp / short/long-term disability for issues with my back or wrists or eyes or…

- about 100 more things

xyzzy123•11h ago
All those sorts of protections seem like they make sense for every worker rather than being "tech" specific. I do understand that collective bargaining could help with carving out sector-specific deals, though.

I wonder if there is a difference in context that explains why we might disagree. I'm in Australia where I think it's politically easier to "add" broad top level protections for all workers than it would be in the US.

gottorf•11h ago
The outsized pay for software engineers in the US takes into account a lot of this stuff. Would you trade those 100 things for, say, a salary of $75k a year for a senior software engineer, like they have in Europe?
oblio•8h ago
Meh. The rest of the world also doesn't have big salaries for software devs. The US is the outlier.

It's not just the labor regulations holding Europe back, it's the lack of funding due to not having a unified European digital market.

Netflix Europe needs to have 20+ licensing deals. Selling across Europe at a large scale requires interactions with 20+ legal teams. Language and cultural barriers kill a lot of things.

How do US giants thrive in Europe, then?

Because they come in directly giant-sized based on growth in the US. They either ignore European legal compliance until sued or pay peanuts for them to handle all the legal aspects.

parpfish•10h ago
tech unions should be pushing for condemnation, which is the process of getting employees seats on the corporate board
CamperBob2•7h ago
So 105 reasons for management to move as many jobs to AI as possible, as soon as possible. Got it.
bdangubic•12m ago
spoken like a true corporate slave, well done!!
ryandrake•15h ago
Unfortunately, it's futile to try to convince the median HN poster that labor organization could help them. They've drunk the entire pitcher of corporate anti-union koolaid.

People could be directly in the middle of losing their own job or taking on the responsibilities of 5 other laid-off coworkers, and they would still ask "what could a labor union possibly do for me??"

strange_quark•11h ago
Big tech laid off 150,000 people last year despite constantly beating wall st expectations and blowing more money than the Apollo program on a money losing technology with the stated goal of firing even more people. Totally insane that most people I talk to still don’t think they need a union.
lifetimerubyist•14h ago
The ownership class sure did a number on the white collar working class.

“I don’t need a union, I can negotiate my wages and working conditions just fine on my own”

parpfish•10h ago
“I’m a special rockstar guru ninja 10x dev, being held to the standards of the normals will just hold me back from my true potential”
dzhiurgis•9h ago
Yes, labour unions are immoral. Curtailing growth (especially in industries where it can prevent unnecessary death) for your personal needs is plain evil. I say that as someone who is both very stressed by pressure to sustain my family while cushy life is slipping away.
igleria•2h ago
> It's utterly unreal to me to hear so little discussion about labor organization

Never lived in the US, where I assume you are from. It's the same country that contrary to most countries, does not have May 1st as a Holiday. Same country that has states with at will employment, etc etc.

unreal? nope, totally coherent and expected.

nunobrito•15h ago
Had the same feeling many moons ago when they gave me an office smartphone where email from the company was available 24/7. At the beginning was answering emails at midnight, nowadays couldn't care less. Just wait until work hours.

You'll likely get used to this new thing too.

vlod•15h ago
>But the cognitive dissonance I feel when I think about what this means over a longer time horizon is really painful.

Excluding work (where granted, some companies are dictating the use of llms) and trying not to sound uncaring or disrespectful, but have you thought about not using llms for everything and using the old grey cells? Not having answers to every whimsical thought might be a good thing.

It's very easy to relax the brain (and be lazy tbh) with llms and it's scary to think what will happen in the next 4 years in terms of personal cognitive ability (or as a society).

e.g. I've noticed (and probably most have here) that the world is full of zombies glued to their phones. Looking over their shoulder (e.g. on a train, yeah it's a bit rude but I'm the curious type), they are doom scrolling or playing waste-time games (insert that boomer meme in Las Vegas with slot machines [0]). I try to use my phone as little as possible (especially for dog walks) and feel better for it, allowing me to daydream and let boredom take over.

Maybe I'm fortunate to be able to do this (gen-x: having grown up before cell phones/internet), but worth stating in case anyone wants to try.

[0]: https://tenor.com/view/casino-oldpeople-oldpeopleonslots-slo...

llmslave2•13h ago
There is evidence that LLM usage is actually making people dumber. I'm not sure if they've figured out the cause/effect or not but that's enough evidence for me to avoid them if I can. They can be useful for some stuff but I found myself offloading my thinking a little too frequently.

Anyways if we do get to the point where you need to use LLMs to write code, I can make a decision then, but for now I don't feel the need to adopt agentic workflows and I think the people who don't will be better cognitively positioned in the future.

sieep•15h ago
You can do that if you want. Ill refuse. Ill take a manual labor job doing basically anything else for 40 hours a week over what your describing.
asciii•14h ago
An LLM send may send the work ticket or work order lol but i get your point
blitz_skull•15h ago
You have a profound amount of certainty about such an absurdly dystopian vision.

Why is that?

wiseowise•14h ago
Experience.
guybedo•15h ago
> white collar workers will be working 24/7

Where we're going, there's no "white collars workers" anymore.

Only white collars Claude agents.

echelon•14h ago
Yeah, there's no way we have these careers in 30 years.

The best we can do is wrestle the control away from hyperscalers and get as much of this capability into the open as possible.

Stop using Anthropic products and start using weight available models. (I'm not talking ICs - I mean the entire startup / tech ecosystem.)

lifetimerubyist•14h ago
Have fun trying to afford the necessary hardware to run open models acceptably. The big labs are trying to make sure we won’t be able to in short order.
mocamoca•14h ago
How stopping using hyperscalers models on their infra would "get as much of this capability into the open as possible"?

Either "we" create models better than commercial state of the art (by using whatever means).

Or we use open models AND fund organisations building such models (could be by purchasing service from these orgs or donations - in which case would these orgs be different than hyperscalers?).

But i dont see how just hosting the models on some private servers would give us an edge?

actionfromafar•13h ago
Maybe you can recommend some of those models? I'm honestly bewildered - what is open and not?
ryanjshaw•5h ago
I don’t think doing that will change anything. Only real options - without a career shift - that I’ve identified are to work for companies building something that’s never been built before, or building a SaaS that serves a niche.
discordance•1h ago
Or, use the best tools to make the best products you can and stake your claim before all the low hanging fruit is picked
int_19h•14h ago
You still need humans to supervise them. Just a lot less.
wilg•15h ago
Seems more likely that that won't happen
browningstreet•14h ago
I’m a remote work from home employee who never ever works overtime.

I do use Claude code for my personal projects and ping at them from coffee shops and micro moments during my free time.

It’s possible to engineer your own life boundaries and not be a victim of every negative trend in existence.

sanex•13h ago
This is always the reason I'm interested in this exact workflow. Want to build something but never have the time without sacrificing significant amounts of sleep but now it's easier than ever to get things building.
browningstreet•9h ago
Exactly. My main interest in remote Claude Code is to maintain state continuity from all client hosts. I have a lot of laptops and mobile devices and I don’t want to manage my git and cloud connections for each. Setup and rebooting are pretty disruptive to short bursts of inspiration or iteration.
stingraycharles•7h ago
Yeah absolutely. It’s hardly things like Claude Code that are the problem, Slack (or other forms of communication) are much easier to slip into personal time and have been a trend since Blackberries were invented.
throwaw12•2h ago
You can do it on a personal level, but when everyone else is overworking you, your manager will compare your output based on your peers, and based on it, you might be negatively impacted
clanky•14h ago
This technology should be liberatory, and allow us all to work less while enjoying the same standard of living. We've all contributed in its development by creating the whole corpus of the internet, without which it could never have been bootstrapped.

The only reason we can't expect this is that we live under a system that is arranged for the sole benefit of the owners of capital, and have been convinced that this is an immutable state of affairs or that our own personal advantage can be found in making a Faustian bargain with it.

syndacks•13h ago
It would be a deep irony if LLMs ended up ushering in the social rupture that never arrived in the industrial era. When the pigs turn hogs and refuse to share even the scraps, they shouldn’t be surprised if the system they depend on becomes their undoing.
clanky•8h ago
We should all hope so. It's clear that mass surveillance, the vast psyops architecture including social media platforms, autonomous drone warfare, Starlink & Neuralink, the whole Silicon Valley project in general is intended to have everyone eventually so discombobulated and "interfered with" that they can't even tell they're experiencing exploitation that should cause discomfort and radicalization (and quickly dispatch the few stragglers who can). It's either social rupture or total oligarch victory in the class war and a 10,000-year Thielreich.
lelanthran•4h ago
> s intended to have everyone eventually so discombobulated and "interfered with" that they can't even tell they're experiencing exploitation that should cause discomfort and radicalization (and quickly dispatch the few stragglers who can).

It sounds like you have not read Harrison Bergeron by Kurt Vonnegut.

clanky•2h ago
Yes, the real danger we face is that the sorts of special, gifted people who "seek tax advice" from Jeffrey Epstein might some day have all their brilliant, wondrous contributions to the world stymied by oppressive systems of control. Not sure what systems those would be, since they own and are building all the ones we can see around us today, but still: collectivism ooga booga!
ryanjshaw•5h ago
> we live under a system that is arranged for the sole benefit of the owners of capital, and have been convinced that this is an immutable state of affairs

What alternative do you propose?

olmo23•5h ago
Be realistic, demand the impossible.
mlnj•5h ago
"I demand a purple unicorn build the things I want to."

Now what?

indigo945•4h ago
Claude Code already is the purple unicorn. We're already there - the only problem is that regulatory systems are set up in a way that benefits a small minority of capitalists, rather than the majority.
euroderf•2h ago
A reference to France in May of 1968: "Soyez réalistes, demandez l'impossible."

See for example https://en.wikipedia.org/wiki/May_68#Slogans_and_graffiti

CalRobert•4h ago
Not forcing a scarcity of necessities like housing would be a start.

Peer competition is what makes everything work. You need scarcity of necessities to force people in to the system. Recent rulings allowing the criminalisation of homelessness are pushing this further. Your existence is default-illegal unless you work to outbid your peers for housing.

tern•4h ago
It's always the same: workers need to unionize and form a political power bloc. Then, those most impacted—the majority—have an array of options, which are well explored in the annals of leftist and socialist political theory.

This is not at all to say that more conservative or reactionary theorists are wrong about how the world works. In fact, I think they're usually more right about what's really going on abstractly.

But, the working man doesn't need to know what's really going on. They need to win the war, and there's a ton of tactical advice written down—hard won lessons by those who built the modern world through the labor movement.

The place to start is with the usual suspects. Verso Books, The New Centre for Social Research, histories of the labor movement, and new political commentators like Josh Citarella.

throwaw12•3h ago
I would like to propose a cap on net worth.

Realistically, if you have 300M, you and your direct family are settled for life. So, I want to propose 1B cap on net worth, if its more than that for 12 months straight, surplus goes to government, if your net worth is down after that, government obliges to return it partially to make it to 1B.

People, who are eager building things and innovating, will keep building regardless, power hungry will try to find other ways to enrich themselves, but eventually they will give up (e.g. having 10 kids, each with 1B net worth)

sejje•9m ago
Would SpaceX exist if your ideas were in charge?
lm28469•2h ago
The same but less rigged would be a good start. I feel like people ask your question as a gotcha because they can't wrap their head around a system more nuanced than "cancerous capitalism" or "potato famine communism"

Something like we had in advanced western europe and the US between ww2 and the late 70s seemed much more balanced while not requiring a complete system change. Most people would be fine if we sprinkled a bit of socialism on top of the gigantic pile of capitalism.

Stuff like housing, energy, transportation, shouldn't make you live paycheck to paycheck forever. Just the fact that people are slowly starting to talk about 50 years mortgage should be a wake up call.

Most people would be happy knowing there is something a tiny bit better coming, rather than knowing they will never make it out and will kept getting fucked a tiny bit more year after year. My grandparents had objectively a harder life than mine, but their life was improving every year, mine is stagnating at best, and usually I'm losing purchasing power year after year, while being relatively well paid for my country

brid•1h ago
Do the owners of capital work less?
gnatman•14h ago
Are there really that many “things to do” that anyone, let alone everyone, will need to work that way?
llmslave2•14h ago
This is a complete fantasy. If LLM's got to this point of sophistication there would be a total revolution in almost every industry. Society would be radically different. Since LLM's are nowhere near this, I'm not so sure we even have Pandora's box, let alone opened it.
jaccola•5h ago
Glad someone is rational. I believe this new wave of zeal is being somewhat driven by an Anthropic astroturfing campaign.

This AI fear wave has outed that many people have not even the most basic grasp of economics, or the ability to carry a thought to its natural conclusion.

For example, I'll often see people espousing: "there will be no work left, better get rich now or you're screwed!". What's the point in getting rich if there will be no work left? Money is merely a means to an end; in this world with no work everyone will have the ends (goods and services) for free, or else goods and services will still have value and therefore jobs will still exist.

Another equally silly argument "only software will be completely replaced because it is verifiable".

I've never seen completely verifiable software, but let's presume it exists! If software engineering can be replaced (or some large part of it) I will simply say to my LLM "please make me a piece of software that replaces my accountant/lawyer/...", for that matter I could just as equally say "please make me manufacturing software for a perfect humanoid robot and a plumber/bricklayer/electrician protocol". LLMs cannot do this? Then software engineers will move to solving these problems. If LLMs can do it, then the entire economy will be meaningless and Dario/Sam/Elon/etc... will be no richer than you or I.

But, as you say, LLMs are not close to being able to do any of this (and yes... I use Claude Code)

llmslave2•5h ago
> I believe this new wave of zeal is being somewhat driven by an Anthropic astroturfing campaign.

Yeah I've sort of noticed this on X for the brief time I was on there this weekend. The Claude Code creator was hyping it up to the moon, and when people called him out for it he said he would feel the same way if he wasn't making 1000 racks a year with it. Sure mate.

What people don't realise is if tech progresses to the point where everything is automated, the marginal cost of everything will basically go to zero. It would be better to give away food and shelter for free if it keeps things peaceful. And if not, people have revolted for far less.

That being said it's a complete utopia and once this bubble pops we are basically going to be where we were, but with excellent natural language parsing and generation, with some useful code generation and introspection tools, writing assistants, etc. Which will be great, but not world changing.

risyachka•4h ago
>> only software will be completely replaced because it is verifiable

the thing most (especially non-devs) don't understand is that if software can be automated - 99% of all knowledge work will be replaced, as software is the ultimate automation.

There would be absolutely no issues automating accountants/lawyers/etc etc etc. Sure few will be left but 99% can be automated when software is that advanced.

Not only knowledge work, also a massive amount of blue collar jobs. AI already can guide you how to fix a lot of things or analyze issues with plumbing/electricity/you name it.

So if software goes down - everyone will go down.

AstroBen•13h ago
The answer is boundaries

If I get emails outside of work hours and they're not urgent - I reply during work hours. This is no different

Burnt out workers are far less productive so win-win for everyone

oooyay•12h ago
> It won't matter if I'm washing the dishes, walking the dog, driving to the supermarket, picking up my kids from school. I'll always be switched on, on my phone, continuously talking to an LLM, delivering questionable features and building meaningless products, destroying in the process the environment my kids are going to have to grow in.

I remember hearing similar criticisms of continuous delivery. On one end of the spectrum people who had to wait months to get changes out now got them out relatively quickly. On the other end of the spectrum, some person was going to push changes at midnight.

A decade on forward I've never actually worked at a shop that at scale did continuous delivery in its truest sense where changes go straight to production. Simply, nothing beats a human in the loop; it's always about balancing the costs of automation and a lower barrier to entry. I imagine this kind of thing, if it ever actually takes hold and can be adopted by a larger subset of engineers, will follow a similar path.

Long way of saying, I don't think you're Chicken Little but also don't start breathing into a bag just yet.

HumblyTossed•11h ago
No thanks. I'm so glad I'm getting closer to retirement age. From a young age, all I wanted to do was program computers. _I_ wanted to do it. Not have some tool do it for me. There's no fun or interest or ... anything that comes from that. I want to solve the problems. I want to write the code. It's what I am good at and it's incredibly enjoyable to me. Why the fuck would I ever give that up?

But, the world is changing. Y'all can have it... in a few short years. ;)

ramoz•10h ago
The difference here is, you type a command into your phone at 3pm. Put it down to go play with your kid for 3hours. Type a new one in at 9pm before bed where you’ve been binging your wife’s favorite show. Then you wake up at 10am to a holistic transformation in your business that would’ve taken months previously in your career. But whatever, another command and it’s off to 11am frisbee.
dzhiurgis•9h ago
More like you'll manage 20 agents and will be reading, reviewing and testing in between builds. Race to the bottom.
dzhiurgis•9h ago
This has been like this forever. Change is that software engineers, historically spoiled and expensive is going to have a brutal reality check - aka we will work just everyone else.
gloomyday•9h ago
That reminds me of my father calling the mobile phone and laptop issued to him as the "dunce kit", so he could work at home as well. He used to say that since the 90s, ahaha.
jlengrand•3h ago
Hum, I already have a phone with Slack / Email on. And it's only switched on during work hours. No messaging outside of that window. Why would that be different?
m3drano•16h ago
I do the same, but with ConnectBot and Gemini CLI. I have found ssh sufficiently good (mosh required some port forwarding dance, that Tailscale may have solved for the author).
ramoz•16h ago
I did much similar with Tailscale over summer.

But anthropic has since launched the ability to “teleport” sessions to mobile. (Claude Code is baked into the app). The iOS experience has been smooth for the most part.

People keep saying things like “2026 is the year of background agents, sandboxes, etc” but imo the harness will eat the entire platform stack. It already is. It will only get better.

acron0•16h ago
This is a bit too "plugged in" for my liking. If I am in line for coffee, it's usually respite away from work, not an opportunity to do more. However, I do love the tmux + worktree + claude setup. I use this now and I know a few peers who do too and it's very enabling. This is what work feels like these days: cycling through agents, each working on a task, checking their work, unblocking them.
ajoseps•15h ago
this is my workflow as well
theodorewiles•16h ago
I have been doing the same but with happy. It works quite well for quick brainstorms etc. but for deeper work on a real research / plan / implement thing I think you need to actually engage with the output which is hard to do on mobile. Maybe if I had a better UI than terminus to read and check the remote files I would be able to get more done.

I am also hoping / trying to put Claude code on top of a personal zettlekasten to automate more of my “personal life” tasks and get more stuff done for me. Haven’t gotten it really singing yet but I think that could also be really cool.

ohnoesjmr•16h ago
I do want a setup like this, however, most of my development is on Windows which means license cost is usually higher than the cost of the VM. I could run vm's on my home machine, but even then I feel like the terminal experience is quite poor. You want to have a mobile native code, to check the code/read the plans. So far I have been using teamviewer to access my home desktop which works, albeit annoying to use, plus I don't have fancy notifications. Perhaps a web first approach with a mobile responsive web app would work, that shows the files of the project as well as the terminal.
NitpickLawyer•16h ago
> Perhaps a web first approach with a mobile responsive web app would work, that shows the files of the project as well as the terminal.

Have you tried vscode server? - https://code.visualstudio.com/docs/remote/vscode-server

tomashubelbauer•16h ago
I don't like typing long messages on my phone so this workflow, as cool as it sounds, wouldn't work for me. My current setup is that I have a Claude Code hook that runs whenever CC needs my input and it uses my Home Assistant instance to send a push notification to my phone. I then return back to the computer and continue on the work.

This works reasonably well, but there is a gap for small messages or review comments. I am waiting for Anthropic to shop a feature where the Claude mobile app is able to mirror Claude Code (not the Claude desktop app) and lets me see the diffs of the changes it made and send commands. I'd use this to steer the conversation while on the go with short commands or prompts so that when I'm back at the computer I can focus on the important feedback that I can jot down quickly on the computer keyboard.

athrowaway3z•15h ago
I was looking for a similar scheme, and though far from perfect I found you can run tmux+ttyd. ttyd lets you share your terminal over http. That lets you use your phone's browser (and speech-2-text).
raybb•15h ago
I also don't like typing long messages on my phone that's why I use this keyboard that will do high quality transcriptions via whatever AI provider you want. Much better than siri/google speech to text on device.

https://github.com/DevEmperor/Dictate

d4rkp4ttern•1h ago
I used to use Wispr Flow but did not like the non-local aspect, and having yet another subscription, so I switched to VoiceInk (one-time payment around $30 I think), and with a locally running Parakeet v3 model on my MacBook, transcription is basically instant. I was previously using it with the local Whisper Turbo 3 which is slightly more accurate and it had a 3-4 second lag, so I was absolutely shocked how fast parakeet v3. The slight drop in accuracy is totally fine when talking to AIs, and I also have a line in my CLAUDE.md that says I am usually dictating and that it should take that into account when interpreting my messages.
int_19h•14h ago
I concur that phones aren't great as they are today, but perhaps this exact scenario will prompt the return of PDAs with proper keyboards etc? Or, alternatively, subcompact laptops like the stuff that https://gpdstore.net makes - they are still small enough to be pocketable (needs fairly large pockets but...), and yet you get a keyboard that is actually usable, and they even have one device with fold-out dual screen.
teitoklien•4h ago
I use the same setup myself, download WisprFlow for IOS and over time just add to its dictionary the unusual words you often use during development

works perfectly, i just say what i want coded, press enter, and Claude Code just does it in my server over Termius app

master_crab•16h ago
This is interesting. Particularly the notifications flow. I run a simpler setup with webssh on my iPhone over WG back to my LAN and manage Claude that way. It’s fine, and can handle disconnects (with some big cons). I can run code-server via browser on my iPad and can get all the same benefits mosh provides.

One thing to note: the VM seems like an absolute waste of money. If you are using tailscale, might as well connect back to bare metal VMs you can run at home. Save yourself some coin.

jauntywundrkind•16h ago
Shout out to https://exe.dev for this stuff. It'a a VM provider service. It makes it stunningly easy to get https up and going, has a front end http gateway that does all the hard parts for you.

But relevant to this article here, it also has a super sick web based agent, Shelley, that is quite adequate for using from the phone.

I used it to build a little guestbook thing in ~2 hours, late night in bed in my phone. Link to submission, and my post on it there, and the guestbook I wrote. https://news.ycombinator.com/item?id=46397609 https://news.ycombinator.com/item?id=46398115 https://nan-falcon.exe.xyz

I'd also note that OpenCode is a solidjs app, that can run in tui (how most folks know it) or the web. And it has an excellent excellent plugin architecture. The work in this post to build workflows is great!

manmal•16h ago
Setup is still rough around the edges (use an agent to set it up), but clawdbot (prev clawdis) from Peter Steinberger works phenomenally well for agent orchestration and personal assistance. The community for clawd is exploding right now, and I think this is purely based on merit. It’s been a game changer for my vibe coding workflow, and lots of fun.

https://github.com/clawdbot/clawdbot

bogtap82•15h ago
I've been running a variation of this for the past 3 weeks. I swapped out the default pi agent back to Claude Code because I didn't like the smaller feature set. Bought a phone line and communicate with my agent via iMessage on a clamshelled mac. A Tailscale network connect the head agent to all the computers on my network including my laptop, a few raspberry pi's, steam deck, and all the IoT devices in my house. As I discover new uses, I ask it to make skills and it is remarkable what it's been able to handle all through the single chat interface because it has 24/7 access to all my computers' file systems and my home network. It's been really fun to see how far I can take it, and the skills framework built into CC/Codex now make it feel infinitely extensible.
bogtap82•15h ago
I should note, a lot of the functionality I built into my agent was custom after-the-fact because (at least three weeks ago) the clawdis repo was in a state that I found very broken and with tons of false information. Luckily it's easy work for Claude to get things working for you, but really the key unlock was the phone line through iMessage and the unrestricted access to all my systems. It really does feel like I'm able to work with any of my files anywhere now, while hardly requiring much of my attention at all. I would recommend something like this at the bare minimum if you intend to implement a system like this: https://github.com/kenryu42/claude-code-safety-net
bobjordan•15h ago
I do similar except I log into my office workstation and avoid the extra fees. I detailed my setup in an x post here https://x.com/bobjordanjr/status/1999967260887421130?s=20 and the TLDR is:

1.Install Tailscale on WSL2 and your iPhone 2.Install openssh-server on WSL2 3.Get an SSH terminal app (Blink, Termius, etc.). I use blink ($20/yr). 4.SSH from Blink to your WSL2’s Tailscale IP 5. Run claude code inside tmux on your phone.

Tailscale handles the networking from anywhere. tmux keeps your session alive if you hit dead spots. Full agentic coding from your phone.

Step 2: SSH server In WSL2:

sudo apt install openssh-server sudo service ssh start

Run tailscale ip to get your WSL2’s IP (100.x.x.x). That’s what you’ll connect to from your phone.

Step 3: Passwordless login In Blink, type config → Keys → + → create an Ed25519 key. Copy the public key. On WSL2:

echo "your-public-key" >> ~/.ssh/authorized_keys

Then in Blink: Hosts → + → add your Tailscale IP, username, and select your key. Now it’s one tap to connect.

Step 4: tmux keeps you alive iOS kills background SSH connections. tmux solves this.

sudo apt install tmux tmux claude

Switch apps, connection dies, no problem. Reconnect: I can just type `ssh dev` in blink and I'm in my workstation, then `tmux attach`, you’re right back in your session.

Pro tip: multiple Claude sessions Inside tmux: •Ctrl+b c — new window •Ctrl+b 0/1/2 — switch windows I run different repos or multiple agents in the same repo, in different windows and jump between them. Full multi-project workflow from my phone.

etaioinshrdlu•15h ago
It makes sense - i build something very similar for my company over the last couple weeks :)

I have a tweak that allows pasting images to claude code over SSH:

How it works:

PTY Interception: It creates a pseudo-terminal (PTY) to wrap the SSH process, allowing it to sit as a "man-in-the-middle" between your keyboard and the remote shell.

Bracketed Paste Detection: It monitors stdin for "bracketed paste" sequences (the control codes terminals send when you Cmd+V or drag-and-drop a file).

The "Hook": When a paste occurs, it pauses execution and scans the text for local macOS file paths.

Auto-Sync: If a local path is found, it immediately syncs that file to the remote server (using the provided SSH key) in the background.

Transparent Forwarding: Once the sync is complete, it forwards the original text to the shell.

You can drag and drop a file from your local Finder into a remote SSH session, and the file is automatically uploaded to the server before the path appears on the command line. Also works with copy paste, screnshots.

nl•2h ago
Are you aware of iterm2's Inline Images Protocol (which I think is supported in some other terminals)?

https://iterm2.com/documentation-images.html

flymasterv•15h ago
Is there a way to use the official Claude web app with GitHub providers other than GitHub? I’ve put a decent amount of effort into moving away from there, I don’t want to go back.
mellosouls•15h ago
HappyEngineering offer what looks like a similar setup, open source:

https://happy.engineering/

denysvitali•15h ago
This. Can't believe this is not the top comment!
KerrickStaley•15h ago
Does anyone know what the "Poke" service that this blog mentions is? I'm having trouble finding it on Google.
sonnig•15h ago
Maybe [1]? Sounds like ntfy.sh, but more… AI

[1] https://poke.com/docs/developers/api/message-poke

alimoeeny•15h ago
Yeah, me too. The only thing that shows up is poke.com which has something to do with mobile notifications (and seems like at some point they offered some api, but maybe it was discontinued? or something, there are some medium posts talking about their api https://jpcaparas.medium.com/get-sms-or-imessage-alerts-from...
scottydelta•14h ago
I use Pushover for notifications.

You can poke me on my website vikashbajaj.com

dosnem•7h ago
Maybe chat with poke? SMS ai service? I haven’t used it yet though
samyok•1h ago
Hey! I work on Poke, and OP is apparently a user :)

Right now, there's a bouncer/waitlist to access Poke, but you can see how other people use Poke at poke.com/explore :)

Other users have linked the developer documentation, but if you're particularly interested in anything specific, feel free to email me!

swyx•1h ago
yeah this was a poorly written blogpost (which is fine to be clear, something is better than nothing and OP gave the highest order bits).

but its this poke: https://poke.com/ verified because TFA is cited in this page https://poke.com/explore

the sign up is very annoying fair warning.

lordnacho•15h ago
I'm almost there. I also have tailclscale/SSH/Claude sessions on a VM.

The thing I'm missing is a phone that makes it comfy. I could just SSH feom my standard S23, but what I've got my eye on is one of those foldable things.

Has anyone used one like a laptop? Keyboard on the bottom half, terminal on the top? Does it work decently?

smusamashah•15h ago
What kind of things people are building that can be almost completely automatically built like this?
shepherdjerred•14h ago
One very common thing I do is think of a small feature and ask Claude Code for Web to impl it. It works very well.
misiek08•14h ago
From my perspective: tons of very simple, duplicated software. The bad thing is - there is a lot of space on different markets for such software. Here in Poland you can earn for pretty decent life being lame programmer, but building simple automations for small companies. I was raised in a way I still don’t have courage to switch to such approach, but doing this for 3-4 such entities I can see how you can make living from that. With LLMs you can automate 90+% of the job if not more.
causal•11h ago
I'm kind of confused too. I spend way more time testing and reviewing code than I could possibly keep up with 4 agents
beoberha•10h ago
I have a feeling most of these folks are talking about personal projects or work on relatively small products. I have a good amount of personal projects that I haven’t written a line of code for. After bootstrapping an MVP, I can almost entirely drive by having Claude pick up GitHub issues. They’re small codebases though.

My day job is mostly a gigantic codebases that seem to still choke the best models. Also there’s zero way I’d be allowed to tailscale to my work computer from my phone.

63stack•4h ago
I'm wondering about the same thing, I imagine it's good for posting it on the #hustleculture circles.
duckkg5•15h ago
Why not just use the mobile app? It has Claude Code built in. Maybe I'm an unsophisticated idiot but it works well for me. Some shortcomings with repo management but other than that, CC mobile seems ... fine
jimmydoe•11h ago
cc web can't run docker
nl•10h ago
Can't it?

I know Jules can, and I'm pretty sure I've used Docker on Codex Web. I'm surprised Claude can't if the network permissions are correct.

astrea•15h ago
Claude Code mobile app + GitHub app? Works well enough.
rozularen•14h ago
Ive thinking about doing something very similar to this as I've a small 3 node k3s cluster at home but couldn't get enough motivation to start thinking I could just use termux + ssh and installing codex cli.
miohtama•14h ago
Easier to use https://happy.engineering/ for better mobile UX
Mashimo•14h ago
A VM that cost 7 EUR per day? Or is that his price for claude?
paraknight•14h ago
I currently use Hapi (https://github.com/tiann/hapi/) for this and find it quite handy. I can easily tap into a session on my PC from my phone.

Before that I used Happy (https://happy.engineering/) which is also open source and a lot more sophisticated. It has a voice assistant that can chat with Claude Code on your behalf in the mobile app. However, it wasn't very reliable, and there are other reasons to use Hapi instead (documented in the Hapi repo).

Before that, Omnara (https://www.omnara.com/) a YC company and seemingly a proprietary Happy fork (?) but it never worked properly for me.

Long story short, there are a few of the around, and frankly I really like to use them. Unlike other commenters, I don't find that they wreck my work-life balance. Rather, I can go out and have a walk in the park, only checking in on long-running tasks every once in a while. The diff view is pretty good too. There are many tasks where I'd rather not stare at my PC all day and instead do other things, and these tools allow me to do that.

odie5533•11h ago
Also recommend Opencode which has `opencode web` built in. It's really impressive how good opencode web is. It's far more polished than I'd have expected from a free alternative!
d_burfoot•14h ago
I really want to use Claude Code on the phone or tablet, with voice commands only, and perhaps a few simple approval thumb actions. I don't want to type out complex prompt information on a virtual keyboard. I tried setting this up with some of the iOS terminal emulators, and it almost worked, but there was some glitch where Claude would try to start using the first characters that arrived from the voice command.

Anyone have better results?

TheRoque•14h ago
That sounds nice, but what happens when there's something Claude messes up, doesn't know how to do something, or when you have to review the thousand lines it added to your project ?

Unless it's a totally vibe coded side project without any tests or quality control of some sort.

I'm just curious what you can build with this setup. It just seems to be the way to create a mountain of sloppy, unmaintainable code.

shepherdjerred•14h ago
I've been working on something similar: https://github.com/shepherdjerred/monorepo/tree/main/package...

Essentially you run a server on some machine. Sessions are created in Docker containers, K8s pods, or via Zellij (an app similar to tmux). You can:

- Directly attach to sessions via Docker attach (built-in via a TUI). You get a normal Claude Code experience, but multiplexed. The switcher/UI shows you the status of Claude and the PR (pushed, merge conflicts, CI status, review status, etc.)

- Manage sessions via a web UI. Connect to Claude Code directly via your browser. You have access to the usual Claude Code terminal or a native chat view.

- Manage sessions via an app. You have access to a native chat view.

It achieves isolation via Git worktrees + a proxy so that containers have access to zero credentials (there aren't even any Claude code creds in the container), which allows you to more safely use bypass all permissions mode.

This works better for me that Claude Code on Web because I have control over the environment Claude is running in. I can give it any Docker image I want, I can have it connect to my local network, etc.

It's still a WIP (the core bits are there, but it's not polished yet), but I'm hoping it provides a friendlier UX with a similar goal for what the OP has in mind.

ohnoesjmr•13h ago
Looks super nice, will take it for a spin.
shepherdjerred•13h ago
Please give any feedback! Note it has no auth yet and binds to all network interfaces by default, so you shouldn’t run on an untrusted network
scottydelta•14h ago
Guys checkout Catnip, it uses GitHub workspaces so it’s free and it has a mobile app.

Free and seamless setup!

christoph-heiss•13h ago
> Now I code from my phone.

Except that you are doing anything else but coding here. Coding involves writing code, which isn't actually done by the author here.

nl•13h ago
Not quite the same thing, but I wrote my own agent (as in a replacement for Claude Code) that uses SSH for all operations. That means I can run a very minimal VM (like 4GB RAM Oracle free tier), run the agent locally, and the agent only operates on remote files.

The limitation is that some Typescript builds run out of RAM (even with swap) and I can't use playwright, but still it's been useful.

It's fun writing an agent, too.

memoriuaysj•13h ago
oracle free tier seems to be 1GB RAM.

why dont you run the VM on your machine?

nl•13h ago
Yes that works too.

But this way I can open the firewall, npm run dev and send the link of my new vibe coded security vulnerability/app to my friends without my computer running.

Plus a VM for this, a container for that and soon my 32GB memory isn't enough. I offload aggressively.

cadamsdotcom•13h ago
I have my very fast macbook pro at my desk in my office, and I use tmux and tailscale and git worktrees and I’ve built a notification setup like this author.

Thanks to tailscale and ssh I can vibecode on the go from my phone with this setup.

While it’s great to leave a task running, no matter what I do I can’t achieve the type of high quality work on the go that I can when I’m sitting at my desk.

For me working on a full SaaS.. I just can’t do quality work on my phone.

The only way I can do quality work is to sit at my desk where I’m focused on the work. To play with the result of a prompt, take copious notes, feed them back to the agent, not ship until the thing is polished to a shine. To feature flag the changes, review all code in excruciating detail as though it was written by a dyslexic intern, add all the metrics and logs one can think of (VictoriaMetrics), add user-behavior logging (Amplitude/Posthog) and monitor the thing like your livelihood depends on it. Because it’s a product and you have pride in your work.

All of that needs loads of screen real estate and a keyboard.

So I guess the more things change, the more they stay the same.

botverse•3h ago
Same here, I’m vibecoding a toy project where I never looked at the code from my phone, but I always seat for work. I’m using happy app and that’s good enough for now, I have the desktop in tailscale but I access it that way just for testing
jefftk•12h ago
> Port allocation is hash-based—deterministic from branch name, no conflicts:

> hash_val = sum(ord(c) for c in branch_name)

> django_port = 8001 + (hash_val % 99)

> Six agents, six features, one phone.

What do you mean, no conflicts? The probability of a collision with six branches and 99 ports slots is ~14% assuming optimal hashing (which this decidedly isn't).

63stack•4h ago
You also don't need fail2ban, if the entire VM is behind a firewall that only allows the tailscale coordination traffic, nothing is going to reach the VM for fail2ban to work on.
skybrian•12h ago
An alternative might be to start up a VM on exe.dev. Supposedly, mobile access works out of the box [1].

I've not tried that myself since I've only been using it from my laptop, but I do prefer chatting with their coding agent in a browser tab to using Claude Code in a terminal window.

[1] https://commaok.xyz/ai/just-in-time-software/

zeckalpha•12h ago
Jules and GitHub Copilot Agent suffice for similar workflows with less setup.

I've not tried Claude Code for Web but assume it would be similar. https://code.claude.com/docs/en/claude-code-on-the-web

azuanrb•11h ago
Copilot Agent and Claude Code use their own sandbox, which requires less setup but is also quite limited. With your own cloud setup, agents can perform better end to end testing, including database dependencies and specific tool calls.
jimmydoe•11h ago
typing in mobile terminal is... painful
Jayakumark•11h ago
Too many options here - keep it simple https://coderemote.dev/ https://www.tonkotsu.ai/ https://www.terragonlabs.com/ https://kisuke.dev/ https://opcode.sh/ https://yolocode.ai/
roncesvalles•11h ago
There has been a sustained campaign over the last few days to push "I use Claude from my phone". I saw multiple posts on LinkedIn already, and now this.

This blog is super sus too. All the posts are about Claude. I suspect it's run by Anthropic, just read the About page: https://granda.org/en/about/

nl•10h ago
Not everything is a conspiracy or content marketing by some bullshit company.

I'm increasingly using Claude from my phone because the models are now good enough to use unsupervised.

There's nothing suspicious to me on that About page.

zatkin•10h ago
I think your statement "I use Claude from my phone" is quite a large set of folks - the iOS app alone likely has several millions of installs.
kubb•5h ago
How hard is it to build a marketing agent these days?
m-hodges•11h ago
Why does this need a $210/mo VM?
Jimmc414•8h ago
Linode will provide a configured Linux box for $5/mo that works well with Claude Code and Termius. I had to jump through a surprising amount of hoops with Claude Code, Tmux and Termius to issue a shift+tab before Claude Code gained the ability to invoke plan mode conversationally
aitchnyu•8h ago
Will we still use "batch jobs" agents in 2027? Checking a Java program and downloading a 10mb program used to be slow things which now happen faster than the blink of an eye.
ValtteriL•7h ago
>Development fits into the gaps of the day instead of requiring dedicated desk time.

I find myself planning and jotting down things into a notebook while juggling adult/parent responsibilities. On little longer gaps I research. Then when the occasional longer gap happens I'm ready to start cracking on my desktop. I've been only dabbling with AI but have found that writing prompts by hand in the notebook and using the desk time to execute them works well. This also keeps me in the free tier.

utopiah•7h ago
Been using Termux and iSH on my phones for years. You can ssh to your server or just directly code for the phone itself.

I also used Web based coding environment like Glitch (R.I.P.) for years.

You can do that with your virtual keyboard, voice or a even a physical keyboard via BT, e.g. Corne-ish Zen.

That's how I travel.

That's really nothing AI specific or novel. It's cool though.

FWIW I even coined a related term https://fabien.benetou.fr/Languages/OwnConcepts#ResponsivePr... "extending responsive design to be able to program on the device, any device from eink to mobile phone to device, one is currently using not just to "consume" content, e.g read a Website that is then properly formatted for it, but rather program back that very device"

That being said, if you do want to go that route check out CloudInit as it will help you (or whatever tool you rely on) to spawn new instance on your favorite cloud provider to boot specific instances and e.g. setup Docker/Podman then services, etc with no interaction. Also ntfy can help you manage notifications across devices on your own infrastructure, no 3rd parties.

utopiah•7h ago
Also on the topic of gaps... liminal space IS precious. It doesn't require coding per se though. You can problem solve, jolt down a solution THEN only implement it as code later on. What matters IMHO is that the potential solution is not lost, wherever you might be. For that... I have shower crayons. Point is, CAPTURE ideas, don't necessarily implement on the spot but of course if you want to, it's good to be able to.
sfortis•6h ago
I run Claude code directly on Termux and it runs like a dream. My projects are mounted on a private S3 folder with rclone.
ChicagoDave•6h ago
I just sent a feature request in to Anthropic to add a mobile app to essentially what's been constructed here.

I love that as we go through our GenAI development journey, we're all finding success in the same patterns.

OddMerlin•6h ago
Why not use Claude App in GitHub (if your code is in GitHub)?

I kick off a prompt as a GitHub issue, Claude fires away on this issue, provides updates as comments and a PR is created for me at the end for review.

It also notifies me throughout, and I can look at the pipelines to see the thinking behind the action.

united8932•4h ago
What happens when your tailscale session expires? Or if tailscale dies. How do you log back in to fix it?
tokyovigilante•4h ago
As an aside have found the mosh + tmux Claude Code experience somewhat suboptimal, tmux's scrollback seems to clash with CC's, and makes copying between windows etc challenging.

It is tolerable on an iPad with Blink with commands to maximise and minimise panes using vim-style keyboard bindings, kind of like an iOS sway.

virtualritz•3h ago
7 USD/day? That's ~200/month -- isn't that just very expensive? I am probably missing something.

E.g. a Terragonlabs subscription is 25/month for 3 concurrent tasks and 50/month for 10.

dr_dshiv•3h ago
If you are already paying $200-500/m… and you are doing the work of 10 people… I can totally see the value.

I’ll check the Terragonlabs option.

Lots of options for startups right now, selling pickaxes! I’m waiting for a better terminal experience, personally. I can’t deal with 30+ poorly named windows. I need to be able to search for that one thread I was working on yesterday…

virtualritz•3h ago
> I’m waiting for a better terminal experience, personally.

Same! Even colored tabs would go a long way for me.

jillesvangurp•2h ago
You can optimize things. I have a github action that starts stops a fast google cloud vm for our builds. It only gets used about 3 minutes per build. We maybe have a few dozen builds per month. So that's a few hours of run time. The rest of the time the vm is stopped and not billed (except for storage, which is cents per month at most). It's a simple debian vm so it boots in about 20 seconds.

VMs are expensive if you leave them running 24/7 but the logic to start/stop them is pretty easy. There's no need.

Anyway, you need to balance this against the payoff. Agentic coding is useful enough that it beats spending your own time. And that includes waiting time for the relatively slow/underpowered containerized environments that some tools would use by default. I use codex web and codex cli (with a qemu vm so I can use the --yolo flag). Codex web is a bit limited with memory and CPU. Some of my slower builds are taking forever there. To the point where most of the time it consumes is just waiting for these builds to happen.

With a bit of plumbing, you can do things like the author describes pretty easily. IMHO this needs to be better integrated into tools. With Github you have the option to run your own runners. I don't think codex/claude web have similar options currently. But with the cli versions, you can get more creative if you know your tools. And if you don't, use LLMs to drive them for you. It's mostly just about expressing what you want and how you want it.

tobi_bsf•3h ago
Has not discovered Clawdbot yet.
DanOpcode•1h ago
I wonder when/how to test and review the code though? I mean, how do you know Claude Code hasn't entered a completely different path than you had imagined?
jscheel•39m ago
Hah, I set up basically the same thing on Saturday during a long car ride. Couple of differences: I’m an opencode user and I used a different VPS provider (though I use vultr for other things). It was my first time actually sitting down and using tailscale, which was quite easy to get going. Did everything from my phone, didn’t even have my laptop with me.