The sad part is that we haven't figured out how to distribute our resources fairly to these people even thought their services aren't required as often. Instead we just take their wages and give them to the top 0.1%
Just one funny example I saw recently: On the Amazon website, a submit button labeled “Go” in English was translated to something that translated back would be “Walking”. That’s the kind of thing that would be exceedingly unlikely to happen with a human translator.
As one of such people, I think there is a nuance to it. AI is great when you’re translating something to yourself. But when translating things for others, more caution and human judgement is needed. Espesially when translating instruction manuals, where bad wording could cause someone to injure themself.
Expected Value (Upside, given time/cost savings + Downside, given %reliability).
So, every task falls under a spectrum
> Ah, you can’t fire me, I’m self-employed!
I don't understand thinking like this. I think companies can certainly fire their contractors.
For example, I just read the Lawrence Ellsworth translation of The Three Musketeers, which I very thoroughly enjoyed. I don't speak or read French, but from my understanding Ellsworth's translation is considered one of the more accurate translations of the work.
Out of curiosity, I sic'd Claude Fable on the original French version of The Three Musketeers and told it to translate accurately, but also try and keep the same jovial tone as the original and do not censor anything. After it was done, I didn't read the entire output, but I did compare a few individual chapters between the Ellsworth translation and the Fable translation.
They were honestly remarkably similar. As far as I could tell, nothing was substantially different from the Ellsworth translation and the Fable translation. I do think that the prose for the Ellsworth translation was a bit better, but the prose for the Fable one was actually perfectly readable. Again, I don't speak French so I cannot say for sure, but I do not believe that I would have gotten a significantly different experience had I read the Fable version instead of the Ellsworth version.
Now, it's possible (and likely) that this is somewhat self-fulfilling; Fable might have been trained using Ellsworth's translation and as such it's very directly able to crib from it; sadly since I do not speak any language outside of English, there's sort of a catch-22: the only way I can compare the accuracy of a translation is to compare against other translations, but if other translations exist then that will likely influence the results, and if a translation doesn't already exist then I have no way of auditing it.
I'm still going to continue reading through Ellsworth's translations for the subsequent stories simply because that feels more canonical, and as I said I do think the prose was a bit better.
And here I am, brain the size of a galaxy, and I fumble my way through every language I speak other than English.
Serious respect for the linguists.
Update: in case it’s not obvious, I am sorry. I could not help it.
I have also taken to being sloppier in my prose, as I’ve had stories rejected for being “written by AI” - when they’re shorts I wrote more than a decade ago. Reworked them to sound like a moron, accepted. Sigh.
> “Oh, I can’t! It’s really not reliable enough.”
Gell-Mann Amnesia strikes again.
Specifically: LLMs make it really easy to misunderestimate the complexity of fields other than your own. (You can see this with a lot of vibecoded projects, for example – once they hit the wall of complexity, they stall out or start finding ugly patches for fundamental design issues, etc.)
I don't think this sort of cultural change will happen short-term, though.
I still love the tool, but remain as convinced as ever that AGI does not lie at the end of this particular path.
In my experience this is a real problem. Just yesterday I asked my LLM to create a piece of software that could help me build an 'ambilight-like experience' through my home assistant. It did something that seems to work as I expected, but there is a lot of theory that I just brushed past. It would be pretty easy for me to assume that I would be able to replicate this feature from scratch 'now that I understand the problem'.
Every critique of AI assumes to some degree that contemporary implementations will not, or cannot, be improved upon.
Lemma: any statement about AI which uses the word "never" to preclude some feature from future realization is false.
Lemma: contemporary implementations have already improved; they're just unevenly distributed.
Translation is a gigantic boon for business, but just as important for human connection, for culture, science, art, and entertainment. The value of automatic and cheap translation between all languages, this tower of Babylon, is immeasurable.
Human translators will always be better than any AI at their job. But they don't have unlimited time and energy, and they aren't cheap. AI makes good to great translations available to everybody.
1. AI is a great boon for all tasks and specialties we don’t have the skills to do ourselves. Understandable, since (A) we’re ill equipped to see the flaws in its output because it isn’t our area of expertise, and (B) it often can unlock great gains because if we trust it, we then don’t have to pay and wait for humans to do that thing.
2. AI is a terrible replacement for me - my skills are at such a high level that it’s almost theoretical that it’ll ever be good enough to replace me for 90% of what I get paid to do. It’s a tool at best.
This is why I use AI for all my medical questions and doctors use AI to write software, and we both smirk at the quality the other person is getting from it.
Every month a new guy discovers LLMs; discovers a skill the current LLMs require to get good results; and writes about the future jobs that will always be available for smart people like HIM, that are SKILLED in using LLMs.
The next generation of AIs doesn't need his fancy prompt. The image model goes from needing to type in just the right set of weird words and cryptic sorcerous invocations, to most people being able to type in English what they want and get a pretty good result.
There are still tasks that require careful invocation. But they are a much smaller fraction of all the tasks people are trying to do, or you can get a bleh result without the elaborate invocation to get it really good. And to improve on the bleh result you need to be substantially more of an expert than back when the Guy was memorizing a rule about adding "trending on Artstation" to the image prompts, as would always require a human paid to do that.
Another generation of AIs comes out. The next generation of Clever Skills is obsolete. Image models just obey the instructions for compositing panels without mixing them up, and you don't need to be an expert to get them to do it right. Another human value-add is gone. A wider set of tasks require no human expert.
Now a new Guy notices LLMs have become useful in his field for the first time. He discovers they require SKILL to use CORRECTLY. He posts about how there will always be jobs for humans who are SKILLED in using LLMs like HIM.
But it is not an infinite cycle. It is not the same each time it repeats. Now the Guy is a highly paid programmer or a career mathematician in 2026, instead of a graphic artist in 2023.
In six months the models will no longer require his vaunted Skills.
And by then there will be another Guy.
But the process doesn't continue forever. The Guys are coming from fields that were harder and harder for AIs. The brief centaur eras are shorter and shorter.
Today it is writers who are laughing at how bad the LLMs are at their job, and who will perhaps soon be posting about how it takes Skill to get an LLM to do their job Correctly. But the models are coming faster, and the eras of kinds of human value-add in each field are shortening.
There is a point when you run out of Guys, either because the centaur eras are too short for people to develop SKILLs and post to Twitter about them; or because there are not lands left for AIs to conquer; or because ordinary people are not reassured by some Nobel laureate proclaiming there will always be jobs for Nobel laureates with the SKILLS to prompt robotized biology labs Correctly.
But we'll never run out of amateur economists who assert entirely without a brief contemporary example that there will always be jobs for humans skilled at operating AIs!
We'll run out of professional economists saying it when nobody is paid for that work anymore.
I guess we'll also run out of amateur economists when they're dead.
A list of "Examples AI will silently fail at" would be a lot more interesting, and might just convince your next potential client to _not_ use AI.
AI isn’t replacing me. Like a toddler, it
needs to be constantly coached.
Like a toddler, it will grow up.Humans are really bad at noticing trajectories. They see the current situation. They know what the situation was 5 years ago. But for some reason they do not believe that there is a trajectory. They view the present state as the final destination.
I can confidently say that LLMs do a better job than the average traditionally published fictions in my country, at least when the original works are in English. Every single time I watch a subbed movie there will be some lines noticeably wrong.
It can be reasonably argued that some poetry can be impossible to translate from some languages to others. A poem might be explained, but by a lenghty, dissecting explanation, that completely loses the point of it.
https://www.reddit.com/r/funny/comments/3e786n/chinese_hair_...
On the other hand, a lot of people become extremely put off by the smallest sign of ai slop. And the llms have a tendency to impart their style to any text they touch.
But even the good engineers should likely be a little worried.
At least that's how it was for me, maybe other peoples' careers are different.
Would you trust living in a high rise designed by AI?
Designing a system that survives production is the job.
This reminds me of the adage, that ChatGPT is really great at everything except my own work.
I suspect if I knew another language I would be able to find errors in the translation.
Crucially the full translation was part of ChatGPT’s training set. Recall is a pretty solved problem in machine learning.
How well does it translate a French novel published yesterday? Where neither the original novel nor any translations are in the training set yet? Or might not even exist!
I tried asking ChatGPT to translate a letter I wrote in Slovenian this weekend. It got the general gist but missed a lot of the nuance. Completely missed several of the little touches of tone where the right choice of synonym conveys a whole bunch of information.
Glad we agree :)
But yeah, I broadly do agree; if I read other languages I could find a book that hadn't been thoroughly translated to English and then I could give a proper analysis on how good the translation is, but since I'm a very stereotypical American I know exactly one language (and sometimes my comprehension of even that is questionable).
This isn’t a great test, because Claude almost certainly has multiple translations of The Three Musketeers in its training data.
I still think there are better tests you could do. Ideally, you would choose a book that was published recently—after the model’s cut-off date—which is considered to be a good translation. But even something like The Girl With the Dragon Tattoo, which is not particularly new and by no means obscure, would be better than a famous work of literature like The Three Musketeers that has many translations.
Even if Fable didn't have Ellsworth's translation, it certainly has the William Barrow translation, which would still get it like 80+% of the way there.
My wife speaks Spanish, I should get her to do some kind of comparison with a Spanish book that doesn't have English translations.
The number of lies, lies by omission, deceptive distortions, and fallacious argument tactics they generate is absurd, and increasing rapidly. Translation, when done as a service you are paid for, can't be relied on by propaganda bots.
I'm pretty sure the Ellsworth translation is in the corpus. You basically instructed claude to regurgitate it.
The llms all have the more famous books memorized. You can trick them to recite them more or less word for word.
So i guess in the end it just matters how important the work is.
A raw "word for word" translation (which I also tried) made the story somewhat hard to follow and very dry, but just asking it to keep the same kind of jovial swashbuckling tone of the original made something pretty similar to Ellsworth's translation.
Again, before someone decides to "correct" me on this, I am aware that it's very likely that the Ellsworth translations are part of the training set so it's not directly a fair comparison.
But now I find myself adding noise and imperfections to my writing (not that it was perfect) to make it more human, which is kinda silly.
I'm not sure how to formulate it yet but it seems there is some Peter Principle/Gell-Mann Effect corollary that is AI-related we can say here.
Perhaps: "AI rises to the level of its users' incompetence."
Or: "Confidence in AI output is inversely proportional to one's ability to verify it"
I like this / generally agree. The only wrinkle is that - for some tasks - the verification _is_ "run the script, see if it worked, don't care how... just that it did" which is distinctly different from "not only did it do it correctly, it did so in the most direct and performant way possible".
For a _lot_ of what I use LLMs to build, the former is all I need.
AI produces output that is very convincing to a non-expert, and (dangerously), it's so good at looking like an expert, they might believe that it is an expert. But the moment you ask someone to use it for something they're an expert in themselves, the holes appear wide, consistent & obvious.
My favourite moment of seeing this in action was watching AI-worrier TV host/comedian Bill Maher. He has spent years talking about the dangers of AI taking everyone's jobs, destroying civilisation, ruining the economy, starting wars, "it's just getting better and better all the time", and so on. But one night he let slip a tell. "It's no good at writing jokes. Not yet, anyway". There you go, Bill... connect those dots...
There is real utility in it being a tool to help experts apply their expertise, as in this story where it speeds up some tasks to help the translator do part of the work, enhance their expertise, allow them to be more productive.
It's a better screwdriver, a better hammer, in the hands of somebody who knows what needs a screwdriver or a hammer. It doesn't replace them. It can't replace them. It's a tool that enhances the human, not an alternative.
I don't understand why this is not widely understood yet, but I'm sure it will in due course.
And I don't expect this to change. Even if the latest model scores 100% on every benchmark, all that really tells us is that it's now more productive/efficient than it was before at helping experts do that work, not that it can replace everyone in that category of work.
Each time the frontier models get better, I see another wave of AI doubters suddenly become believers. People say things like, "AI couldn't code last year, but now I use it for everything!" Interesting. Now we know how that the person who said this has the coding skills of a Claude Opus 4.5 or whenever the frontier was when they flipped.
Meanwhile, the rest of us keep using AI as simple tools, like the person in the article. I wonder how long it will take before computers can program better than me, and I flip too.
pixel_popping•1h ago
There is already a tipping point now in software engineering where we prefer to ask AI instead of humans because we believe accuracy will be better, see SO death as an example or just see the current state of online dev communities, it's getting deserted and between team members at work, we can also notice that people speak less and less.
Sad but I believe it.
Johnbot•1h ago
pixel_popping•15m ago
We are talking about "codebases" but realistically we won't even be checking the filetree of them soon, it will be all blind, containerized and verified with pseudo guarantees which are good enough to build serious things. We don't even write documentation for humans anymore, we need to look at the trends and the reality within companies, most developers became "callcenter agents" in a matter of only 2 years and literally most of them are not even using proper automated tooling yet as we can see the "vibe coding" trend with Claude Code which is weak, by far most work done daily by developers is already automatable entirely, but with exceptions, sure, but in a few years those exceptions will become rare.
There will be niche problems about legacy products, sure, but legacy products will all be replaced over time, if we think in depth, why do we even need that many languages, that many tools? Tomorrow AI will write 99% if not all code existing ("code" doesn't even matter anyway), so it's much better if it's specific to AI and not playing this dance where we think we are doing a meaningful human contribution on an "AI-made codebase".
For context, I have 2 decades of software dev behind me.
bigstrat2003•1h ago
That's nonsense. There is zero reason to believe that AI (with the current techniques) will ever become reliable enough to let it do its own thing, let alone better than a human. It's been years of development and you still can't trust it to get basic facts correct, not even "well it's better than it used to be". Saying it'll replace humans in 5-10 years is a fantasy (or a prediction that people are stupid enough to fall for hype, I guess).
graemep•1h ago
GP is is over the top ins saying humans will "be inferior soon" but AI can be a nice additional check so AI review might be come standard.
WillowWithAWand•1h ago
We would all do well to remember that and remember that each and every advancement and use case regarding AI is the result of choices by people (or the groups of people we call corporations) and are oftentimes motivated by the profit motive, not the best interest of humanity.
We could make different choices up to and including our own Butlerian Jihad where we ban all forms of AI but we could also do everything we can to prevent the worst fallout short of that.
There are only two types of problems in the universe: 1) those posed by the laws of physics 2) those posed by human choices
The problem of AI is one of the latter.
rootusrootus•1h ago
This plague of misanthropic doom is itself pretty depressing. Why do so many people think LLMs are in any way on a path to compete with human brains? Why do you think so little of yourself? The brain is magnificent and complex in ways that we are unable to decipher anytime soon, and it does way more than an LLM. Way, way more.
pixel_popping•29m ago
When I say we, I mean the general population really. There0-'ll always be the super bright ones, sure, but we gotta be realistic here. Most people already struggle to make any meaningful contribution because it's so hard to compete, and that gap is just gonna get bigger and bigger.
I agree the brain is pretty magnificent, but when it comes to stuff like language, figuring out if an idea actually works, or running business stuff, it's pretty obvious we'll be inferior. AI can already innovate and come up with new things way faster than any human could, so at some point (soon) => the majority of contributions are just gonna come from AI, not from us.