How 'overworked, underpaid' humans train Google's AI to seem smart

https://www.theguardian.com/technology/2025/sep/11/google-gemini-ai-training-humans
70•Brajeshwar•1h ago

Comments

kerblang•1h ago
Are other AI companies doing the same thing? Would like to see more articles about this...
jkkola•1h ago
There's a YouTube video titled "AI is a hype-fueled dumpster fire" [0] that mentions OpenAI's shenanigans. I haven't fact checked that but I've heard enough stories to believe it.

[0] https://youtu.be/0bF_AQvHs1M?si=rpMG2CY3TxnG3EYQ

thepryz•1h ago
Scale AI’s entire business model was using people in developing countries to label data for training models. Once you look into it, it comes across as rather predatory.

This was one of the first links I found re: Scale’s labor practices https://techcrunch.com/2025/01/22/scale-ai-is-facing-a-third...

Here’s another: https://relationaldemocracy.medium.com/an-authoritarian-work...

lawgimenez•1h ago
A couple of months ago I received a job invite for Kotlin AI trainers from the team at Upwork. I asked what the job was about and she said something like "for the opportunity to review & evaluate content for generative AI." And I'm from a developed country too.
benreesman•51m ago
There's nontrivial historical precedent for this exact playbook: when a new paradigm (Lisp machines and GOFAI search, GPU backprop, softmax self-attention) is scaling fast, a lot of promises get made, a lot of national security money gets involved, and AI Summer is just balmy.

But the next paradigm breakthrough is hard to forecast, and the current paradigm's asymptote is just as hard to predict, so it's +EV to say "tomorrow" and "forever".

When the second becomes clear before the first, you turk and expert label like it's 1988 and pray that the next paradigm breakthrough is soon. You bridge the gap with expert labeling and compute until it works or you run out of money and the DoD guy stops taking your calls. AI Winter is cold.

And just like Game of Thrones, no one, I mean no one, not Altman, not Amodei, not Allah Most Blessed knows when the seasons in A Song of Math and Grift will change.

jhbadger•47m ago
Karen Hao's recent book "Empire of AI" about the rise of OpenAI goes into detail about how people in Africa and South America were hired (and arguably exploited) for their training efforts.
cs702•1h ago
The title is biased, blaming Google for mistreating people and implying that Google's AI isn't smart, but the OP is worth reading, because it gives readers a sense of the labor and cost involved in providing AI models with human feedback, the HF in RLHF, to ensure the AI models are more aligned with human values and preferences.
lm28469•1h ago
> to ensure the AI models are more aligned with human values and preferences.

And which are these universal human values and preferences? Or are we talking about Silicon Valley executives' values?

giveita•1h ago
> Sawyer is one among the thousands of AI workers contracted for Google through Japanese conglomerate Hitachi’s GlobalLogic to rate and moderate the output of Google’s AI products...

Depends how you look at it. I think a brand like Google should vet a mere one level down the supply chain.

FirmwareBurner•49m ago
I had no idea Hitachi was also running software sweatshops.
rs186•1h ago
> to ensure the AI models are more aligned with human values and preferences.

to ensure the AI models are more aligned with Google's values and preferences.

FTFY

falcor84•48m ago
I'm a big fan of cyberpunk dystopian fiction, but I still can't quite understand what you're alluding to here. Can you give an example of a value that Google aligns the AI with that you think isn't a positive human value?
Ygg2•44m ago
"Adtech is good. Adblockers are unnatural"
smokel•32m ago
Google Gemini 2.5 Pro actually has a quite nuanced reply when asked to consider this statement, including the following:

> "Massive privacy invasion: The core of modern adtech runs on tracking your behavior across different websites and apps. It collects vast amounts of personal data to build a detailed profile about your interests, habits, location, and more, often without your full understanding or consent."

ToucanLoucan•37m ago
Their entire business model? Making search results worse to juice page impressions? Every dark pattern they use to juice subscriptions like every other SaaS company? Brand lock-in for Android? Paying Apple for prominent placement of their search engine in iOS? Anti-competitive practices in the Play store? Taking a massive cut of Play Store revenue from people actually making software?
simonw•18m ago
How does all of that affect the desired outputs for their LLMs?
add-sub-mul-div•26m ago
Yes, and one more tweak: the values of Google or anyone paying Google to deliver their marketing or political messaging.
zozbot234•27m ago
RLHF (and its evolution, RLAIF) is actually used for more than setting "values and preferences". It's what makes AI models engage in recognizable behavior, as opposed to simply continuing a given text. It's how the "Chat" part of "ChatGPT" can be made to work in the first place.
throwaway106382•22m ago
What is a "human value" and whose preferences?
zerodaysbroker•1h ago
The title seems kinda misleading; this is from the article (GlobalLogic is the company contracted by Google):

"AI raters at GlobalLogic are paid more than their data-labeling counterparts in Africa and South America, with wages starting at $16 an hour for generalist raters and $21 an hour for super raters, according to workers. Some are simply thankful to have a gig as the US job market sours, but others say that trying to make Google’s AI products better has come at a personal cost."

imperio59•39m ago
It's employment at will. They are free to go work somewhere else if they don't like it...
teiferer•20m ago
That argument is as old as any mistreated worker complaining about their situation and as old as any argument against workers' rights in general. Anybody not liking their job could just leave, right? Simple! No, the world just isn't that simple and it didn't become simpler just because it happens in an AI context that produces a tool you like.

There are lots of jobs out there that suck and people do them anyway. Because the freedom that they supposedly have is not as free as you imagine.

mallowdram•1h ago
Gemini is faked.

How this industry managed to not grasp that meaning exists entirely separate from words is altogether bizarre.

dolphinscorpion•47m ago
"Google" posted a job opening. They applied for and took the job, agreeing to posted pay and conditions. End of the story. It's not up to the Guardian to decide
xkbarkar•44m ago
I agree, article is pretty low quality ragebait. Not good journalism at all.
lysace•5m ago
It is amazing how much their quality levels have fallen during the past two decades.

I used to point to their reporting as models that my nation’s newspapers should seek to emulate.

iandanforth•45m ago
"Google said in a statement: “Quality raters are employed by our suppliers and are temporarily assigned to provide external feedback on our products. Their ratings are one of many aggregated data points that help us measure how well our systems are working, but do not directly impact our algorithms or models.” GlobalLogic declined to comment for this story." (emphasis mine)

How is this not a straight up lie? For this to be true they would have to throw away labeled training data.

Gracana•38m ago
They probably don’t do it at a scale large enough to do RLHF with it, but it’s still useful feedback for the people working on the projects / products.
zozbot234•31m ago
More recent models actually use "reinforcement learning from AI feedback", where the task of assigning a reward is essentially fed back into the model itself. Human feedback is then only used to ground the training, on selected examples (potentially even entirely artificial ones) where the AI is most highly uncertain about what feedback should be given.
creddit•32m ago
Because they are doing it to compute quality metrics not to implement RLHF. It’s not training data.
teiferer•25m ago
Key word: "directly"

It does so indirectly, so it's a true albeit misleading statement.

ants_everywhere•43m ago
When they switch to aligning with algorithms instead of humans we'll get another story about how terrible it was that they removed the jobs that were terrible when they existed.

This doesn't sound as bad to me as the Facebook moderator job or even a call center job, but it does sound pretty tedious.

lysace•39m ago
> with wages starting at $16 an hour for generalist raters and $21 an hour for super raters, according to workers

That’s sort of what I expect the Guardian’s UK online non-sub readers to make.

simonw•20m ago
Something I'd be interested to understand is how widespread this practice is. Are all of the LLMs trained using human labor that is sometimes exposed to extreme content?

There are a whole lot of organizations training competent LLMs these days in addition to the big three (OpenAI, Google, Anthropic).

What about Mistral and Moonshot and Qwen and DeepSeek and Meta and Microsoft (Phi) and Hugging Face and Ai2 and MBZUAI? Do they all have their own (potentially outsourced) teams of human labelers?

I always look out for notes about this in model cards and papers but it's pretty rare to see any transparency about how this is done.

yvdriess•17m ago
One of the key innovations behind the DNN/CNN models was Mechanical Turk. OpenAI used a similar system extensively to improve the early GPT models. I would not be surprised if the practice continues today; NN models need a lot of quality ground truth training data.
simonw•8m ago
Right, but where are the details?

Given the number of labs that are competing these days on "open weights" and "transparency" I'd be very interested to read details of how some of them are handling the human side of their model training.

I'm puzzled at how little information I've been able to find.

whilenot-dev•10m ago
So why do you think asking this question here would yield a satisfying answer, especially given how the HN community likes to dispute any vague conclusions for anything as hyped as AI training?

To counter your question, what makes you think that's not the case? Do you think Mistral/Moonshot/Qwen/etc. are all employing their own data labelers? Why would you expect this kind of transparency from for-profit bodies that are evaluated in the billions?

philipallstar•20m ago
If they're underpaid and overworked, by definition words that are relative to other options, they should go to one of the better options.
CPLX•13m ago
Glad to learn from your post that the labor market has recently become perfectly competitive and efficient.
bflesch•11m ago
The way you defend against an article citing "thousands of workers" by using a nitpicky criticism about grammar style makes me suspect that it raises a cognitive dissonance in your head that you are not ready to address yet.
Group_B•8m ago
Comments like these are why HN is the best
blactuary•6m ago
Yeah they should simply buy widgets from the abundance of other widget sellers since this is a perfectly competitive market with no transaction costs and perfectly symmetric information
a3w•18m ago
AI means actual Indians; did we not learn that from the initial OpenAI GPT 3.0 training? It made it to HN.
wslh•17m ago
It seems like déjà vu of the previous Amazon Mechanical Turk[1] discussions[2], but with AI.

[1] https://www.mturk.com/

[2] https://tinyurl.com/4r2p39v3
