Ask HN: How do companies like OpenAI, Perplexity fine tune rich output?

8•agaase19•7mo ago
I see fine-tuning as one of the major ways that companies like OpenAI, Perplexity, and Claude differ when it comes to providing higher-quality answers (correct me if I am wrong).

One question I am curious about is how they fine-tune on rich data (Markdown, HTML output, tables, graphs, etc.) at scale. Currently, fine-tuning involves the laborious process of carefully editing inputs (prompts) and outputs one by one. This becomes more difficult as the data context grows, since one has to carefully examine the input data and provide the right output, including things like formatting, grammar, and UI.

Considering the wide variety of questions they process, it amazes me how they are doing it at scale. Any thoughts?

Comments

pizza•7mo ago
Anything with a linter means, at minimum, free verifiable rewards for RL (though whether something parses versus looks good is another story). That, plus, they have more data than anyone, and also it seems somewhat reasonable that stronger models could learn 'more' from a given instance or set of examples.
agaase19•7mo ago
Can you elaborate on "linter means and verifiable rewards for RL"? Is this something others would find extremely difficult to do?
holden_nelson•7mo ago
They’re saying that they can use linters to check the output from a reinforcement learning model and reward it for correct output.
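
To make the idea in this thread concrete, here is a minimal sketch of a "linter as reward" check, assuming the linter is nothing more than a parser from the Python standard library. The function names `json_reward` and `html_reward` and the 0.0/1.0 scoring are illustrative assumptions, not anything a specific lab is known to use.

```python
import json
from html.parser import HTMLParser

# Tags that never take a closing tag in HTML (partial list, for illustration).
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}


class _BalanceChecker(HTMLParser):
    """Tracks whether every opened tag is closed in the right order."""

    def __init__(self):
        super().__init__()
        self.stack = []
        self.ok = True

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        # A closing tag must match the most recently opened tag.
        if not self.stack or self.stack.pop() != tag:
            self.ok = False


def json_reward(output: str) -> float:
    """Hypothetical reward: 1.0 if the model output parses as JSON, else 0.0."""
    try:
        json.loads(output)
    except json.JSONDecodeError:
        return 0.0
    return 1.0


def html_reward(output: str) -> float:
    """Hypothetical reward: 1.0 if all tags are balanced, else 0.0."""
    checker = _BalanceChecker()
    checker.feed(output)
    checker.close()
    return 1.0 if checker.ok and not checker.stack else 0.0


if __name__ == "__main__":
    print(json_reward('{"rows": [1, 2, 3]}'))
    print(html_reward("<table><tr><td>1</td></tr></table>"))
    print(html_reward("<table><tr><td>1</tr></table>"))
```

A check like this costs nothing per sample, which is what makes it attractive as an RL signal, but as pizza points out it only confirms that the output parses, not that it looks good.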
