In any case, platforms like tinker.ai support both SFT and RL.
- PaddleOCR, a 0.9B model that reaches SOTA accuracy across text, tables, formulas, charts & handwriting. [0]
- A 3B and 8B model which performs HTML to json extraction at GPT-5 level accuracy at 40-80x less cost, and faster inference. [1]
I think it makes sense to fine tune when you're optimizing for a specific task.
[0] https://huggingface.co/papers/2510.14528
[1] https://www.reddit.com/r/LocalLLaMA/comments/1o8m0ti/we_buil...
oli5679•1h ago
It requires no local gpus, just creating a json and posting to OpenAI
https://platform.openai.com/docs/guides/model-optimization
deaux•43m ago