frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Are You Using Finetuning?

3•nate•1h ago
How? For what?

fintuneing seems to be out of fashion (if it were really ever in fashion), but I still see folks like Karpathy mention reaching for it as a tool.

But is anyone in any business capacity on here doing that? Are you finetuning any remote LLM or something self-hosted? What for?

I’m just curious where the line is of “oh this is better encoded in the models weights rather than in RAG/thinking over context stuff it needs to figure out.

Comments

BoredPositron•1h ago
We mainly do full finetunes on diffusion models and their text encoders like z-image, flux2 klein to adapt them to our clients visual style and train LoRas for people and products. The quality goes up immensely if the model has a better grasp of professional visual terms. Training the right kind of leather or plastic (mainly for the pattern) helps when you are scaling to 12-16k and want 99.9% reproduction, everything becomes a texture at that size and if you don't have them trained it's a mess.
nate•1h ago
Ah. That makes sense. Is this something where you do it once and you are done? Or is it something you re-finetune based on performance or reviews you get back from the client. i.e. Client doesn't like something so you go back for another cycle of

Also, is this something that's a pain in the ass to manage multiple versions of the model? One (maybe more in draft mode) for each client?

BoredPositron•1h ago
We do one finetune on the base model to iron out a few of its problems, like plastic skin and its poor understanding of visual terms and reproduction. It also really helps it understand the normal maps we use for perspective templating.

What we are mostly producing are LoRAs, and we put them through a staged training process. The first stage is all about the textures, the second stage focuses on the product itself, and the last stage dials in the exact perspectives we need.

Despite what the research out there says, we actually get better results sticking with LoRAs instead of LoKRs. The pain is generating the dataset because you have to adapt it for every product. The actual training is basically just fire and forget.

S3 Files

https://aws.amazon.com/blogs/aws/launching-s3-files-making-s3-buckets-accessible-as-file-systems/
1•akshaysaxena•41s ago•0 comments

What AI is Doing to the Workforce [video]

https://www.youtube.com/watch?v=ncmuXQGGqBM
1•johnath•1m ago•0 comments

Where are we converging with all AI companies running the same business model

1•elmlabs•2m ago•0 comments

Show HN: Hive – full dev workspace using (Kanban/chat mode,multi-repo,agent-SDK)

https://github.com/morapelker/hive
1•moropex•2m ago•0 comments

Eyes on the Far Side of the Moon

https://www.nytimes.com/interactive/2026/04/07/science/space/moon-photos-artemis-2-nasa.html
1•saikatsg•3m ago•0 comments

The V32 numbers station: a mysterious Cold War spying system revived in Iran

https://english.elpais.com/international/2026-03-19/the-v32-numbers-station-a-mysterious-cold-war...
1•mazokum•3m ago•0 comments

SCOTUS overturns 5th Circuit ruling that told ISP to kick pirates off Internet

https://arstechnica.com/tech-policy/2026/04/scotus-overturns-5th-circuit-ruling-that-told-isp-to-...
2•CharlesW•5m ago•0 comments

Ask HN: What is the most useful website you've discovered?

1•hopefully_can•6m ago•0 comments

Give Your AI Eyes: Introducing Chrome DevTools MCP

https://addyosmani.com/blog/devtools-mcp/
1•fagnerbrack•7m ago•0 comments

Will you run one reproducibility check? (Voynich structural analysis)

https://github.com/Frederick-Stalnecker/voynich-section-grammarInstructions:REPRODUCE.md
1•TheosResearch•8m ago•1 comments

Show HN: I built an AI coding agent 50% cheaper than Claude Code (same prompts)

https://github.com/kirby88/vix-releases
1•kirby88•8m ago•0 comments

Duolingo-style GitHub streak widget for READMEs

https://github-streak.rahuldhole.com/
1•rahuldhole•8m ago•0 comments

Before We Start on Quantum

https://scottaaronson.blog/?p=9668
1•whatisabcdefgh•10m ago•0 comments

1•illegalmemory•13m ago

Visual DLQ monitoring and replay for RabbitMQ

https://queueforgehq.com
1•rootN•13m ago•0 comments

Why IPv6 is the only way forward

https://ankshilp.in/posts/for-the-love-of-internet/
5•quaintdev•14m ago•0 comments

Roger Fenton's Valley of the Shadow of Death (1855)

https://publicdomainreview.org/collection/roger-fenton-valley-of-the-shadow-of-death/
1•speckx•14m ago•0 comments

Apple Studio Display XDR Now FDA-Cleared for Diagnostic Radiology Use

https://www.macrumors.com/2026/04/07/studio-display-xdr-fda-clearance/
2•tosh•15m ago•0 comments

Spacebot: Agentic AI system where LLM process has a dedicated role, OpenClaw alt

https://spacebot.sh/
1•maxloh•15m ago•0 comments

Hotcopy Coding CLI with no context ceiling and agents that learn across sessions

https://hotcopy.ai/
3•antoniomadams•17m ago•0 comments

China is winning one AI race, the US another – but either might pull ahead

https://www.bbc.com/news/articles/c145enxln0go
4•devonnull•17m ago•0 comments

Building a framework-agnostic Ruby gem (and making sure it doesn't break)

https://newsletter.masilotti.com/p/on-building-a-framework-agnostic
1•joemasilotti•18m ago•0 comments

Another Memory Corruption Case

https://trofi.github.io/posts/347-another-memory-corruption-case.html
1•speckx•18m ago•0 comments

NRR doesn't have to compress as you scale (data from 37 devtools)

https://twitter.com/evilmartians/status/2041228426712101311
1•camimirabal•19m ago•0 comments

Show HN: A VS Code extension that points tickets based on tech debt

https://marketplace.visualstudio.com/items?itemName=BrooksForsyth.storypointinator
1•bforsyth•19m ago•0 comments

We fix your broken Rhino models

https://arcol.io/blog/how-we-fix-your-broken-rhino-models
1•skoodge•20m ago•0 comments

Tailslayer: Library for reducing tail latency in RAM reads

https://github.com/LaurieWired/tailslayer
2•hasheddan•20m ago•1 comments

Wireless festival cancelled after Kanye West banned from entering UK

https://www.theguardian.com/music/2026/apr/07/home-office-bans-kanye-west-from-entering-uk-wirele...
1•thinkingemote•22m ago•0 comments

RAM Has a Design Flaw from 1966. I Bypassed It [video]

https://www.youtube.com/watch?v=KKbgulTp3FE
1•surprisetalk•22m ago•0 comments

Show HN: A Little Excursion

https://alittleexcursion.com/
2•zzzzzzzzzzzxc•23m ago•0 comments