I think this is a very valuable exercise if you try to understand how LLMs work and if you have the time.
rvnx•10m ago
Having the money is really what you need, not time.
Nowadays training very powerful LLMs is easy because all the tooling, source-codes, training datasets, and teaching agents are available.
Having money is not, unless you are selling AI snake-oil type of companies.
contrast•6m ago
You seem to be talking about a production-grade model rather than building an LLM as an exercise? Or if not, why do you disagree with the article's example of building a small LLM for $100?
ducktective•9m ago
Are off-shelf GPUs (like one 3090) suitable for modern academic research on current AI advancements or is it better to rent some cloud compute?
DeathArrow•13m ago
rvnx•10m ago
Nowadays training very powerful LLMs is easy because all the tooling, source-codes, training datasets, and teaching agents are available.
Having money is not, unless you are selling AI snake-oil type of companies.
contrast•6m ago