adityaathalye•1h ago
Training Compute-Optimal Large Language Models (2022) https://arxiv.org/abs/2203.15556
Chinchilla Scaling: A replication attempt (2024) https://arxiv.org/abs/2404.10102