arXiv
Scaling Laws for Neural Language Models: A Comprehensive Study
We investigate the scaling properties of Transformer language models across multiple dimensions including model size, data, and compute...
J. Smith, A. Johnson, M. Williams
May 8, 2026