Explaining neural scaling laws

Y Bahri, E Dyer, J Kaplan, J Lee, U Sharma - Proceedings of the National …, 2024 - pnas.org
… trained deep neural networks often follows precise power-law scaling … explains the origins
of and connects these scaling laws. We identify variance-limited and resolution-limited scaling

Scaling laws for deep learning

JS Rosenfeld - arXiv preprint arXiv:2108.07686, 2021 - arxiv.org
… of deep neural … explaining the scaling laws has followed and broadened a similar
approximation-centric [4, 59] analysis yielding a rigorous theoretical understanding of the scaling

Revisiting neural scaling laws in language and vision

IM Alabdulmohsin, B Neyshabur… - Advances in Neural …, 2022 - proceedings.neurips.cc
… Power law scaling in deep neural architectures has been verified … To explain this theoretically,
at least for data scaling, several … of scaling laws and an improved estimator of scaling law

A dynamical model of neural scaling laws

B Bordelon, A Atanasov, C Pehlevan - arXiv preprint arXiv:2402.01092, 2024 - arxiv.org
… To attempt to explain these phenomena, we develop a mathematically tractable model of
neural scaling laws which allows one to simultaneously vary time, model size, and dataset size. …

Beyond neural scaling laws: beating power law scaling via data pruning

B Sorscher, R Geirhos, S Shekhar… - … in Neural …, 2022 - proceedings.neurips.cc
… Widely observed neural scaling laws, in which error falls off as a power of the training set …
scaling alone require considerable costs in compute and energy. Here we focus on the scaling

Explaining scaling laws of neural network generalization

Y Bahri, E Dyer, J Kaplan, J Lee, U Sharma - 2021 - openreview.net
… well-trained neural networks often follows precise power-law scaling relations with … explains
and connects these scaling laws. We identify variance-limited and resolution-limited scaling

Broken neural scaling laws

E Caballero, K Gupta, I Rish, D Krueger - arXiv preprint arXiv:2210.14891, 2022 - arxiv.org
… broken power law functional form (referred to by us as a broken neural scaling law (BNSL))
that accurately models and extrapolates the scaling behaviors of deep neural networks (i.e. …

Neural scaling laws on graphs

J Liu, H Mao, Z Chen, T Zhao, N Shah… - arXiv preprint arXiv …, 2024 - arxiv.org
… The main focus of this work is on the two basic forms of neural scaling laws for the graph …
two basic forms of neural scaling laws: the model scaling law and the data scaling law. We first …

Scaling laws for neural language models

J Kaplan, S McCandlish, T Henighan, TB Brown… - arXiv preprint arXiv …, 2020 - arxiv.org
… 5 Scaling Laws with Model Size and Training Time In this section we will demonstrate that
a simple scaling law … First we will explain how to use the results of [MKAT18] to define a …

Scaling laws in cognitive sciences

CT Kello, GDA Brown, R Ferrer-i-Cancho… - Trends in cognitive …, 2010 - cell.com
… theories to explain scaling laws in terms that can cross or integrate disciplines. … of scaling
laws in other sciences. Here, we review evidence of scaling laws in cognitive science, at neural