Explaining neural scaling laws
… trained deep neural networks often follows precise power-law scaling … explains the origins
of and connects these scaling laws. We identify variance-limited and resolution-limited scaling …
of and connects these scaling laws. We identify variance-limited and resolution-limited scaling …
Scaling laws for deep learning
JS Rosenfeld - arXiv preprint arXiv:2108.07686, 2021 - arxiv.org
… of deep neural … explaining the scaling laws has followed and broadened a similar
approximation-centric [4, 59] analysis yielding a rigorous theoretical understanding of the scaling …
approximation-centric [4, 59] analysis yielding a rigorous theoretical understanding of the scaling …
Revisiting neural scaling laws in language and vision
IM Alabdulmohsin, B Neyshabur… - Advances in Neural …, 2022 - proceedings.neurips.cc
… Power law scaling in deep neural architectures has been verified … To explain this theoretically,
at least for data scaling, several … of scaling laws and an improved estimator of scaling law …
at least for data scaling, several … of scaling laws and an improved estimator of scaling law …
A dynamical model of neural scaling laws
… To attempt to explain these phenomena, we develop a mathematically tractable model of
neural scaling laws which allows one to simultaneously vary time, model size, and dataset size. …
neural scaling laws which allows one to simultaneously vary time, model size, and dataset size. …
Beyond neural scaling laws: beating power law scaling via data pruning
… Widely observed neural scaling laws, in which error falls off as a power of the training set …
scaling alone require considerable costs in compute and energy. Here we focus on the scaling …
scaling alone require considerable costs in compute and energy. Here we focus on the scaling …
Explaining scaling laws of neural network generalization
… well-trained neural networks often follows precise power-law scaling relations with … explains
and connects these scaling laws. We identify variance-limited and resolution-limited scaling …
and connects these scaling laws. We identify variance-limited and resolution-limited scaling …
Broken neural scaling laws
… broken power law functional form (referred to by us as a broken neural scaling law (BNSL))
that accurately models and extrapolates the scaling behaviors of deep neural networks (ie …
that accurately models and extrapolates the scaling behaviors of deep neural networks (ie …
Neural scaling laws on graphs
… The main focus of this work is on the two basic forms of neural scaling laws for the graph …
two basic forms of neural scaling laws: the model scaling law and the data scaling law. We first …
two basic forms of neural scaling laws: the model scaling law and the data scaling law. We first …
Scaling laws for neural language models
… 5 Scaling Laws with Model Size and Training Time In this section we will demonstrate that
a simple scaling law … First we will explain how to use the results of [MKAT18] to define a …
a simple scaling law … First we will explain how to use the results of [MKAT18] to define a …
Scaling laws in cognitive sciences
… theories to explain scaling laws in terms that can cross or integrate disciplines. … of scaling
laws in other sciences. Here, we review evidence of scaling laws in cognitive science, at neural…
laws in other sciences. Here, we review evidence of scaling laws in cognitive science, at neural…
Recherches associées
- data manifold neural scaling law
- data pruning neural scaling laws
- language and vision neural scaling laws
- large scale graphs neural scaling law
- solvable model neural scaling laws
- resource model neural scaling law
- dynamical model neural scaling laws
- adaptive model training scaling laws
- cognitive sciences scaling laws
- node classification neural scaling law