https://arxiv.org/pdf/2208.12591.pdf
Question: why do models still generalize as the number of weights (parameters) increases?
It is believed that deep-learning algorithms exhibit an implicit bias (implicit regularization): training leans toward simpler models of the data, as if excess model complexity were penalized, even though no explicit penalty is added.
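A toy illustration of implicit bias (my own sketch, not taken from the paper): with one training example and three weights, infinitely many weight vectors fit the data exactly. Plain gradient descent started from zero nevertheless converges to the one with the smallest L2 norm, although the loss contains no norm penalty. The data values `x` and `y`, the step size, and the helper names are all made up for illustration.

```python
# Toy sketch: gradient descent on an underdetermined least-squares problem.
# One example, three weights, so many different w satisfy w . x == y exactly.

def gradient_descent(x, y, steps=2000, lr=0.05):
    """Minimize 0.5 * (w . x - y)^2 by gradient descent, starting from w = 0."""
    w = [0.0] * len(x)
    for _ in range(steps):
        residual = sum(wi * xi for wi, xi in zip(w, x)) - y
        # Gradient of the loss w.r.t. w is residual * x; no regularization term.
        w = [wi - lr * residual * xi for wi, xi in zip(w, x)]
    return w

def min_norm_solution(x, y):
    """Smallest-L2-norm w with w . x == y, which is w = x * y / (x . x)."""
    scale = y / sum(xi * xi for xi in x)
    return [scale * xi for xi in x]

def l2_norm(w):
    return sum(wi * wi for wi in w) ** 0.5

x = [1.0, 2.0, 2.0]  # hypothetical input features
y = 3.0              # hypothetical target

w_gd = gradient_descent(x, y)     # what training actually finds
w_min = min_norm_solution(x, y)   # the "simplest" interpolating solution
w_other = [3.0, 0.0, 0.0]         # also fits the data exactly, but has larger norm

print("gradient descent:", w_gd, "norm", l2_norm(w_gd))
print("minimum norm:    ", w_min, "norm", l2_norm(w_min))
print("another fit:     ", w_other, "norm", l2_norm(w_other))
```

The updates are always multiples of `x`, so starting from zero the weights never leave the direction of `x`, and the only interpolating solution in that direction is the minimum-norm one. This holds for the linear case only; how far the same picture carries over to deep networks is the open question these notes are about.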
Double