https://arxiv.org/pdf/2208.12591.pdf

Question: why do models generalize even as weights increase?

Believed that deep-learning algorithms exhibit some implicit bias, or regularization, where model leans toward simpler modelling of data, like penalizing if there is too much complexity in data

Double