Los Alamos National Laboratory
Life Is Easier for Big Networks: Neural networks learn better with more parameters.
According to the lottery ticket hypothesis, the bigger the neural network, the more likely some of its weights are initialized to values that are well suited to learning to perform the task at hand. But just how big does it need to be?