AdaBelief
Striding Toward the Minimum: A faster way to optimize the loss function for deep learning.
When you’re training a deep learning model, it can take days for an optimization algorithm to minimize the loss function. A new approach could save time.