Momentum-based optimizers

  1. Advanced techniques for early stopping: Learning rate schedules, adaptive optimization, and more