Momentum-based optimizers
Advanced techniques for early stopping: Learning rate schedules, adaptive optimization, and more