AdaDelta

Adaptive learning rate optimizer that refines AdaGrad by limiting aggressive decay via running averages.