Acknowledgements
adabelief
Adabelief
AdaBelief
AdaBound
Adabound
adabound
Adadelta
Adagrad
AdamW
adamw
Adaptivity
authors’
Beale
CMD
Codecov
Dazat
Dozat
Defazio
Devendra
Dvornek
dfalbel
Falbel
Gao
Haoming
Hutter
ICLR
Ilya
jIp
jvwB
Jianfeng
Jiang
Jiawei
Jelassi
juntang
Juntang
Kingma
Kumar
Keskar
Liangchen
Liu
Liyuan
LiyuanLucasLiu
Loshchilov
Luo
Luolc
MADGRAD
Manzil
Momentumized
Mrpatekful
Nadam
nadam
Nesterov
Nesterov’s
NeurIPS
Nicha
Nikolay
Nitish
Nonconvex
Novik
Oponski
openreview
Patrik
Papademetris
Pengcheng
Purgai
QH
qhadam
QHAdam
qhoptim
RAdam
radam
RMSProp
RMSprop
Reddi
SGD
Sachan
Samy
Sanjiv
Sashank
Satyen
Sekhar
Shekar
Shirish
Sochee
Socher
warmup
Tatikonda
Xiong
Xiaodong
Xu
Weizhu
Yan
Yarats
Yifan
Yuanhao
Zaheer
zhuang
Zhuang
ZJjtNEZ
al
arXiv
arxiv
bff
colllin
doi
et
facebookresearch
gifski
github
https
inequivalence
jettify
madgrad
mlverse
nonconvex
optimizers
Optimizers
preprint
py
pytorch
rescaled
th
verison
viridis
wikipedia
