Super-Convergence on CIFAR10
sophia cifar10 lion second-order-optimization adamw super-convergence weight-decay sharpness-aware-minimization madgrad large-batch-optimization lion-optimizer
-
Updated
Jun 17, 2024 - Python