Preprint / working paper, 2023

Finite-sum optimization: Adaptivity to smoothness and loopless variance reduction

Abstract

For finite-sum optimization, variance-reduced (VR) gradient methods compute at each iteration the gradient of a single function (or of a mini-batch), and yet achieve faster convergence than SGD thanks to a carefully crafted lower-variance stochastic gradient estimator that reuses past gradients. Another important line of research in continuous optimization over the past decade is adaptive algorithms such as AdaGrad, which dynamically adjust the (possibly coordinate-wise) learning rate based on past gradients and thereby adapt to the geometry of the objective function. Variants such as RMSprop and Adam demonstrate outstanding practical performance that has contributed to the success of deep learning. In this work, we present AdaVR, which combines the AdaGrad algorithm with variance-reduced gradient estimators such as SAGA or L-SVRG. We show that AdaVR inherits both the good convergence properties of VR methods and the adaptive nature of AdaGrad: in the case of $L$-smooth convex functions, we establish a gradient complexity of $O(n+(L+\sqrt{nL})/\varepsilon)$ without prior knowledge of $L$. Numerical experiments demonstrate the superiority of AdaVR over state-of-the-art methods. Moreover, we empirically show that the RMSprop and Adam algorithms combined with variance-reduced gradient estimators achieve even faster convergence.
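As a purely illustrative sketch (not the authors' AdaVR implementation), the snippet below shows one way an AdaGrad-style coordinate-wise step size can be combined with the loopless-SVRG (L-SVRG) variance-reduced estimator mentioned in the abstract. The function names, default parameters, and the toy least-squares problem are hypothetical and chosen only to make the example self-contained.

```python
import numpy as np


def adagrad_lsvrg(grad_i, full_grad, x0, n, eta=1.0, p=None, eps=1e-8,
                  iters=10_000, seed=0):
    """Illustrative AdaGrad + loopless-SVRG (L-SVRG) loop for finite sums.

    grad_i(x, i): gradient of the i-th component function at x.
    full_grad(x): full gradient (1/n) * sum_i grad_i(x, i).
    """
    rng = np.random.default_rng(seed)
    p = 1.0 / n if p is None else p   # probability of refreshing the reference point
    x = x0.astype(float).copy()
    w = x.copy()                      # reference point of the L-SVRG estimator
    gw = full_grad(w)                 # stored full gradient at the reference point
    acc = np.zeros_like(x)            # AdaGrad accumulator of squared gradient coordinates
    for _ in range(iters):
        i = rng.integers(n)
        # loopless-SVRG variance-reduced gradient estimator
        g = grad_i(x, i) - grad_i(w, i) + gw
        # AdaGrad coordinate-wise step: eta / sqrt(sum of squared past gradients)
        acc += g * g
        x = x - eta * g / (np.sqrt(acc) + eps)
        # "loopless": no inner loop, refresh the reference point with probability p
        if rng.random() < p:
            w = x.copy()
            gw = full_grad(w)
    return x


# Toy least-squares finite sum: f(x) = (1/n) * sum_i 0.5 * (a_i^T x - b_i)^2
rng = np.random.default_rng(1)
n, d = 200, 10
A = rng.normal(size=(n, d))
b = A @ np.ones(d)
x_hat = adagrad_lsvrg(
    grad_i=lambda x, i: (A[i] @ x - b[i]) * A[i],
    full_grad=lambda x: A.T @ (A @ x - b) / n,
    x0=np.zeros(d),
    n=n,
)
print(np.linalg.norm(A @ x_hat - b))  # residual should be close to 0
```

The key structural points illustrated here are the loopless reference-point update (a coin flip instead of SVRG's inner loop) and the fact that the AdaGrad accumulator is fed with the variance-reduced estimate rather than the raw stochastic gradient; any correspondence with the paper's actual algorithm beyond that is an assumption.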

Dates and versions

hal-04217357, version 1 (25-09-2023)


Cite

Bastien Batardière, Julien Chiquet, Joon Kwon. Finite-sum optimization: Adaptivity to smoothness and loopless variance reduction. 2023. ⟨hal-04217357⟩