Computer Science > Machine Learning

arXiv:2408.08222 (cs)

[Submitted on 15 Aug 2024]

Title:Enhancing Sharpness-Aware Minimization by Learning Perturbation Radius

Authors:Xuehao Wang, Weisen Jiang, Shuai Fu, Yu Zhang

Abstract:Sharpness-aware minimization (SAM) is to improve model generalization by searching for flat minima in the loss landscape. The SAM update consists of one step for computing the perturbation and the other for computing the update gradient. Within the two steps, the choice of the perturbation radius is crucial to the performance of SAM, but finding an appropriate perturbation radius is challenging. In this paper, we propose a bilevel optimization framework called LEarning the perTurbation radiuS (LETS) to learn the perturbation radius for sharpness-aware minimization algorithms. Specifically, in the proposed LETS method, the upper-level problem aims at seeking a good perturbation radius by minimizing the squared generalization gap between the training and validation losses, while the lower-level problem is the SAM optimization problem. Moreover, the LETS method can be combined with any variant of SAM. Experimental results on various architectures and benchmark datasets in computer vision and natural language processing demonstrate the effectiveness of the proposed LETS method in improving the performance of SAM.

Comments:	Accepted by ECML PKDD 2024
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2408.08222 [cs.LG]
	(or arXiv:2408.08222v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.08222

Submission history

From: Xuehao Wang [view email]
[v1] Thu, 15 Aug 2024 15:40:57 UTC (586 KB)

Computer Science > Machine Learning

Title:Enhancing Sharpness-Aware Minimization by Learning Perturbation Radius

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Enhancing Sharpness-Aware Minimization by Learning Perturbation Radius

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators