Computer Science > Cryptography and Security

arXiv:2408.09469 (cs)

[Submitted on 18 Aug 2024 (v1), last revised 20 Aug 2024 (this version, v2)]

Title:Enhancing Adversarial Transferability with Adversarial Weight Tuning

Authors:Jiahao Chen, Zhou Feng, Rui Zeng, Yuwen Pu, Chunyi Zhou, Yi Jiang, Yuyou Gan, Jinbao Li, Shouling Ji

Abstract:Deep neural networks (DNNs) are vulnerable to adversarial examples (AEs) that mislead the model while appearing benign to human observers. A critical concern is the transferability of AEs, which enables black-box attacks without direct access to the target model. However, many previous attacks have failed to explain the intrinsic mechanism of adversarial transferability. In this paper, we rethink the property of transferable AEs and reformalize the formulation of transferability. Building on insights from this mechanism, we analyze the generalization of AEs across models with different architectures and prove that we can find a local perturbation to mitigate the gap between surrogate and target models. We further establish the inner connections between model smoothness and flat local maxima, both of which contribute to the transferability of AEs. Further, we propose a new adversarial attack algorithm, \textbf{A}dversarial \textbf{W}eight \textbf{T}uning (AWT), which adaptively adjusts the parameters of the surrogate model using generated AEs to optimize the flat local maxima and model smoothness simultaneously, without the need for extra data. AWT is a data-free tuning method that combines gradient-based and model-based attack methods to enhance the transferability of AEs. Extensive experiments on a variety of models with different architectures on ImageNet demonstrate that AWT yields superior performance over other attacks, with an average increase of nearly 5\% and 10\% attack success rates on CNN-based and Transformer-based models, respectively, compared to state-of-the-art attacks.

Comments:	13 pages
Subjects:	Cryptography and Security (cs.CR)
Cite as:	arXiv:2408.09469 [cs.CR]
	(or arXiv:2408.09469v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2408.09469

Submission history

From: Jiahao Chen [view email]
[v1] Sun, 18 Aug 2024 13:31:26 UTC (497 KB)
[v2] Tue, 20 Aug 2024 05:28:55 UTC (497 KB)

Computer Science > Cryptography and Security

Title:Enhancing Adversarial Transferability with Adversarial Weight Tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Enhancing Adversarial Transferability with Adversarial Weight Tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators