Computer Science > Machine Learning

arXiv:2112.09164 (cs)

[Submitted on 16 Dec 2021 (v1), last revised 16 Aug 2022 (this version, v2)]

Title:High Fidelity Visualization of What Your Self-Supervised Representation Knows About

Authors:Florian Bordes, Randall Balestriero, Pascal Vincent

View PDF

Abstract:Discovering what is learned by neural networks remains a challenge. In self-supervised learning, classification is the most common task used to evaluate how good a representation is. However, relying only on such downstream task can limit our understanding of what information is retained in the representation of a given input. In this work, we showcase the use of a Representation Conditional Diffusion Model (RCDM) to visualize in data space the representations learned by self-supervised models. The use of RCDM is motivated by its ability to generate high-quality samples -- on par with state-of-the-art generative models -- while ensuring that the representations of those samples are faithful i.e. close to the one used for conditioning. By using RCDM to analyze self-supervised models, we are able to clearly show visually that i) SSL (backbone) representation are not invariant to the data augmentations they were trained with -- thus debunking an often restated but mistaken belief; ii) SSL post-projector embeddings appear indeed invariant to these data augmentation, along with many other data symmetries; iii) SSL representations appear more robust to small adversarial perturbation of their inputs than representations trained in a supervised manner; and iv) that SSL-trained representations exhibit an inherent structure that can be explored thanks to RCDM visualization and enables image manipulation.

Comments:	Accepted at TMLR 2022
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2112.09164 [cs.LG]
	(or arXiv:2112.09164v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.09164

Submission history

From: Florian Bordes [view email]
[v1] Thu, 16 Dec 2021 19:23:33 UTC (90,596 KB)
[v2] Tue, 16 Aug 2022 15:41:14 UTC (47,704 KB)

Computer Science > Machine Learning

Title:High Fidelity Visualization of What Your Self-Supervised Representation Knows About

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:High Fidelity Visualization of What Your Self-Supervised Representation Knows About

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators