Skip to main content

Showing 1–43 of 43 results for author: Qiang, W

  1. arXiv:2410.12816  [pdf, other

    cs.CV cs.CL cs.LG

    Rethinking Misalignment in Vision-Language Model Adaptation from a Causal Perspective

    Authors: Yanan Zhang, Jiangmeng Li, Lixiang Liu, Wenwen Qiang

    Abstract: Foundational Vision-Language models such as CLIP have exhibited impressive generalization in downstream tasks. However, CLIP suffers from a two-level misalignment issue, i.e., task misalignment and data misalignment, when adapting to specific tasks. Soft prompt tuning has mitigated the task misalignment, yet the data misalignment remains a challenge. To analyze the impacts of the data misalignment… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: Accepted by NeurIPS 2024

  2. arXiv:2410.00772  [pdf, other

    cs.CV cs.LG

    On the Generalization and Causal Explanation in Self-Supervised Learning

    Authors: Wenwen Qiang, Zeen Song, Ziyin Gu, Jiangmeng Li, Changwen Zheng, Fuchun Sun, Hui Xiong

    Abstract: Self-supervised learning (SSL) methods learn from unlabeled data and achieve high generalization performance on downstream tasks. However, they may also suffer from overfitting to their training data and lose the ability to adapt to new tasks. To investigate this phenomenon, we conduct experiments on various SSL methods and datasets and make two observations: (1) Overfitting occurs abruptly in lat… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  3. arXiv:2409.08474  [pdf, other

    cs.LG cs.CV

    Rethinking Meta-Learning from a Learning Lens

    Authors: Jingyao Wang, Wenwen Qiang, Jiangmeng Li, Lingyu Si, Changwen Zheng

    Abstract: Meta-learning has emerged as a powerful approach for leveraging knowledge from previous tasks to solve new tasks. The mainstream methods focus on training a well-generalized model initialization, which is then adapted to different tasks with limited data and updates. However, it pushes the model overfitting on the training tasks. Previous methods mainly attributed this to the lack of data and used… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  4. arXiv:2407.14069  [pdf, other

    cs.CV

    Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective

    Authors: Zeen Song, Jingyao Wang, Jianqi Zhang, Changwen Zheng, Wenwen Qiang

    Abstract: Video contrastive learning (v-CL) has gained prominence as a leading framework for unsupervised video representation learning, showcasing impressive performance across various tasks such as action classification and detection. In the field of video representation learning, a feature extractor should ideally capture both static and dynamic semantics. However, our series of experiments reveals that… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  5. arXiv:2407.14058  [pdf, other

    cs.LG

    On the Causal Sufficiency and Necessity of Multi-Modal Representation Learning

    Authors: Jingyao Wang, Wenwen Qiang, Jiangmeng Li, Lingyu Si, Changwen Zheng, Bing Su

    Abstract: An effective paradigm of multi-modal learning (MML) is to learn unified representations among modalities. From a causal perspective, constraining the consistency between different modalities can mine causal representations that convey primary events. However, such simple consistency may face the risk of learning insufficient or unnecessary information: a necessary but insufficient cause is invaria… ▽ More

    Submitted 30 August, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

  6. arXiv:2407.13541  [pdf, other

    cs.CV

    On the Discriminability of Self-Supervised Representation Learning

    Authors: Zeen Song, Wenwen Qiang, Changwen Zheng, Fuchun Sun, Hui Xiong

    Abstract: Self-supervised learning (SSL) has recently achieved significant success in downstream visual tasks. However, a notable gap still exists between SSL and supervised learning (SL), especially in complex downstream tasks. In this paper, we show that the features learned by SSL methods suffer from the crowding problem, where features of different classes are not distinctly separated, and features with… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  7. arXiv:2407.12415  [pdf, other

    cs.LG

    Not All Frequencies Are Created Equal:Towards a Dynamic Fusion of Frequencies in Time-Series Forecasting

    Authors: Xingyu Zhang, Siyu Zhao, Zeen Song, Huijie Guo, Jianqi Zhang, Changwen Zheng, Wenwen Qiang

    Abstract: Long-term time series forecasting is a long-standing challenge in various applications. A central issue in time series forecasting is that methods should expressively capture long-term dependency. Furthermore, time series forecasting methods should be flexible when applied to different scenarios. Although Fourier analysis offers an alternative to effectively capture reusable and periodic patterns… ▽ More

    Submitted 18 July, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: Accpeted by ACMMM2024

  8. arXiv:2406.11517  [pdf, other

    cs.LG cs.AI

    Revisiting Spurious Correlation in Domain Generalization

    Authors: Bin Qin, Jiangmeng Li, Yi Li, Xuesong Wu, Yupeng Wang, Wenwen Qiang, Jianwen Cao

    Abstract: Without loss of generality, existing machine learning techniques may learn spurious correlation dependent on the domain, which exacerbates the generalization of models in out-of-distribution (OOD) scenarios. To address this issue, recent works build a structural causal model (SCM) to describe the causality within data generation process, thereby motivating methods to avoid the learning of spurious… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  9. arXiv:2406.11501  [pdf, other

    cs.LG cs.AI stat.ME

    Teleporter Theory: A General and Simple Approach for Modeling Cross-World Counterfactual Causality

    Authors: Jiangmeng Li, Bin Qin, Qirui Ji, Yi Li, Wenwen Qiang, Jianwen Cao, Fanjiang Xu

    Abstract: Leveraging the development of structural causal model (SCM), researchers can establish graphical models for exploring the causal mechanisms behind machine learning techniques. As the complexity of machine learning applications rises, single-world interventionism causal analysis encounters theoretical adaptation limitations. Accordingly, cross-world counterfactual approach extends our understanding… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  10. arXiv:2406.11490  [pdf, other

    cs.LG stat.ME

    Interventional Imbalanced Multi-Modal Representation Learning via $β$-Generalization Front-Door Criterion

    Authors: Yi Li, Jiangmeng Li, Fei Song, Qingmeng Zhu, Changwen Zheng, Wenwen Qiang

    Abstract: Multi-modal methods establish comprehensive superiority over uni-modal methods. However, the imbalanced contributions of different modalities to task-dependent predictions constantly degrade the discriminative performance of canonical multi-modal methods. Based on the contribution to task-dependent predictions, modalities can be identified as predominant and auxiliary modalities. Benchmark methods… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2405.15289  [pdf, other

    cs.CV

    Learning Invariant Causal Mechanism from Vision-Language Models

    Authors: Zeen Song, Siyu Zhao, Xingyu Zhang, Jiangmeng Li, Changwen Zheng, Wenwen Qiang

    Abstract: Large-scale pre-trained vision-language models such as CLIP have been widely applied to a variety of downstream scenarios. In real-world applications, the CLIP model is often utilized in more diverse scenarios than those encountered during its training, a challenge known as the out-of-distribution (OOD) problem. However, our experiments reveal that CLIP performs unsatisfactorily in certain domains… ▽ More

    Submitted 12 August, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  12. arXiv:2405.01053  [pdf, other

    cs.LG cs.AI

    Explicitly Modeling Universality into Self-Supervised Learning

    Authors: Jingyao Wang, Wenwen Qiang, Zeen Song, Lingyu Si, Jiangmeng Li, Changwen Zheng, Bing Su

    Abstract: The goal of universality in self-supervised learning (SSL) is to learn universal representations from unlabeled data and achieve excellent performance on all samples and tasks. However, these methods lack explicit modeling of the universality in the learning objective, and the related theoretical understanding remains limited. This may cause models to overfit in data-scarce situations and generali… ▽ More

    Submitted 23 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 28 pages, submitted to ICML24 with 7766

  13. arXiv:2404.12024  [pdf, other

    cs.CV

    Meta-Auxiliary Learning for Micro-Expression Recognition

    Authors: Jingyao Wang, Yunhan Tian, Yuxuan Yang, Xiaoxin Chen, Changwen Zheng, Wenwen Qiang

    Abstract: Micro-expressions (MEs) are involuntary movements revealing people's hidden feelings, which has attracted numerous interests for its objectivity in emotion detection. However, despite its wide applications in various scenarios, micro-expression recognition (MER) remains a challenging problem in real life due to three reasons, including (i) data-level: lack of data and imbalanced classes, (ii) feat… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 10 pages, 7 figures, 3 tables

  14. arXiv:2404.10337  [pdf, other

    cs.AI

    Intriguing Properties of Positional Encoding in Time Series Forecasting

    Authors: Jianqi Zhang, Jingyao Wang, Wenwen Qiang, Fanjiang Xu, Changwen Zheng, Fuchun Sun, Hui Xiong

    Abstract: Transformer-based methods have made significant progress in time series forecasting (TSF). They primarily handle two types of tokens, i.e., temporal tokens that contain all variables of the same timestamp, and variable tokens that contain all input time points for a specific variable. Transformer-based methods rely on positional encoding (PE) to mark tokens' positions, facilitating the model to pe… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  15. arXiv:2403.01549  [pdf, other

    cs.CV

    Self-Supervised Representation Learning with Meta Comprehensive Regularization

    Authors: Huijie Guo, Ying Ba, Jie Hu, Lingyu Si, Wenwen Qiang, Lei Shi

    Abstract: Self-Supervised Learning (SSL) methods harness the concept of semantic invariance by utilizing data augmentation strategies to produce similar representations for different deformations of the same input. Essentially, the model captures the shared information among multiple augmented views of samples, while disregarding the non-shared information that may be beneficial for downstream tasks. To add… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  16. arXiv:2401.14166  [pdf, other

    cs.CL cs.AI

    BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction

    Authors: Jiangmeng Li, Fei Song, Yifan Jin, Wenwen Qiang, Changwen Zheng, Fuchun Sun, Hui Xiong

    Abstract: As a novel and effective fine-tuning paradigm based on large-scale pre-trained language models (PLMs), prompt-tuning aims to reduce the gap between downstream tasks and pre-training objectives. While prompt-tuning has yielded continuous advancements in various tasks, such an approach still remains a persistent defect: prompt-tuning methods fail to generalize to specific few-shot patterns. From the… ▽ More

    Submitted 20 March, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR2024

  17. arXiv:2312.14222  [pdf, other

    cs.LG cs.AI

    Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning

    Authors: Jiangmeng Li, Yifan Jin, Hang Gao, Wenwen Qiang, Changwen Zheng, Fuchun Sun

    Abstract: Graph contrastive learning (GCL) aims to align the positive features while differentiating the negative features in the latent space by minimizing a pair-wise contrastive loss. As the embodiment of an outstanding discriminative unsupervised graph representation learning approach, GCL achieves impressive successes in various graph benchmarks. However, such an approach falls short of recognizing the… ▽ More

    Submitted 25 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  18. arXiv:2312.05771  [pdf, other

    cs.LG stat.ML

    Hacking Task Confounder in Meta-Learning

    Authors: Jingyao Wang, Yi Ren, Zeen Song, Jianqi Zhang, Changwen Zheng, Wenwen Qiang

    Abstract: Meta-learning enables rapid generalization to new tasks by learning knowledge from various tasks. It is intuitively assumed that as the training progresses, a model will acquire richer knowledge, leading to better generalization performance. However, our experiments reveal an unexpected result: there is negative knowledge transfer between tasks, affecting generalization performance. To explain thi… ▽ More

    Submitted 29 May, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted by IJCAI 2024, 9 pages, 5 figures, 4 tables

  19. arXiv:2308.15724  [pdf, other

    cs.CV

    Background Debiased SAR Target Recognition via Causal Interventional Regularizer

    Authors: Hongwei Dong, Fangzhou Han, Lingyu Si, Wenwen Qiang, Lamei Zhang

    Abstract: Recent studies have utilized deep learning (DL) techniques to automatically extract features from synthetic aperture radar (SAR) images, which shows great promise for enhancing the performance of SAR automatic target recognition (ATR). However, our research reveals a previously overlooked issue: SAR images to be recognized include not only the foreground (i.e., the target), but also a certain size… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 38 pages, 8 figures

  20. arXiv:2308.14267  [pdf, other

    cs.LG cs.CV

    Unleash Model Potential: Bootstrapped Meta Self-supervised Learning

    Authors: Jingyao Wang, Zeen Song, Wenwen Qiang, Changwen Zheng

    Abstract: The long-term goal of machine learning is to learn general visual representations from a small amount of data without supervision, mimicking three advantages of human cognition: i) no need for labels, ii) robustness to data scarcity, and iii) learning from experience. Self-supervised learning and meta-learning are two promising techniques to achieve this goal, but they both only partially capture… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: submitted to NIPS

  21. arXiv:2308.10522  [pdf, other

    cs.CV cs.LG eess.IV

    Information Theory-Guided Heuristic Progressive Multi-View Coding

    Authors: Jiangmeng Li, Hang Gao, Wenwen Qiang, Changwen Zheng

    Abstract: Multi-view representation learning aims to capture comprehensive information from multiple views of a shared context. Recent works intuitively apply contrastive learning to different views in a pairwise manner, which is still scalable: view-specific noise is not filtered in learning view-shared representations; the fake negative pairs, where the negative terms are actually within the same class as… ▽ More

    Submitted 23 August, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: This paper is accepted by the jourcal of Neural Networks (Elsevier) by 2023. arXiv admin note: substantial text overlap with arXiv:2109.02344

  22. Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization

    Authors: Yujie Zhou, Wenwen Qiang, Anyi Rao, Ning Lin, Bing Su, Jiaqi Wang

    Abstract: Zero-shot skeleton-based action recognition aims to recognize actions of unseen categories after training on data of seen categories. The key is to build the connection between visual and semantic space from seen to unseen classes. Previous studies have primarily focused on encoding sequences into a singular feature vector, with subsequent mapping the features to an identical anchor point within t… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  23. Spatio-Temporal Branching for Motion Prediction using Motion Increments

    Authors: Jiexin Wang, Yujie Zhou, Wenwen Qiang, Ying Ba, Bing Su, Ji-Rong Wen

    Abstract: Human motion prediction (HMP) has emerged as a popular research topic due to its diverse applications, but it remains a challenging task due to the stochastic and aperiodic nature of future poses. Traditional methods rely on hand-crafted features and machine learning techniques, which often struggle to model the complex dynamics of human motion. Recent deep learning-based methods have achieved suc… ▽ More

    Submitted 17 July, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: The incremental information of our paper includes the displacement information from the last frame of the historical sequence, derived from the motion information of the first frame in the future sequence and the motion information of the last frame of the historical sequence. This implicitly contains future information, inadvertently giving an unfair advantage in the human motion prediction task

    Journal ref: ACM MM 2023

  24. arXiv:2307.08924  [pdf, other

    cs.LG cs.CV

    Towards Task Sampler Learning for Meta-Learning

    Authors: Jingyao Wang, Wenwen Qiang, Xingzhe Su, Changwen Zheng, Fuchun Sun, Hui Xiong

    Abstract: Meta-learning aims to learn general knowledge with diverse training tasks conducted from limited data, and then transfer it to new tasks. It is commonly believed that increasing task diversity will enhance the generalization ability of meta-learning models. However, this paper challenges this view through empirical and theoretical analysis. We obtain three conclusions: (i) there is no universal ta… ▽ More

    Submitted 2 June, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: accepted by IJCV

  25. arXiv:2307.08913  [pdf, ps, other

    cs.LG cs.CV

    Towards the Sparseness of Projection Head in Self-Supervised Learning

    Authors: Zeen Song, Xingzhe Su, Jingyao Wang, Wenwen Qiang, Changwen Zheng, Fuchun Sun

    Abstract: In recent years, self-supervised learning (SSL) has emerged as a promising approach for extracting valuable representations from unlabeled data. One successful SSL method is contrastive learning, which aims to bring positive examples closer while pushing negative examples apart. Many current contrastive learning approaches utilize a parameterized projection head. Through a combination of empirical… ▽ More

    Submitted 19 July, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 9 pages,3 figures

  26. arXiv:2307.08199  [pdf, other

    cs.CV

    Unbiased Image Synthesis via Manifold Guidance in Diffusion Models

    Authors: Xingzhe Su, Daixi Jia, Fengge Wu, Junsuo Zhao, Changwen Zheng, Wenwen Qiang

    Abstract: Diffusion Models are a potent class of generative models capable of producing high-quality images. However, they often inadvertently favor certain data attributes, undermining the diversity of generated images. This issue is starkly apparent in skewed datasets like CelebA, where the initial dataset disproportionately favors females over males by 57.9%, this bias amplified in generated data where f… ▽ More

    Submitted 15 April, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

  27. arXiv:2306.15977  [pdf, other

    cs.CV cs.AI

    A Dimensional Structure based Knowledge Distillation Method for Cross-Modal Learning

    Authors: Lingyu Si, Hongwei Dong, Wenwen Qiang, Junzhi Yu, Wenlong Zhai, Changwen Zheng, Fanjiang Xu, Fuchun Sun

    Abstract: Due to limitations in data quality, some essential visual tasks are difficult to perform independently. Introducing previously unavailable information to transfer informative dark knowledge has been a common way to solve such hard tasks. However, research on why transferred knowledge works has not been extensively explored. To address this issue, in this paper, we discover the correlation between… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  28. arXiv:2305.19507  [pdf, other

    cs.CV eess.IV

    Manifold Constraint Regularization for Remote Sensing Image Generation

    Authors: Xingzhe Su, Changwen Zheng, Wenwen Qiang, Fengge Wu, Junsuo Zhao, Fuchun Sun, Hui Xiong

    Abstract: Generative Adversarial Networks (GANs) have shown notable accomplishments in remote sensing domain. However, this paper reveals that their performance on remote sensing images falls short when compared to their impressive results with natural images. This study identifies a previously overlooked issue: GANs exhibit a heightened susceptibility to overfitting on remote sensing images.To address this… ▽ More

    Submitted 28 March, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

  29. arXiv:2303.05240  [pdf, other

    cs.CV eess.IV

    Intriguing Property and Counterfactual Explanation of GAN for Remote Sensing Image Generation

    Authors: Xingzhe Su, Wenwen Qiang, Jie Hu, Fengge Wu, Changwen Zheng, Fuchun Sun

    Abstract: Generative adversarial networks (GANs) have achieved remarkable progress in the natural image field. However, when applying GANs in the remote sensing (RS) image generation task, an extraordinary phenomenon is observed: the GAN model is more sensitive to the size of training data for RS image generation than for natural image generation. In other words, the generation quality of RS images will cha… ▽ More

    Submitted 14 May, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

  30. arXiv:2301.08496  [pdf, other

    cs.LG

    Introducing Expertise Logic into Graph Representation Learning from A Causal Perspective

    Authors: Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Xingzhe Su, Fengge Wu, Changwen Zheng, Fuchun Sun

    Abstract: Benefiting from the injection of human prior knowledge, graphs, as derived discrete data, are semantically dense so that models can efficiently learn the semantic information from such data. Accordingly, graph neural networks (GNNs) indeed achieve impressive success in various fields. Revisiting the GNN learning paradigms, we discover that the relationship between human expertise and the knowledge… ▽ More

    Submitted 23 May, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  31. arXiv:2209.15278  [pdf, other

    cs.LG cs.AI cs.NE

    Rethinking skip connection model as a learnable Markov chain

    Authors: Dengsheng Chen, Jie Hu, Wenwen Qiang, Xiaoming Wei, Enhua Wu

    Abstract: Over past few years afterward the birth of ResNet, skip connection has become the defacto standard for the design of modern architectures due to its widespread adoption, easy optimization and proven performance. Prior work has explained the effectiveness of the skip connection mechanism from different perspectives. In this work, we deep dive into the model's behaviors with skip connections which c… ▽ More

    Submitted 2 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 12 pages, 4 figures

  32. arXiv:2209.07902  [pdf, other

    cs.LG cs.CV

    MetaMask: Revisiting Dimensional Confounder for Self-Supervised Learning

    Authors: Jiangmeng Li, Wenwen Qiang, Yanan Zhang, Wenyi Mo, Changwen Zheng, Bing Su, Hui Xiong

    Abstract: As a successful approach to self-supervised learning, contrastive learning aims to learn invariant information shared among distortions of the input sample. While contrastive learning has yielded continuous advancements in sampling strategy and architecture design, it still remains two persistent defects: the interference of task-irrelevant information and sample inefficiency, which are related to… ▽ More

    Submitted 9 August, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Accepted by NeurIPS 2022 as Spotlight

  33. Modeling Multiple Views via Implicitly Preserving Global Consistency and Local Complementarity

    Authors: Jiangmeng Li, Wenwen Qiang, Changwen Zheng, Bing Su, Farid Razzak, Ji-Rong Wen, Hui Xiong

    Abstract: While self-supervised learning techniques are often used to mining implicit knowledge from unlabeled data via modeling multiple views, it is unclear how to perform effective representation learning in a complex and inconsistent context. To this end, we propose a methodology, specifically consistency and complementarity network (CoCoNet), which avails of strict global inter-view consistency and loc… ▽ More

    Submitted 9 August, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE) 2022; Refer to https://ieeexplore.ieee.org/document/9857632

  34. arXiv:2208.12681  [pdf, other

    cs.CV

    Disentangle and Remerge: Interventional Knowledge Distillation for Few-Shot Object Detection from A Conditional Causal Perspective

    Authors: Jiangmeng Li, Yanan Zhang, Wenwen Qiang, Lingyu Si, Chengbo Jiao, Xiaohui Hu, Changwen Zheng, Fuchun Sun

    Abstract: Few-shot learning models learn representations with limited human annotations, and such a learning paradigm demonstrates practicability in various tasks, e.g., image classification, object detection, etc. However, few-shot object detection methods suffer from an intrinsic defect that the limited training data makes the model cannot sufficiently explore semantic information. To tackle this, we intr… ▽ More

    Submitted 9 December, 2022; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: Accepted by AAAI 2023

  35. arXiv:2208.08584  [pdf, other

    cs.LG stat.ME

    Robust Causal Graph Representation Learning against Confounding Effects

    Authors: Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Bing Xu, Changwen Zheng, Fuchun Sun

    Abstract: The prevailing graph neural network models have achieved significant progress in graph representation learning. However, in this paper, we uncover an ever-overlooked phenomenon: the pre-trained graph representation learning model tested with full graphs underperforms the model tested with well-pruned graphs. This observation reveals that there exist confounders in graphs, which may interfere with… ▽ More

    Submitted 10 February, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: Accepted by AAAI 2023 as Oral Presentation

  36. arXiv:2206.14702  [pdf, other

    cs.CV

    Interventional Contrastive Learning with Meta Semantic Regularizer

    Authors: Wenwen Qiang, Jiangmeng Li, Changwen Zheng, Bing Su, Hui Xiong

    Abstract: Contrastive learning (CL)-based self-supervised learning models learn visual representations in a pairwise manner. Although the prevailing CL model has achieved great progress, in this paper, we uncover an ever-overlooked phenomenon: When the CL model is trained with full images, the performance tested in full images is better than that in foreground areas; when the CL model is trained with foregr… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted by ICML 2022

  37. arXiv:2205.11100  [pdf, other

    cs.CV

    Supporting Vision-Language Model Inference with Confounder-pruning Knowledge Prompt

    Authors: Jiangmeng Li, Wenyi Mo, Wenwen Qiang, Bing Su, Changwen Zheng, Hui Xiong, Ji-Rong Wen

    Abstract: Vision-language models are pre-trained by aligning image-text pairs in a common space to deal with open-set visual concepts. To boost the transferability of the pre-trained models, recent works adopt fixed or learnable prompts, i.e., classification weights are synthesized from natural language describing task-relevant categories, to reduce the gap between tasks in the training and test phases. How… ▽ More

    Submitted 23 March, 2024; v1 submitted 23 May, 2022; originally announced May 2022.

  38. arXiv:2203.05119  [pdf, other

    cs.CV

    MetAug: Contrastive Learning via Meta Feature Augmentation

    Authors: Jiangmeng Li, Wenwen Qiang, Changwen Zheng, Bing Su, Hui Xiong

    Abstract: What matters for contrastive learning? We argue that contrastive learning heavily relies on informative features, or "hard" (positive or negative) features. Early works include more informative features by applying complex data augmentations and large batch size or memory bank, and recent works design elaborate sampling approaches to explore informative features. The key challenge toward exploring… ▽ More

    Submitted 9 August, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted by ICML 2022

  39. Robust Local Preserving and Global Aligning Network for Adversarial Domain Adaptation

    Authors: Wenwen Qiang, Jiangmeng Li, Changwen Zheng, Bing Su, Hui Xiong

    Abstract: Unsupervised domain adaptation (UDA) requires source domain samples with clean ground truth labels during training. Accurately labeling a large number of source domain samples is time-consuming and laborious. An alternative is to utilize samples with noisy labels for training. However, training with noisy labels can greatly reduce the performance of UDA. In this paper, we address the problem that… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE) 2022; Refer to https://ieeexplore.ieee.org/document/9540279

  40. arXiv:2201.03812  [pdf, other

    cs.LG cs.AI

    Bootstrapping Informative Graph Augmentation via A Meta Learning Approach

    Authors: Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Fuchun Sun, Changwen Zheng

    Abstract: Recent works explore learning graph representations in a self-supervised manner. In graph contrastive learning, benchmark methods apply various graph augmentation approaches. However, most of the augmentation methods are non-learnable, which causes the issue of generating unbeneficial augmented graphs. Such augmentation may degenerate the representation ability of graph contrastive learning method… ▽ More

    Submitted 26 May, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: Accepted by International Joint Conference on Artificial Intelligence (IJCAI) 2022

  41. arXiv:2109.02344   

    cs.CV cs.AI cs.LG

    Information Theory-Guided Heuristic Progressive Multi-View Coding

    Authors: Jiangmeng Li, Wenwen Qiang, Hang Gao, Bing Su, Farid Razzak, Jie Hu, Changwen Zheng, Hui Xiong

    Abstract: Multi-view representation learning captures comprehensive information from multiple views of a shared context. Recent works intuitively apply contrastive learning (CL) to learn representations, regarded as a pairwise manner, which is still scalable: view-specific noise is not filtered in learning view-shared representations; the fake negative pairs, where the negative terms are actually within the… ▽ More

    Submitted 21 August, 2023; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: We have uploaded a new version of this paper in arXiv:2308.10522, so that we have to withdrawal this paper

  42. arXiv:2107.03799  [pdf, other

    cs.CR

    Contrastive Learning for Robust Android Malware Familial Classification

    Authors: Yueming Wu, Shihan Dou, Deqing Zou, Wei Yang, Weizhong Qiang, Hai Jin

    Abstract: Due to its open-source nature, Android operating system has been the main target of attackers to exploit. Malware creators always perform different code obfuscations on their apps to hide malicious activities. Features extracted from these obfuscated samples through program analysis contain many useless and disguised features, which leads to many false negatives. To address the issue, in this pape… ▽ More

    Submitted 31 October, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

  43. arXiv:1903.12617  [pdf

    cs.HC

    Some Experimental Results of Relieving Discomfort in Virtual Reality by Disturbing Feedback Loop in Human Brain

    Authors: Wei Qionghua, Wang Hui, Wei Qiang

    Abstract: Recently, great progress has been made in virtual reality(VR) research and application. However, virtual reality faces a big problem since its appearance, i.e. discomfort (nausea, stomach awareness, etc). Discomfort can be relieved by increasing hardware (sensor, cpu and display) speed. But this will increase cost. This paper gives another low cost solution. The phenomenon of cybersickness is expl… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.