Computer Science > Machine Learning

arXiv:2406.13187 (cs)

[Submitted on 19 Jun 2024]

Title:Boosting Consistency in Dual Training for Long-Tailed Semi-Supervised Learning

Authors:Kai Gan, Tong Wei, Min-Ling Zhang

Abstract:While long-tailed semi-supervised learning (LTSSL) has received tremendous attention in many real-world classification problems, existing LTSSL algorithms typically assume that the class distributions of labeled and unlabeled data are almost identical. Those LTSSL algorithms built upon the assumption can severely suffer when the class distributions of labeled and unlabeled data are mismatched since they utilize biased pseudo-labels from the model. To alleviate this problem, we propose a new simple method that can effectively utilize unlabeled data from unknown class distributions through Boosting cOnsistency in duAl Training (BOAT). Specifically, we construct the standard and balanced branch to ensure the performance of the head and tail classes, respectively. Throughout the training process, the two branches incrementally converge and interact with each other, eventually resulting in commendable performance across all classes. Despite its simplicity, we show that BOAT achieves state-of-the-art performance on a variety of standard LTSSL benchmarks, e.g., an averaged 2.7% absolute increase in test accuracy against existing algorithms when the class distributions of labeled and unlabeled data are mismatched. Even when the class distributions are identical, BOAT consistently outperforms many sophisticated LTSSL algorithms. We carry out extensive ablation studies to tease apart the factors that are the most important to the success of BOAT. The source code is available at this https URL.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2406.13187 [cs.LG]
	(or arXiv:2406.13187v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.13187

Submission history

From: Kai Gan [view email]
[v1] Wed, 19 Jun 2024 03:35:26 UTC (3,670 KB)

Computer Science > Machine Learning

Title:Boosting Consistency in Dual Training for Long-Tailed Semi-Supervised Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Boosting Consistency in Dual Training for Long-Tailed Semi-Supervised Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators