Natural local density-based adaptive oversampling algorithm for imbalanced classification

W Wang, L Yang, J Zhang, J Yang, D Tang…�- Knowledge-Based�…, 2024 - Elsevier
W Wang, L Yang, J Zhang, J Yang, D Tang, T Liu
Knowledge-Based Systems, 2024Elsevier
Oversampling-based methods achieve impressive performance for classifying imbalanced
data. However, many existing oversampling algorithms are still very sensitive to noise.
Besides, these algorithms also require one or more parameters, how to set these
parameters is very challenging. Furthermore, the generated samples by these algorithms
are meaningless or unsafe. To solve these problems, a novel oversampling algorithm for
unbalanced classification is presented, named Natural Local Density-based Adaptive�…
Abstract
Oversampling-based methods achieve impressive performance for classifying imbalanced data. However, many existing oversampling algorithms are still very sensitive to noise. Besides, these algorithms also require one or more parameters, how to set these parameters is very challenging. Furthermore, the generated samples by these algorithms are meaningless or unsafe. To solve these problems, a novel oversampling algorithm for unbalanced classification is presented, named Natural Local Density-based Adaptive Oversampling algorithm (NLDAO). NLDAO has four main advantages: (a) it does not need any parameter due to the use of natural neighbors to calculate local density; (b) it applies a noisy filter to remove noises, which makes the sample boundary cleaner so that the proposed method is robust to noise; (c) it unevenly distributes the generated samples to the minority class, which maintains the original characteristics and enhances the boundary classification ability; (d) it determines the generated region by tracking the sample proportion drop that declines the randomness of the synthesized and reduces the noisy generation. Finally, in experiments, we apply the NLDAO algorithm to artificial datasets to visually demonstrate its effectiveness. Moreover, intensive experiments on real datasets show that NLDAO can achieve better performance than state-of-the-art methods.
Elsevier
Showing the best result for this search. See all results