Search Page | SpringerLink

Reduction Through Homogeneous Clustering: Variations for Categorical Data and Fast Data Reduction

Reduction through Homogeneous Clustering (RHC) and its editing variant (ERHC) represent effective methods for reducing data in the context of...

Stefanos Ougiaroglou, Nikolaos Papadimitriou, Georgios Evangelidis in SN Computer Science

Article 25 June 2024

Consistency-oriented clustering ensemble via data reconstruction

The study highlights that using different distance measures on the same dataset leads to varying clustering results, making the choice of distance...

Hengshan Zhang, Yun Wang, ... Jiaze Sun in Applied Intelligence

Article 19 July 2024

An Improved Water Flow Optimizer for Data Clustering

Recently, various meta-heuristic algorithms have been considered to allocate the data into different clusters based on similar information. These...

Prateek Thakral, Yugal Kumar in SN Computer Science

Article 17 July 2024

Clustering-based visualizations for diagnosing diseases on metagenomic data

Metagenomic data has recently become crucial for precision or personalized medicine. However, these data are often complex, challenging to observe...

Hai Thanh Nguyen, Trang Huyen Phan, ... Ngoc Huynh Pham in Signal, Image and Video Processing

Article 17 June 2024

Split incremental clustering algorithm of mixed data stream

Clustering has been recognized as one of the most prominent functions in data mining. It aims to partition a given set of elements into homogeneous...

Siwar Gorrab, Fahmi Ben Rejab, Kaouther Nouira in Progress in Artificial Intelligence

Article 07 March 2024

Clustering from Data Streams

Clustering is one of the most popular data mining techniques. In this article, we review the relevant methods and algorithms for designing cluster...

João Gama in Encyclopedia of Machine Learning and Data Science

Living reference work entry 2024

Semi-supervised sparse representation collaborative clustering of incomplete data

Sparse subspace clustering (SSC) focuses on revealing the structure and distribution of high dimensional data from an algebraic perspective. It is a...

Tingquan Deng, Jingyu Wang, ... Ming Yang in Applied Intelligence

Article 02 December 2023

Robust and compact maximum margin clustering for high-dimensional data

In the field of machine learning, clustering has become an increasingly popular research topic due to its critical importance. Many clustering...

Hakan Cevikalp, Edward Chome in Neural Computing and Applications

Article Open access 17 January 2024

Randomized self-updating process for clustering large-scale data

This paper introduces the randomized self-updating process (rSUP) algorithm for clustering large-scale data. rSUP is an extension of the...

Shang-Ying Shiu, Yen-Shiu Chin, ... Ting-Li Chen in Statistics and Computing

Article 24 November 2023

An effective clustering scheme for high-dimensional data

While the classical K -means algorithm has been widely used in many fields, it still has some defects. Therefore, this paper proposes a scheme to...

Xuansen He, Fan He, ... Allam Maalla in Multimedia Tools and Applications

Article 19 October 2023

DDSC-SMOTE: an imbalanced data oversampling algorithm based on data distribution and spectral clustering

Imbalanced data poses a significant challenge in machine learning, as conventional classification algorithms often prioritize majority class samples,...

Xinqi Li, Qicheng Liu in The Journal of Supercomputing

Article 02 May 2024

Density-Based Clustering for Incomplete Data

In real world, missing values exist in a lot of data sets and cause data incompleteness. However, traditional missing value imputation methods are...

Zhixin Qi, Hongzhi Wang, Zejiao Dong in Dirty Data Processing for Machine Learning

Chapter 2024

A Survey and Experimental Review on Data Distribution Strategies for Parallel Spatial Clustering Algorithms

The advent of Big Data has led to the rapid growth in the usage of parallel clustering algorithms that work over distributed computing frameworks...

Jagat Sesh Challa, Navneet Goyal, ... Poonam Goyal in Journal of Computer Science and Technology

Article 01 May 2024

Efficient Clustering on Encrypted Data

Clustering is a significant unsupervised machine learning task widely used for data mining and analysis. Fully homomorphic encryption allows data...

Mengyu Zhang, Long Wang, ... Han Bao in Applied Cryptography and Network Security

Conference paper 2024

Penalized model-based clustering of complex functional data

High dimensional data, large-scale data, imaging and manifold data are all fostering new frontiers of statistics. These type of data are commonly...

Nicola Pronello, Rosaria Ignaccolo, ... Sara Fontanella in Statistics and Computing

Article Open access 25 August 2023

Model-based clustering with missing not at random data

Model-based unsupervised learning, as any learning task, stalls as soon as missing data occurs. This is even more true when the missing data are...

Aude Sportisse, Matthieu Marbac, ... Christophe Biernacki in Statistics and Computing

Article 18 June 2024

Explainable AI for Mixed Data Clustering

Clustering, an unsupervised machine learning approach, aims to find groups of similar instances. Mixed data clustering is of particular interest...

Jonas Amling, Stephan Scheele, ... Ute Schmid in Explainable Artificial Intelligence

Conference paper 2024

Data clustering: application and trends

Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no...

Gbeminiyi John Oyewole, George Alex Thopil in Artificial Intelligence Review

Article 27 November 2022

CPOCEDS-concept preserving online clustering for evolving data streams

Clustering streaming data is challenging due to many temporal dynamics, such as concept drift, concept evolution, and feature evolution. Concept...

K. T. Jafseer, S. Shailesh, A. Sreekumar in Cluster Computing

Article 28 August 2023

An efficient meta-heuristic algorithm based on water flow optimizer for data clustering

Clustering is a popular data analysis technique that can explore the structure of data through cluster analysis. Similar data are put into the same...

Ramesh Chandra Sahoo, Tapas Kumar, ... Sanjay Singh in The Journal of Supercomputing

Article 21 December 2023

Search

Filters

Search Results

Search

Navigation