Balanced k-Means

Chen-Ling Tai¹⁷ &
Chen-Shu Wang¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10192))

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

2383 Accesses
2 Citations

Abstract

K-Means is a very common method of unsupervised learning in data mining. It is introduced by Steinhaus in 1956. As time flies, many other enhanced methods of k-Means have been introduced and applied. One of the significant characteristic of k-Means is randomize. Thus, this paper proposes a balanced k-Means method, which means number of items distributed within clusters are more balanced, provide more equal-sized clusters. Cases those are suitable to apply this method are also discussed, such as Travelling Salesman Problem (TSP). In order to enhance the performance and usability, we are in the process of proposing a learning ability of this method in the future.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Subset K-Means Approach for Handling Imbalanced-Distributed Data

An Enhancing K-Means Algorithm Based on Sorting and Partition

Kernel k’-means Algorithm for Clustering Analysis

Notes

1.
Travelling salesman problem, https://en.wikipedia.org/wiki/Travelling_salesman_problem.

References

Likas, A., Vlassis, N., Verbeek, J.: The global k-Means clustering algorithm. Technical report 12 (2011)
Google Scholar
Ball, G., Hall, D.: ISODATA, a novel method of data analysis and pattern classification. Technical report NTIS AD 699616. Stanford Research Institute, Stanford, CA (1965)
Google Scholar
Drineas, P., Frieze, A., Kannan, R., Vempala, S., Vinay, V.: Clustering large graphs via the singular value decomposition. Mach. Learn. 56(1–3), 9–33 (1999)
MATH Google Scholar
He, R., Xu, W., Sun, J., Zu, B.: Balanced k-Means algorithm for partitioning areas in large-scale vehicle routing problem. In: Third International Symposium on IEEE Intelligent Information Technology Application, IITA 2009, vol. 3, pp. 87–90 (2009)
Google Scholar
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)
Article Google Scholar
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice Hall, Upper Saddle River (1988)
MATH Google Scholar
Jain, A.K.: Data clustering: 50 years beyond k-Means. Pattern Recogn. Lett. 31(8), 651–666 (2010)
Article Google Scholar
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inform. Theory 28, 129–137 (1982)
Article MathSciNet MATH Google Scholar
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Fifth Berkeley Symposium on Mathematics, Statistics and Probability, pp. 281–297. University of California Press (1967)
Google Scholar
Meila, M.: The uniqueness of a good optimum for k-Means. In: Proceedings of 23rd International Conference Machine Learning, pp. 625–632 (2006)
Google Scholar
Steinhaus, H.: Sur La Division des Corp Materiels En Parties. Bull. Acad. Polon. Sci. IV(C1.III), 801–804 (1956)
MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Finance Management, National Taipei University of Technology, Taipei, Taiwan
Chen-Ling Tai & Chen-Shu Wang

Authors

Chen-Ling Tai
View author publications
You can also search for this author in PubMed Google Scholar
Chen-Shu Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chen-Shu Wang .

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wroclaw, Poland
Ngoc Thanh Nguyen
Japan Advanced Institute of Science and Technology, Nomi, Japan
Satoshi Tojo
Japan Advanced Institute of Science and Technology, Nomi, Japan
Le Minh Nguyen
Wrocław University of Science and Technology, Wroclaw, Poland
Bogdan Trawiński

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tai, CL., Wang, CS. (2017). Balanced k-Means. In: Nguyen, N., Tojo, S., Nguyen, L., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2017. Lecture Notes in Computer Science(), vol 10192. Springer, Cham. https://doi.org/10.1007/978-3-319-54430-4_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-54430-4_8
Published: 26 February 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54429-8
Online ISBN: 978-3-319-54430-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Subset K-Means Approach for Handling Imbalanced-Distributed Data

An Enhancing K-Means Algorithm Based on Sorting and Partition

Kernel k’-means Algorithm for Clustering Analysis

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Balanced k-Means

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Subset K-Means Approach for Handling Imbalanced-Distributed Data

An Enhancing K-Means Algorithm Based on Sorting and Partition

Kernel k’-means Algorithm for Clustering Analysis

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation