CLUSTERIZATION OF DATA ARRAYS BASED ON THE MODIFIED GRAY WOLF ALGORITHM
DOI:
https://doi.org/10.15588/1607-3274-2023-1-7Keywords:
fuzzy clustering, multi-extremal optimization, evolutionary methodAbstract
Context. The task of clustering arrays of multidimensional data, the main goal of which is to find classes of observations that are homogeneous in the sense of the accepted metric, is an important part of the intelligent data analysis of Data Mining. From a computational point of view, the problem of clustering turns into the problem of finding local extrema of a multiextreme function, which are repeatedly started from different points of the original data array. To speed up the process of finding these extrema using the ideas of evolutionary optimization, which includes algorithms inspired by nature, swarm algorithms, population algorithms, etc.
Objective. The purpose of the work is to introduce a procedure for clustering data arrays based on the improved gray wolf algorithm.
Method. A method of clustering data arrays based on the modified gray wolf algorithm is introduced. The advantage of the proposed approach is a reduction in the time of solving optimization problems in conditions where clusters are overlap. A feature of the proposed method is computational simplicity and high speed, due to the fact that the entire array is processed only once, that is, eliminates the need for multi-era self-learning, implemented in traditional fuzzy clustering algorithms.
Results. The results of the experiments confirm the effectiveness of the proposed approach in clustering problems under the condition of classes that overlap and allow us to recommend the proposed method for use in practice to solve problems of automatic clustering big data.
Conclusions. A method of clustering data arrays based on the modified gray wolf algorithm is introduced. The advantage of the proposed approach is the reduction of time for solving optimization problems. The results of the experiments confirm the effectiveness of the proposed approach in clustering problems under the conditions of overlapping clusters.
References
Gan G., Ma Ch., Wu J. Data Clustering: Theory, Algorithms and Applications. Philadiphia, Pensilvania, SIAM, 2007, 455 p. DOI: https://doi.org/10.1137/ 1.9780898718348
Abonyi J., Feil D. Cluster Analisis for Data Mining and System Identification. Basel, Birlhause, 2007, 303 p.
Xu R., Wunsch D. C. Clustering. Hoboken N. J., John Wiley & Sons, Inc., 2009, 398 p.
Höppner F., Klawonn F., Kruse R., Runkler T. Fuzzy Clustering Analysis: Methods for Classification, Data Analisys and Image Recognition. Chichester, John Wiley &Sons, 1999, 300 p.
Bezdek J. C. et al. Fuzzy models and algorithms for pattern recognition and image processing. Springer Science & Business Media, 1999, Т. 4.
Engelbrecht A. Computational intelligence: an introduction. Sidney, John Wiley & Sons, 2007, 597 p.
Rutkowski L. Computational Intelligence Methods and Techniques. Berlin Heidelberg, Springer-Verlag, 2008, 514 p.
Kroll A. Computational Intelligence. Eine Einfürung in Problelme, Methoden and Tchnische Anwendungen. München, Oldenbourg Verlag, 2013, 428 p.
Bezdek J. C., Keller J., Krishnapuram R., Pal N. R. Fuzzy Models and Algorithms for Pattern Recognition and Image Processing. N.Y., Springer Science + Business Media, Inc., 2015, 776 p.
Mumford C. L., Jain L. C. Computational Intelligence. Berlin, Springer-Verlag, 2009, 729 p.
Kroll A. Computational Intelligence. Eine Einfürung in Problelme, Methoden and Tchnische Anwendungen. München, Oldenbourg Verlag, 2013, 428 p.
Mirjalili S. The ant lion optimizer, Advances in Engineering Software, 2015, Vol. 83, pp. 80–98. doi: https://doi.org/10.1016/j.advengsoft.2015.01.010.
Yang X. S., Chen S. F., and Ting T. O. Bio-inspired Computation in Telecommunications. Morgan Kaufmann, Boston, MA, USA, 2015.
Syberfeldt A., Lidberg S. Real-world simulation-based manufacturing optimizations using cuckoo search, Proceedings of the 2012 Winter Simulation Conference (WSC). Berlin, Germany, December 2012, proceedings, pp. 1–12. doi: 10.1109/WSC.2012.6465158
Coelho L. D. S. and Mariani V. C. Improved firefly algorithm approach applied to chiller loading for energy conservation, Energy and Buildings, 2013, Vol. 59, pp. 273–278. doi: https://doi.org/10.1016/j.enbuild. 2012.11.030
Juan Z., Zheng-Ming G. The Bat Algorithm and Its Parameters, Electronics, Communications and Networks IV. CRC Press, Boca Raton, FL, USA, 2015.
Yu. J. J. Q., Li V. O. K. A social spider algorithm for global optimization, Applied Soft Computing, 2015, Vol. 30, pp. 614– 627. doi: https://doi.org/10.48550/ arXiv.1502.02407
Azizi R. Empirical study of artificial fish swarm algorithm, International Journal of Computing, Communications and Networking, 2014, Vol. 3, No. 1–3, pp. 1–7.
Yan-Xia L., Lin L., and Zhaoyang Improved ant colony algorithm for evaluation of graduates, Physical conditions, measuring technology and mechatronics automation (ICMTMA): proceedings of the 2014 Sixth International Conference on Measuring Technology and Mechatronics Automation. Zhangjiajie, China, January 2014, pp. 333–336.
Xiu Z., Xin Z., Ho S. L., and Fu W. N. A modification of artificial bee colony algorithm applied to loudspeaker design problem, IEEE Transactions on Magnetics, 2014, Vol. 50, No. 2, pp. 737–740. doi: 10.1109/TMAG.2013. 2281818.
Marichelvam M. K., Prabaharan T., and Yang X. S. A discrete firefly algorithm for the multi-objective hybrid flowshop scheduling problems, IEEE Transactions on Evolutionary Computation, 2014, Vol. 18, No. 2, pp. 301–305.
Hathaway R. J., Bezdek J. C. Optimization of clustering criteria by reformulation, IEEE Transactions Fuzzy Systems, 1995, No. 3, P.241–245.
Pal N. R., Bezdek J. C., Hathaway R. J. Sequental competitive learning algorithm, Neural Networks, 1996, Vol. 9, № 5 pp. 787–796.
Mirjalili S. M. and Lewis A. Grey wolf optimizer, Advances in Engineering Software, 2014, Vol. 69, pp. 46–61.
Bodyanskiy Ye. V., Pliss I. P., Shafronenko A. Yu. Clusterization of data arrays based on combined optimization of distribution density functions and the evolutionary method of cat swarm, Radio Electronics, Computer Science, Control, 2022, №4, pp. 61–70. doi: 10.15588/1607-3274-2022-4-5
Bodyanskiy Y. V., Shafronenko A. Y. , Pliss I. P. Credibilistic fuzzy clustering based on evolutionary method of crazy cats, System Research and Information Technologies, 2021 (3), pp. 110–119.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 А. Ю. Шафроненко, Є. В. Бодянський, О. О. Головін
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Creative Commons Licensing Notifications in the Copyright Notices
The journal allows the authors to hold the copyright without restrictions and to retain publishing rights without restrictions.
The journal allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles.
The journal allows to reuse and remixing of its content, in accordance with a Creative Commons license СС BY -SA.
Authors who publish with this journal agree to the following terms:
-
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License CC BY-SA that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
-
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
-
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.