ONLINE FUZZY CLUSTERING OF INCOMPLETE DATA USING CREDIBILISTIC APPROACH AND SIMILARITY MEASURE OF SPECIAL TYPE
Keywords:fuzzy clustering, distorted data, credibilistic fuzzy clustering, similarity measure.
Context. In most clustering (classification without a teacher) tasks associated with real data processing, the initial information is usually distorted by abnormal outliers (noise) and gaps. It is clear that “classical” methods of artificial intelligence (both batch and online) are ineffective in this situation.The goal of the paper is to propose the procedure of fuzzy clustering of incomplete data using credibilistic approach and similarity measure of special type.
Objective. The goal of the work is credibilistic fuzzy clustering of distorted data, using of credibility theory.
Method. The procedure of fuzzy clustering of incomplete data using credibilistic approach and similarity measure of special type based on the use of both robust goal functions of a special type and similarity measures, insensitive to outliers and designed to work both in batch and its recurrent online version designed to solve Data Stream Mining problems when data are fed to processing sequentially in real time.
Results. The introduced methods are simple in numerical implementation and are free from the drawbacks inherent in traditional methods of probabilistic and possibilistic fuzzy clustering data distorted by abnormal outliers (noise) and gaps.
Conclusions. The conducted experiments have confirmed the effectiveness of proposed methods of credibilistic fuzzy clustering of distorted data operability and allow recommending it for use in practice for solving the problems of automatic clusterization of distorted data. The proposed method is intended for use in hybrid systems of computational intelligence and, above all, in the problems of learning artificial neural networks, neuro-fuzzy systems, as well as in the problems of clustering and classification.
Aggarwal C. C. Data Mining. Switzerland : Springer, 2015, 727 p. DOI https://doi.org/ 10.1007 / 978-3-319-14142-8.
Bezdek J.C. Pattern recognition with fuzzy objective function algorithms. New York: Springer, 1981, 253 p. DOI https://doi.org/10.1007/978-1-4757-0450-1.
Höppner F., Klawonn F., Kruse R., Runkler T. Fuzzy Clustering Analysis: Methods for Classification, Data Analisys and Image Recognition. Chichester, John Wiley &Sons, 1999, 300 p.
Xu R., D. C. Wunsch Clustering. Hoboken N. J., John Wiley & Sons, Inc., 2009, 398 p.
Krishnapuram R., Keller J. M. A Possibilistic Approach to Clustering, IEEE Transactions on Fuzzy Systems, May 1993: proceedings, IEEE, 1993, Vol. 1, pp. 98–110. DOI: 10.1109/91.227387.
Park D. C., Dagher I. Gradient based fuzzy c-means (GBFCM) algorithm, IEEE International Conference on Neural Networks, 28 June – 2July,1984: proceedings. Orlando, IEEE, 1984, pp. 1626–1631. DOI: 10.1109 / ICNN. 1994.374399.
Chung, F. L., Lee T. Fuzzy competitive learning, Neural Networks, 1994, Vol. 7, № 3, pp. 539–552. DOI: https://doi.org/10.1016/0893-6080(94)90111-2.
Bodyanskiy Ye. Computational intelligence techniques for data analisys, Lecture Notes in Informatics. Bonn, Gesellschaft für Informatik, 2005, pp. 15–36.
Hu Zh., Bodyanskiy Ye. V., Tyshchenko O. K. A deep cascade neuro-fuzzy system for high-dimensional online fuzzy clustering, 2016 IEEE First International Conference on Data Stream Mining & Processing (DSMP), Lviv, 23–27 August, 2016: proceedings. Lviv, IEEE, 2016, pp. 318–322. DOI: 10.1109/DSMP.2016.7583567.
Hu. Zh., Bodyanskiy Ye. V., Tyshchenko O. K. A cascade deep neuro-fuzzy system for high-dimensional online possibilistic fuzzy clustering, 2016 XI-th International Scientific and Technical Conference Computer Sciences and Information Technologies (CSIT), Lviv, 6–10 September, 2016: proceedings. Lviv, IEEE, 2016, pp. 119–122. DOI: 10.1109/STC-CSIT.2016.7589884.
Chintalapudi K. K., Kam M. A noise resistant fuzzy cmeans algorithm for clustering, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228) 4–9 May 1998: proceedings. Anchorage, IEEE, 1998, Vol. 2, pp. 1458–1463. DOI: 10.1109/FUZZY.1998.686334
Hathaway R. J., Bezdek J. C., Hu Y. Generalized fuzzy cmeans clustering strategies using L/sub p/ norm distances, IEEE Transactions on Fuzzy Systems, IEEE, 2000, Vol. 8 (5), pp. 576–582. DOI: 10.1109/91.873580.
Marwala T. Computational Intelligence for Missing Data Imputation Estimation and Management: Knowledge Optimization Techniques. Hershey-New York, Information Science Reference, 2009, 326 p.
Hu Zh., Bodyanskiy Ye., Tyshchenko O., Shafronenko A. Fuzzy clustering of incomplete data by means of similarity measures/ Hu Zh., // 2019 IEEE 2nd Ukraine Conference on Electrical and Computer Engineering (UKRCON), 2–6 July 2019 Lviv 2019: proceedings, IEEE, 2019. – Track 6. –Lviv, Ukraine, 2019. – P.149–152. DOI: 10.1109 /UKRCON. 2019.8879844
Bodyanskiy Ye., Shafronenko A., Mashtalir S. Online robust fuzzy clustering of data with omissions using similarity measure of special type, Lecture Notes in Computational Intelligence and Decision. Waking-Cham, Springer, 2020, Vol. 1020, pp. 637–646. DOI: https://doi.org/10.1007/9783-030-26474-1_44.
Zhou J., Wang Q., Hung C.-C., Yi X. Credibilistic clustering: the model and algorithms, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 2015, Vol. 23, No. 4, pp. 545–564. DOI: https://doi.org/ 10.1142/S0218488515500245
Zhou J., Wang Q., Hung C. C. Credibilistic clustering algorithms via alternating cluster estimation, Journal of Intelligent Manufacturing, 2017, Vol. 28, pp. 727–738. DOI: https://doi.org/10.1007/s10845-014-1004-6.
Liu B. A survey of credibility theory, Fuzzy Optimization and Decision Making, 2006, No. 4, pp. 387–408. DOI: https://doi.org/10.1007/s10700-006-0016-x.
Zhao F., Jiao L., Liu H. Fuzzy c-means clustering with nonlocals partial information for noisy image segmentation, Frontiers of Computer Science. China, 2011, Vol. 5(1), pp. 45–56. DOI: https://doi.org/10.1007/s11704-010-0393-8.
Yang Y. K., Shieh H. L., Lee C. N. Constructing a fuzzy clustering model based on its data distribution, International Conference on Computational Intelligence for Modeling. Control and Automation (CIMCA 2004), Gold Coast 2004: proceedings. Gold Coast, Australia, 2004.
Bodyanskiy Ye. V., Shafronenko A. Yu., Rudenko D. O., Klymova I. M. Online recurrent method of credibilistic fuzzy clustering, Topical issues of the development of modern science. 5th International scientific and practical conference, Sofia 2020, proceedings. Sofia, Publishing House “ACCENT”, 2020, pp. 37–40.
Bodyanskiy Ye., Shafronenko A., Volkova V. Adaptive clustering of incomplete data using neuro-fuzzy Kohonen network, Artificial Intelligence Methods and Techniques for Business and Engineering Applications, Rzeszow-Sofia, ITHEA, 2012, pp. 287–296.
Bodyanskiy Ye., Shafronenko A. Online algorithm for possibilitic fuzzy clustering based on evolutionary cat swarm optimization, Science and Education a New Dimension. Natural and Technical Sciences, 2019, Vol. 193, pp. 86–88. DOI: 10.31174/SEND-NT2019-193VII23-22
Shafronenko A., Bodyanskiy Ye., Klymova I., Holovin O. Online credibilistic fuzzy clustering of data using membership functions of special type[Electronic resource], Proceedings of The Third International Workshop on Computer Modeling and Intelligent Systems (CMIS-2020), April 27–1 May 2020. Zaporizhzhia, 2020. Access mode: http://ceurws.org/Vol-2608/paper56.pdf.
How to Cite
Copyright (c) 2021 Є. В. Бодянський , А. Ю. Шафроненко , І. М. Клімова
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Creative Commons Licensing Notifications in the Copyright Notices
The journal allows the authors to hold the copyright without restrictions and to retain publishing rights without restrictions.
The journal allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles.
The journal allows to reuse and remixing of its content, in accordance with a Creative Commons license СС BY -SA.
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License CC BY-SA that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.