THE SYSTEM OF CRITERIA FOR FEATURE INFORMATIVENESS ESTIMATION IN PATTERN RECOGNITION
DOI:
https://doi.org/10.15588/1607-3274-2017-4-10Keywords:
Data sample, pattern recognition, feature selection, informativeness criterion, individual informativeness, group informativeness.Abstract
Context. The task of automation of feature informativeness estimation process in diagnostics and pattern recognition problems i solved. The object of the research is the process of informative feature selection. The subject of the research are the criteria of feature informativeness estimation.
Objective. The research objective is to develop the system of criteria for feature informativeness estimation which enables to comput informativeness of interdependent feature sets.
Method. The system of criteria for feature informativeness estimation is proposed. The proposed system is based on the idea tha feature significance is computed according to spatial location of observations of different classes (size of changing of output parameter) The developed criteria system enables to estimate individual and group feature informativeness in classification and regression problems in situations when initial data samples contain redundant and interdependent features as well as observations with missing values. The proposed criteria don’t require to construct models based on the estimated feature combinations, in such a way considerably reducing time and computing costs for informative feature selection. Application of the proposed criteria for estimation and selection of informative feature allows to reduce structural complexity of synthesized diagnosis and recognition models, to raise its interpretability and generalization ability due to removing of insignificant, interdependent and redundant features in diagnostics and pattern recognition problems.
Results. The software which implements the proposed system of criteria for feature informativeness estimation and allows to selec informative features for synthesis of recognition models based on the given data samples has been developed.
Conclusions. The conducted experiments have confirmed operability of the proposed system of criteria for feature informativenes estimation and allow to recommend it for processing of data sets for pattern recognition in practice. The prospects for further researche may include the modification of the known feature selection methods and the development of new ones based on the proposed system o criteria for individual and group feature informativeness estimation.References
Jensen R., Shen Q. Computational intelligence and feature selection: rough and fuzzy approaches. Hoboken, John Wiley & Sons, 2008, 339 p. DOI: 10.1002/9780470377888.
Mulaik S. A. Foundations of Factor Analysis. Boca Raton, Florida, CRC Press, 2009, 548 p.
Lee J. A., Verleysen M. Nonlinear dimensionality reduction. New York, Springer, 2007, 308 p. DOI: 10.1007/978-0-387-39351-3.
Bezdek J. C. Pattern Recognition with Fuzzy Objective Function Algorithms. N.Y., Plenum Press, 1981, 272 p. DOI: 10.1007/978-1-4757-0450-1.
Hyvarinen A., Karhunen J., Oja E. Independent component analysis. New York, John Wiley & Sons, 2001, 481 p. DOI: 10.1002/0471221317.
Fedotov N. G. Teorija priznakov raspoznavanija obrazov na osnove stohasticheskoj geometrii i funkcional’nogo analiza. Moscow, Fizmatlit, 2010, 304 p. (In Russian).
Guyon I., Elisseeff A. An introduction to variable and feature selection, Journal of machine learning research, 2003, No. 3, pp. 1157–1182.
McLachlan G. Discriminant Analysis and Statistical Pattern Recognition. New Jersey, John Wiley & Sons, 2004, 526 p. DOI: 10.1002/0471725293.
Oliinyk A. A., Skrupsky S. Yu., Shkarupylo V. V., Blagodariov O. Parallel multiagent method of big data reduction for pattern recognition, Radio Electronics, Computer Science, Control, No. 2. 2017, pp. 82–92.
Oliinyk A. Production rules extraction based on negative selection, Radio Electronics, Computer Science, Control, 2016, Vol. 1, pp. 40–49. DOI: 10.15588/1607-3274-2016-1-5.
Oliinyk A., Skrupsky S., Subbotin S., Blagodariov O., Gofman Ye. Parallel computing system resources planning for neuro-fuzzy models synthesis and big data processing, Radio Electronics, Computer Science, Control, 2016, Vol. 4, pp. 61–69. DOI: 10.15588/1607-3274-2016-4-8.
Oliinyk A. A., Skrupsky S. Yu., Shkarupylo V. V., Subbotin S. A. The model for estimation of computer system used resources while extracting production rules based on parallel computations, Radio Electronics, Computer Science, Control, 2017, No. 1, pp. 142–152. DOI: 10.15588/1607-3274-2017-1-16.
Subbotin S., Oliinyk A. The Sample and Instance Selection for Data Dimensionality Reduction, Recent Advances in Systems, Control and Information Technology. Advances in Intelligent Systems and Computing, 2017, Vol. 543, pp. 97–103. DOI: 10.1007/978-3-319-48923-0_13.
Shitikova O. V., Tabunshchyk G. V. Method of Managing Uncertainty in Resource-Limited Settings, Radio Electronics, Computer Science, Control, 2015, No. 2, pp. 87–95. DOI: 10.15588/1607-3274-2015-2-11.
Tabunshchyk G. V., Kaplienko T. I., Shitikova O. V. Verification model of systems with limited resources, Radio Electronics, Computer Science, Control, 2017, No. 4.
Bodyanskiy Ye., Vynokurova O. Hybrid adaptive wavelet-neuro-fuzzy system for chaotic time series identification, Information Sciences, 2013, Vol. 220, pp. 170–179. DOI: 10.1016/j.ins.2012.07.044.
Kononenko I. Estimating Attributes: Analysis And Extensions Of Relief, Machine Learning : European Conference on Machine Learning ECML-94, Catania, 6–8 April 1994 : proceedings of the conference. Berlin, Springer, 1994, pp. 171–182. DOI:10.1007/3-540-57868-4_57.
Kira K., Rendell L. A practical approach to feature selection, Machine Learning : International Conference on Machine Learning ML92, Aberdeen, 1–3 July 1992 : proceedings of the conference. New York, Morgan Kaufmann, 1992, pp. 249–256. DOI: 10.1016/B978-1-55860-247-2.50037-1.
Salfner F., Lenk M., Malek M. A survey of online failure prediction methods, ACM computing surveys, 2010, Vol. 42, Issue 3, pp. 1–42. DOI: 10.1145/1670679.1670680.
Shin Y. C. Intelligent systems : modeling, optimization, and control / C. Y. Shin, C. Xu. – .Boca Raton, CRC Press, 2009, 456 p. DOI: 10.1201/9781420051773.
Oliinyk A. A., Subbotin S. A., Skrupsky S. Yu., Lovkin V. M., Zaiko T. A. Information Technology of Diagnosis Model Synthesis Based on Parallel Computing, Radio Electronics Computer Science Control, 2017, No. 3, pp. 139–151.
Subbotin S., Oliinyk A., Oliinyk O. Noniterative, evolutionary and multi-agent methods of fuzzy and neural network models synthesis : monograph. Zaporizhzhya, ZNTU, 2009, 375 p. (In Ukrainian).
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2017 A. Oliinyk, S. Subbotin, V. Lovkin, O. Blagodariov, T. Zaiko
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Creative Commons Licensing Notifications in the Copyright Notices
The journal allows the authors to hold the copyright without restrictions and to retain publishing rights without restrictions.
The journal allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles.
The journal allows to reuse and remixing of its content, in accordance with a Creative Commons license СС BY -SA.
Authors who publish with this journal agree to the following terms:
-
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License CC BY-SA that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
-
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
-
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.