ADVANCED TECHNOLOGIES OF BIG DATA RESEARCH IN DISTRIBUTED INFORMATION SYSTEMS
DOI:
https://doi.org/10.15588/1607-3274-2017-4-8Keywords:
System, technology, big data, information, technique, database, Web application, modeling, processing, analytics.Abstract
Context. Considered question correct interpretation information flow in distributed information systems. The object of study methods are promotion “big data” on cluster system.
Objective. Is the study promising areas and technology for the analysis of structures data in distributed information systems.
Method. The big data tendency prospects as well as timeliness of the problem are studied in this paper. The principles of work with them are addressed. Big data processing technologies are provided. The analysis of each one is performed. An example of “MapReduce” paradigm application, uploading of big volumes of data, processing and analyzing of unstructured information and its distribution into the clustered database is provided. The article summarizes the concept of “big data”. Examples of methods for working with arrays of unstructured data. Dedicated scientific guidance for analyzing big data. The principles of unstructured data in distributed information systems. Driven work platform “Hadoop MapReduce” and “Apache Spark”. Analyzed their properties and given the differences. An analysis of comparative performance against both platforms – the performance of the number of iterations. Consider ways to create RDD: parallelization transmitted collection program and a link to an external file system in “Hadoop”. There is an example rozparalelenoyi system RDD. Proposed work lone class for basic database operations: database connection, create a table, a table, get in line id, returning all elements of the database, update, delete and create the line.
Results. The analysis Models Spark and Hadoop MapReduce for phased construction distributed information system. built up SparkConf object, containing information about applique and is the final version of the experiment.
Conclusions. Conducted experiment confirmed efficiency the proposed method, are capable process horizontal data arrays, that parallelization by defective presentation of information. These promising areas of analyze structure data for the purpose of forecast results and create algorithms advanced correlation, contributing new understanding activity distributed information systems further research can consist in wide use information systems, that would provide a full range technological process adaptation information flows in clusters.References
What is Big Data [Electronic resource]. Access mode: http://datascience.berkeley.edu/what-is-big-data/
Shaw J. Why “Big Data” Is a Big Deal [Electronic resource]. Rezhym dostupu: http://harvardmag.com/pdf/2014/03-pdfs/0314-HarvardMag.pdf
Schutt P. What is Big Data? [Electronic resource]. Rezhym dostupu: https://blogs.oracle.com/bigdata/big-data-and-analytic-top-10-trends-for-2014
Boyko N., Pobereyko P. Basic concepts of dynamic recurrent neural networks development, ECONTECHMOD : an international quarterly journal on economics of technology and modelling processes. Lublin, Polish Academy of Sciences, 2016, Vol. 5, No. 2, pp. 63–68.
Leskovec J., Rajaraman A., Ullman J. D. Mining of massive datasets. Massachusetts, Cambridge University Press, 2014, 470 р.
Mayer-Schoenberger V., Cukier K. A revolution that will transform how we live, work, and think. Boston New York, 2013, 230 р.
Boyko N. A look trough methods of intellectual data analysis and their applying in informational systems, Komp”yuterni nauky ta informatsiyni tekhnolohiyi CSIT 2016 : Materialy XI Mizhnarodnoyi naukovo-tekhnichnoyi konferentsiyi CSIT 2016 : proceedings. L’viv, Vydavnytstvo L’vivs’koyi politekhniky, 2016, pp. 183–185.
Benderskaia E. N., Zhukova S. V. Ostsilliatornye neironnye seti s khaoticheskoi dinamikoi v zadachakh klasternogo analiza, Neirokomp”iutery: razrabotka, primenenie; Radiotekhnika : proceedings. Moscow, Radyotekhnyka, 2011, No. 7, pp. 74–86.
Benderskaia E. N., Nikitin K. V. Modelirovanie neironnoi aktivnosti mozga i bionspirirovannye vychisleniia, Nauchno-tekhnicheskie vedomosti SPbGPU. Informatika. Telecommunicatcii. Upravlenie : proceedings. St.-Petersburg, Izd-vo Politehn. un-ta, 2011, No. 6–2(138), pp. 34–40.
Benderskaia E. N. Vozmozhnosti primeneniia nekotorykh kharakteristik sinkhronizatsii dlia vyiavleniia samoorganizuiushchikhsia klasterov v ostsilliatornoi neironnoi seti s khaoticheskoi dinamikoi, Neirokomp”iutery: razrabotka, primenenie: nauchno-tekhnicheskii zhurnal : proceedings. Moscow, Nauchnyi tsentr neirokomp”iuterov, 2012, No. 11, pp. 69–73.
Feng J., Brown D. Fixed-point attractor analysis for a class of neurodynamics, Neural Computation : proceedings. Massachusetts, MIT Press Cambridge, 1998, Vol. 10, pp. 189–213.
Kaneko K. Life: an introduction to complex systems biology. Berlin, Springer-Verlag, 2006, 369 p.
Maass W., Natschger T., Markram H. Real-time computing without stable states: a new framework for neural computations based on perturbations, Neural Computation : proceedings. Switzerland, Institute for Theoretical Computer Science, 2002, Vol. 11, pp. 2531–2560.
Schrauwen B., Verstraeten D., Campenhout J. V. An overview of reservoir computing theory, applications and implementations, Proc. of the 15th European Symp. on Artificial Neural Networks : proceedings. Belgium, Bruges, 2007, pp. 471–482.
Coombes S. Waves, bumps, and patterns in neural field theories, Biological Cybernetics : proceedings. Nottingham, University of Nottingham, 2005, Vol. 93, No. 2, pp. 91–108.
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2017 N. І. Boyko
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Creative Commons Licensing Notifications in the Copyright Notices
The journal allows the authors to hold the copyright without restrictions and to retain publishing rights without restrictions.
The journal allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles.
The journal allows to reuse and remixing of its content, in accordance with a Creative Commons license СС BY -SA.
Authors who publish with this journal agree to the following terms:
-
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License CC BY-SA that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
-
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
-
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.