SENTIMENT ANALYSIS TECHNOLOGY FOR USER FEEDBACK SUPPORT IN E-COMMERCE SYSTEMS BASED ON MACHINE LEARNING
DOI:
https://doi.org/10.15588/1607-3274-2023-3-11
Keywords:
NLP, text pre-processing, sentiment analysis, feedback, comment, e-commerce, e-business, machine learning, content analysis
Abstract
Context. The interaction between a company and its target audience has been studied for centuries. From the very beginning of commercial relations, the relationship between the service provider and the recipient has been valued almost above all else. Trade is built on trust and respect, and the image of an entrepreneur is often more important than the product being sold. For hundreds of years, the relationship between merchant and buyer, entrepreneur and client, has not lost its importance. In the era of mass digitalization, the quality of the relationship between a company and target audiences of different sizes, together with professional feedback support for clients, is often the starting point for the success of an e-business. Additional tools and information technologies are therefore needed to help entrepreneurs monitor e-business development opportunities in a specific location and to establish feedback with users through social networks and mass media. Such tools would significantly broaden the view of market opportunities for e-business and clarify which of them are worth investing in and which are not worth the time. They would also show which ideas have a future and which business model needs to be implemented, maintained, or developed for the rapid development of territorial and interregional e-business. Finally, they would help identify which levers have the greatest effect on business change: what not to touch, and which policies to change to ensure fast implementation of the plan. Based on the analysis of relevant research results, such tools could provide, for example: direct feedback from customers, the dynamics of changes in overall satisfaction or interest of the target audience, and the advantages and disadvantages reported by users, all obtained through NLP analysis; support for e-business development with respect to the location of the enterprise and the most promising directions; and graphs of business development (improvement or deterioration) depending on the content of comments.
Objective. The study aims to develop information technology that supports e-business development by analyzing business locations, processing user feedback, and analyzing and classifying customer feedback in real time from social networks (Twitter, Reddit, Facebook, and others) using deep learning and Natural Language Processing methods for Ukrainian-language and English-language texts.
Method. NLP methods were used to analyze the opinions of users and customers. The main functions of English-language news classification were implemented with the following machine learning methods: the naive Bayes classifier, logistic regression, and the support vector machine (SVM). The naive Bayes algorithm was used to classify Ukrainian-language user feedback because it performs well on small amounts of data, is easy to train and operate, and works well with text data. It is a good fit for the system given that the number of responses in the dataset is smaller than average.
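As an illustration of why naive Bayes suits small text datasets, the following is a minimal sketch of a bag-of-words multinomial naive Bayes classifier with Laplace smoothing. It is not the authors' implementation: the class name `NaiveBayesSentiment`, the `tokenize` helper, the smoothing value, and the six toy reviews are all assumptions made for this example.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and keep runs of letters (Latin or Cyrillic)."""
    return re.findall(r"[^\W\d_]+", text.lower())


class NaiveBayesSentiment:
    """Hypothetical multinomial naive Bayes over word counts."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                       # Laplace smoothing constant
        self.class_doc_counts = Counter()        # documents seen per label
        self.word_counts = defaultdict(Counter)  # word frequencies per label
        self.vocab = set()

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            self.class_doc_counts[label] += 1
            for tok in tokenize(text):
                self.word_counts[label][tok] += 1
                self.vocab.add(tok)
        return self

    def predict(self, text):
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.class_doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_doc_counts.items():
            # log prior plus the sum of smoothed log likelihoods
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            denom = total_words + self.alpha * len(self.vocab)
            for tok in tokens:
                count = self.word_counts[label][tok]
                score += math.log((count + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data (invented for illustration, not the study's dataset).
train_texts = [
    "great product fast delivery",
    "excellent quality very happy",
    "terrible service never again",
    "poor quality broken on arrival",
    "чудовий товар швидка доставка",
    "жахливий сервіс поганий товар",
]
train_labels = ["positive", "positive", "negative",
                "negative", "positive", "negative"]

model = NaiveBayesSentiment().fit(train_texts, train_labels)
print(model.predict("excellent product"))
print(model.predict("швидка доставка"))
```

Training only counts words per class, so the model can be fitted on a handful of examples and updated cheaply. This matches the properties the abstract cites for choosing naive Bayes.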
Results. A machine learning model was developed for the analysis and classification of Ukrainian- and English-language reviews from users of e-commerce systems.
Conclusions. The created model shows excellent classification results on test data: the overall accuracy of the sentiment model for Ukrainian-language content is quite satisfactory at 92.3%. For the task of analyzing the impact of English-language news on the financial market, logistic regression performed best, with an accuracy of 75.67%. This falls short of the desired result, but it is the highest of all the methods considered. The support vector machine (SVM) performed slightly worse, with an accuracy of 72.78%, and the naive Bayes classifier performed worst, with an accuracy of 71.13%.
License
Copyright (c) 2023 С. А. Тчинецький, Б. О. Поліщук, В. А. Висоцька
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.