SENTIMENT ANALYSIS TECHNOLOGY FOR USER FEEDBACK SUPPORT IN E-COMMERCE SYSTEMS BASED ON MACHINE LEARNING

Authors

  • S. Tchynetskyi Lviv Polytechnic National University, Lviv, Ukraine, Ukraine
  • B. Polishchuk Lviv Polytechnic National University, Lviv, Ukraine, Ukraine
  • V. Vysotska Lviv Polytechnic National University, Lviv, Ukraine, Ukraine

DOI:

https://doi.org/10.15588/1607-3274-2023-3-11

Keywords:

NLP, text pre-processing, sentiment analysis, feedback, comment, e-commerce, e-business, machine learning, content analysis

Abstract

Context. The interaction between a company and its target audience has been studied for centuries. From the very beginning of commercial relations, the relationship between the service provider and the recipient has been valued almost above all else. Trade is built on trust and respect. The image of an entrepreneur is often more important than the product he sells. For hundreds of years, the relationship between the merchant and the buyer, the entrepreneur and the client has not lost its importance, and in the era of mass digitalization, the quality of the relationship between the company and the target audience of different sizes and professional feedback support with clients often start the success of e-business. To provide these additional tools and information technologies to help businessmen monitor e-business development opportunities in a specific location, as well as establish feedback with users through social networks and mass media. Obtaining such tools will significantly expand the vision of market opportunities for ebusiness, it will clarify which of them make sense to invest in, and which ones are not worth paying time for. Also see what idea has the future and what business model needs to be implemented/maintained/developed for the rapid development of territorial/interregional e-business. It will also help to understand which levers have the greatest effect for business changes: what not to touch, and what policies to change to ensure high speed in the implementation of the plan based on the analysis of relevant research results, for example, to receive: direct feedback from customers, the dynamics of changes in overall satisfaction or interest of the target audience and advantages/disadvantages from users using NLP analysis; support for the development of e-business in relation to the location of their enterprise and the best directions; – graphs of business development (improvement/deterioration) depending on the content of comments.

Objective of the study is to develop information technology to support the development of e-business by analyzing business locations, processing feedback from users, analyzing and classifying customer feedback in real time from social networks: Twitter, Reddit, Facebook and others using deep learning and Natural methods. Language Processing of Ukrainian-speaking and Englishspeaking texts.

Method. NLP-methods were used to analyze the opinions of users and customers. Among the methods of implementing the main functions of English-language news classification, the following machine learning methods are used: naive Bayesian classifier, logistic regression, and the method of support vectors. The Naive Bayes algorithm was used to classify Ukrainian-language user feedback, as it performs well on small amounts of data, is easy to train and operate, and works well with text data. Naive Bayes classifier is a very good option for our system and considering that the number of responses in the dataset is smaller compared to the averages.

Results. A machine learning model was developed for the analysis and classification of Ukrainian- and English-language reviews from users of e-commerce systems.

Conclusions. The created model shows excellent classification results on test data. The overall accuracy of the sentimental model for the analysis of Ukrainian-language content is quite satisfactory, 92.3%. The logistic regression method coped best with the task of analyzing the impact of English-language news on the financial market, which showed an accuracy of 75.67%. This is certainly not the desired result, but it is the largest indicator of all considered. The support vector method (SVM) coped somewhat worse with the task, which showed an accuracy of 72.78%, which is a slightly worse result than the one obtained thanks to the logistic regression method. And the naïve Bayesian classifier method did the worst with the task, which achieved an accuracy of 71.13%, which is less than the two previous methods.

Author Biographies

S. Tchynetskyi, Lviv Polytechnic National University, Lviv, Ukraine

PhD student of Information Systems and Networks Department

B. Polishchuk, Lviv Polytechnic National University, Lviv, Ukraine

PhD student of Information Systems and Networks Department

V. Vysotska, Lviv Polytechnic National University, Lviv, Ukraine

PhD, Associate Professor of Information Systems and Networks Department

References

Kuzminov M. Modern Development of Small Business in Ukraine, Sciences of Europe, 2022, Vol. 107, pp. 29–31. DOI: 10.5281/zenodo.7479719

Definition of customer support. Access mode: https://www.helpscout.com/helpu/definition-of-customersupport.

Edvardsson I. R., Durst S., Oskarsson G. K. Strategic outsourcing in SMEs, Journal of small business and enterprise development, 2020, Vol. 27(1), pp. 73–84. DOI: 10.1108/JSBED-09-2019-0322

Sarkar D. Text analytics with Python: a practitioner’s guide to natural language processing. Bangalore, Apress, 2019, 674 p. DOI: 10.1007/978-1-4842-4354-1

Eisenstein J. Introduction to natural language processing. Cambridge, MIT press, 2019, 536 p.

Goldberg Y. A Primer on Neural Network Models for Natural Language Processing. Access mode: https://jair.org/index.php/jair/article/view/11030/26198

Britannica dictionary. Access mode: https://www.britannica.com/topic/outsourcing

Sykes. Access mode: https://www.sykes.com

Sensee. Access mode: https://www.sensee.co.uk/index.html

Serco. Access mode: https://www.serco.com

Teleperformance. Access mode: https://www.teleperformance.com/en-us

Repustate. Using NLP for business success. Access mode: https://www.repustate.com/blog/using-nlp-for-businesssuccess/

Repustate. How can sentiment analysis help you with Patient Voice? Access mode: https://www.repustate.com/patient-voice/

SkywellSoftware. How does Siri work: technology and algorithm. Access mode: https://skywell.software/blog/howdoes-siri-worktechnology-and-algorithm/

Grammarly. How Grammarly uses Natural Language Processing and Machine Learning to identify the main points in a message. Access mode: https://www.grammarly.com/blog/engineering/nlpmlidentify-main-points/

Klevu. Smart Search Overview. Access mode: https://www.klevu.com/smart-search/

IBM. Natural Language Processing (NLP). What is natural language processing? Access mode: https://www.ibm.com/cloud/learn/natural-languageprocessing#tocwhat-is-na-jLju4DjE

SaS. Natural Language Processing (NLP). What it is and why it matters. Access mode: https://www.sas.com/en_us/insights/analytics/what-isnatural-languageprocessing-nlp.html

MonkeyLearn, What is NLP. Access mode: https://monkeylearn.com/blog/what-is-natural-languageprocessing/

Topic Analysis: The Ultimate Guide. Access mode: https://monkeylearn.com/topic-analysis/

Sentiment Analysis Explained. Access mode: https://www.lexalytics.com/technology/sentiment-analysis/

MonkeyLearn. Intent Classification: How to Identify What Customers Want. Access mode: https://monkeylearn.com/blog/intentclassification/

MonkeyLearn. Keyword Extraction. Access mode: https://monkeylearn.com/keyword-extraction/

Edia. What is Keyword Extraction? Access mode: https://www.edia.nl/keyword-extraction

Stemming vs. Lemmatization in NLP. Access mode: https://towardsdatascience.com/stemming-vslemmatizationin-nlp-dea008600a0

Analytics steps. What is Stemming and Lemmatization in NLP?. Access mode: https://www.analyticssteps.com/blogs/what-stemmingandlemmatization-nlp

What is Tokenization in NLP? Access mode: https://www.analyticsvidhya.com/blog/2020/05/what-istokenizationnlp/

Stanford. Machine Translation. Access mode: https://nlp.stanford.edu/projects/mt.shtml

Data Science UA. Machine Translation. Access mode: https://data-science-ua.com/wiki/natural-languageprocessingnlp/machine-translation/

Text Summarization in NLP. Access mode: https://www.topcoder.com/thrive/articles/textsummarization-in-nlp

What Is Text Summarization in NLP? Access mode: https://www.analyticssteps.com/blogs/what-textsummarization-nlp

Dict_uk Github repository. Access mode: https://github.com/brown-uk/dict_uk/tree/master/data

Advantages and disadvantages of different classification models. Access mode: https://www.geeksforgeeks.org/advantages-anddisadvantages-of-different-cl assification-models/

Naive Bayes Classifier. Access mode: https://www.upgrad.com/blog/naive-bayes-classifier/

Coursera. What Is Python Used For? Access mode: https://www.coursera.org/articles/what-is-python-used-forabeginners-guide-to-using-python

Python.org. Executive Summary. Access mode: https://www.python.org/doc/essays/blurb/

PymBook. Introduction to Flask. Access mode: https://pymbook.readthedocs.io/en/latest/flask.html

FastApi. – Access mode: https://fastapi.tiangolo.com/

NLTK. Natural Language Toolkit. Access mode: https://www.nltk.org/

AWS. What is Apache Kafka? Access mode: https://aws.amazon.com/ru/msk/what-is-kafka/

Tutorialspoint. System Analysis and Design – Overview. Access mode: https://www.tutorialspoint.com/system_analysis_and_design /system_ana lysis_and_design_overview.htm

Lonnie D. Bentley. System Analysis and Design for the Global Enterprise. Columbus, McGraw-Hill Education Ltd, 2007, 747 p.

WeyBackMachine. System Analysis. Access mode: https://web.archive.org/web/20070822025602/http://pespmc 1.vub.ac.be/ASC/SYSTEM_ANALY.html

Ritchey T. Analysis and synthesis: on scientific methodbased on a study by Bernhard Riemann, Systems research, 1991, Vol. 8(4), pp. 21–41. DOI: 10.1002/sres.3850080402

Booch G., Rumbaugh J., Jacobson I. Unified Modeling Language User Guide. Boston, AddisonWesley, 2005, 391 p.

Iso.org. ISO/IEC 19501:2005 – Information technology – Open Distributed Processing – Unified Modeling Language (UML) Version 1.4.2. Access mode: https://www.iso.org/standard/32620.html

Iso.org. ISO/IEC 19505-1:2012 – Information technology – Object Management Group Unified Modeling Language (OMG UML), Part 1, Infrastructure. Access mode: https://www.iso.org/standard/32624.html

WeyBackMachine. Basic UML. Access mode: https://web.archive.org/web/20121214050605/http://ooad.as f.ru/Files/U ML.djvu.zip

Chen F., Cheung C. C. H., Peng G. Linguistic tone and nonlinguistic pitch imitation in children with autism spectrum disorders: A cross-linguistic investigation, Journal of Autism and Developmental Disorders, 2022, Vol. 52(5), pp. 2325– 2343. DOI: 10.1007/s10803-021-05123-4

Robertson A., Magdy W., Goldwater S. Emoji skin tone modifiers: Analyzing variation in usage on social media, ACM Transactions on Social Computing, 2020, Vol. 3(2). – pp. 1–25. DOI: 10.1145/3377479

Tree of endings in Ukrainian language. Access mode: http://www.senyk.poltava.ua/projects/ukr_stemming/ukr_en dings.html

Ishihara S. Score-based likelihood ratios for linguistic text evidence with a bag-of-words model, Forensic Science International, 2021, Vol. 327, P. 110980. DOI: 10.1016/j.forsciint.2021.110980

Understanding the Classification report through sklearn. – Access mode: https://muthu.co/understanding-theclassification-report-in-sklearn/

Markiv O., Vysotska V., Chyrun L., Voloshyn S., Dyyak I., Panasyuk V. Emotion recognition system project of English newspapers to regional E-business adaptation, Computer science and information technologies, IEEE 17th International conference, Lviv, Ukraine, 10–12 November 2022, proceedings. Lviv, IEEE, 2022, pp. 392–397. https://doi.org/10.1109/CSIT56902.2022.10000527

Vysotska V., Markiv O., Voloshyn S., Dyyak I., Budz I., Schuchmann V. Sentiment analysis technology of English newspapers quotes based on neural network as public opinion influences identification tool, Computer science and information technologies, IEEE 17th International conference, Lviv, Ukraine, 10–12 November 2022, proceedings. Lviv, IEEE, 2022, pp. 83–88. https://doi.org/10.1109/CSIT56902.2022.10000627

Published

2023-10-13

How to Cite

Tchynetskyi, S., Polishchuk, B., & Vysotska, V. (2023). SENTIMENT ANALYSIS TECHNOLOGY FOR USER FEEDBACK SUPPORT IN E-COMMERCE SYSTEMS BASED ON MACHINE LEARNING. Radio Electronics, Computer Science, Control, (3), 104. https://doi.org/10.15588/1607-3274-2023-3-11

Issue

Section

Neuroinformatics and intelligent systems