
Sentiment analysis of Amazon product reviews
- 1 University College London
- 2 Beijing University of Technology
- 3 University of British Columbia
- 4 Shenzhen International Foundation College
* Author to whom correspondence should be addressed.
Abstract
The rapid development of online shopping sites has pushed people's shopping to a new way. Online shopping not only provides convenience to people but also "suggestions." Moreover, there are always many reviews from previous consumers on shopping websites, helping people know more about the product and make decisions. This paper represents the sentiment analysis of Amazon reviews using three models: Random Forest, Naive Bayes, and SVM. These models are trained with token counts, and term frequency-inverse document frequency (TF-IDF) features to make better comparisons. Classification performances are evaluated by precision, recall, and F-1 scores, and exploration is implemented into the dataset providing information about Amazon reviews. The results show that Random Forest and SVM models perform well on positive-labeled data but provide suboptimal results on negative-labeled and neutral-labeled data. Overall, Naive Bayes has the best performance for all three classifications. However, classifications might be biased during the analysis. Thus, more improvements are expected in future research about this topic to obtain more accurate and ideal results, and more machine learning models are supposed to be implemented.
Keywords
sentiment analysis, machine learning, AM
[1]. Askalidis, Georgios, and Edward C. Malthouse. "The value of online customer reviews." Proceedings of the 10th ACM Conference on Recommender Systems. 2016.
[2]. Devika, M. D., Cª Sunitha, and Amal Ganesh. "Sentiment analysis: a comparative study on different approaches." Procedia Computer Science 87 (2016): 44-49.
[3]. Hu, Minqing, and Bing Liu. "Mining opinion features in customer reviews." AAAI. Vol. 4. No. 4. 2004.
[4]. Agrawal, Rakesh, and Ramakrishnan Srikant. "Fast algorithms for mining association rules." Proc. 20th int. conf. very large data bases, VLDB. Vol. 1215. 1994.
[5]. Tan,Wanliang,Xinyu Wang,and Xinyu Xu.”Sentiment analysis for Amazon reviews.” International Conference. 2018
[6]. Rain, Callen. "Sentiment analysis in amazon reviews using probabilistic machine learning." Swarthmore College (2013).
[7]. Bhatt, Aashutosh, et al. "Amazon review classification and sentiment analysis." International Journal of Computer Science and Information Technologies 6.6 (2015): 5107-5110.
[8]. Mouthami, K., K. Nirmala Devi, and V. Murali Bhaskaran. "Sentiment analysis and classification based on textual reviews." 2013 international conference on Information communication and embedded systems (ICICES). IEEE, 2013.
[9]. Thet, Tun Thura, Jin-Cheon Na, and Christopher SG Khoo. "Aspect-based sentiment analysis of movie reviews on discussion boards." Journal of information science 36.6 (2010): 823-848.
[10]. Yang, Li, et al. "Sentiment analysis for E-commerce product reviews in Chinese based on sentiment lexicon and deep learning." IEEE access 8 (2020): 23522-23530.
[11]. Gräbner, Dietmar, et al. "Classification of customer reviews based on sentiment analysis." ENTER. 2012.
[12]. Baid, Palak, Apoorva Gupta, and Neelam Chaplot. "Sentiment analysis of movie reviews using machine learning techniques." International Journal of Computer Applications 179.7 (2017): 45-49.
[13]. Ruder, Sebastian, Parsa Ghaffari, and John G. Breslin. "A hierarchical model of reviews for aspect-based sentiment analysis." arXiv preprint arXiv:1609.02745 (2016).
[14]. Hochreiter, Sepp, and Jürgen Schmidhuber. "Long short-term memory." Neural computation 9.8 (1997): 1735-1780.
[15]. Haque, Tanjim Ul, Nudrat Nawal Saber, and Faisal Muhammad Shah. "Sentiment analysis on large scale
[16]. Jordan, Michael I., and Tom M. Mitchell. "Machine learning: Trends, perspectives, and prospects." Science 349.6245 (2015): 255-260.
[17]. Shinde, Pramila P., and Seema Shah. "A review of machine learning and deep learning applications." 2018 Fourth international conference on computing communication control and automation (ICCUBEA). IEEE, 2018.
[18]. Nasteski, Vladimir. "An overview of the supervised machine learning methods." Horizons. b 4 (2017): 51-62.
Cite this article
Xu,B.;Gan,H.;Sun,X.;Shao,X. (2023). Sentiment analysis of Amazon product reviews. Applied and Computational Engineering,6,1673-1681.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of the 3rd International Conference on Signal Processing and Machine Learning
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).