
A comprehensive review of probabilistic and statistical methods in social network sentiment analysis
- 1 University College London
* Author to whom correspondence should be addressed.
Abstract
In the era of rapid digital transformation, social networks generate huge amounts of textual data every day, making sentiment analysis an essential tool for understanding public opinion. This study focuses on the application of probabilistic and statistical methods to sentiment analysis in social networks, highlighting their effectiveness in dealing with uncertainty and modeling the distribution of emotions. The main objective is to evaluate the role of Naïve Bayesian (NB), Hidden Markov models (HMMs), and Bayesian networks in emotion classification, emotion propagation, and dynamic emotion tracking. Through literature review and comparative analysis, this study examines the existing research, computational efficiency, and real-world applications of probabilistic classification models. The results show that Naive Bayes is computationally efficient and effective for large-scale emotion classification, while HMM and Bayesian networks excel in sequential emotion prediction and user behavior modeling. The study highlights the advantages of probabilistic methods in sentiment analysis, while acknowledging their limitations, such as their reliance on probabilistic assumptions and the challenges of capturing deep contextual semantics. Future research should explore hybrid approaches that combine probabilistic models with deep learning techniques to improve the predictive performance and scalability of real-time sentiment analysis.
Keywords
sentiment analysis, naïve bayes, hidden markov models, latent dirichlet allocation
[1]. Dorendro, A., & Devi, H. M. (2024). A Literature Review on Sentiment Analysis. Applied Intelligence and Computing, 303–312. https://doi.org/10.56155/978-81-955020-9-7-29
[2]. Liu, B. (2020). Sentiment Analysis: Mining Opinions, Sentiments, and Emotions (2nd ed.). Cambridge University Press. ISBN 9781108788212
[3]. Ruscica, G., Tucci, G., & Carneiro, B. (2025). TeleCatch: An open-access software for visualizing, filtering and extracting Telegram messages data. Software Impacts, 23, 100736.
[4]. Rakshit, P., & Sarkar, A. (2024). A supervised deep learning-based sentiment analysis by the implementation of Word2Vec and GloVe Embedding techniques. Multimedia Tools and Applications, 1–34.
[5]. Zainuddin, N., Selamat, A., & Ibrahim, R. (2018). Hybrid sentiment classification on twitter aspect-based sentiment analysis. Applied Intelligence, 48, 1218–1232.
[6]. Mor, B., Garhwal, S., & Kumar, A. (2021). A systematic review of hidden Markov models and their applications. Archives of Computational Methods in Engineering, 28, 1429–1448.
[7]. Alshaer, H.N., Otair, M.A., Abualigah, L.,Alshinwan, M., & Khasawneh, A.M. (2021). Feature selection method using improved CHI Square on Arabic text classifiers: analysis and application. Multimedia Tools and Applications, 80, 10373–10390.
[8]. Phan, H. T., Nguyen, N. T., & Hwang, D. (2023). Aspect-level sentiment analysis: A survey of graph convolutional network methods. Information Fusion, 91, 149–172.
[9]. Govindarajan, M. (2013). Sentiment analysis of movie reviews using hybrid method of naive bayes and genetic algorithm. International Journal of Advanced Computer Research, 3(4), 139.
[10]. Zadeh, A., Chen, M., Poria, S., Cambria, E., & Morency, L.-P. (2017). Tensor fusion network for multimodal sentiment analysis. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (pp. 1103–1114). Copenhagen, Denmark: Association for Computational Linguistics.
[11]. Tu, H. T., Phan, T. T., & Nguyen, K. P. (2022). Modeling information diffusion in social networks with ordinary linear differential equations. Information Sciences, 593, 614–636.
[12]. Almgren, K., Kim, M., & Lee, J. (2017). Mining social media data using topological data analysis. 2017 IEEE International Conference on Information Reuse and Integration (IRI). IEEE.
[13]. Birjali, M., Kasri, M., & Beni-Hssane, A. (2021). A comprehensive survey on sentiment analysis: Approaches, challenges and trends. Knowledge-Based Systems, 226, 107134.
[14]. Ressan, M. B., & Hassan, R. F. (2022). Naïve-Bayes family for sentiment analysis during COVID-19 pandemic and classification tweets. Indonesian Journal of Electrical Engineering and Computer Science, 28(1), 375.
[15]. Danyal, M. M., Khan, M., Ghaffar, M., & Arshad, M. (2023). Sentiment Analysis Based on Performance of Linear Support Vector Machine and Multinomial Naïve Bayes Using Movie Reviews with Baseline Techniques. Journal on Big Data, 5.
[16]. Odumuyiwa, V., & Osisiogu, U. (2019). A systematic review on hidden markov models for sentiment analysis. 2019 15th International Conference on Electronics, Computer and Computation (ICECCO). IEEE.
[17]. Perikos, I., Kardakis, S., & Hatzilygeroudis, I. (2021). Sentiment analysis using novel and interpretable architectures of Hidden Markov Models. Knowledge-Based Systems, 229, 107332.
[18]. Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3(Jan), 993–1022.
[19]. Putri, I. R., & Kusumaningrum, R. (2017). Latent Dirichlet allocation (LDA) for sentiment analysis toward tourism review in Indonesia. Journal of Physics: Conference Series, 801(1). IOP Publishing.
[20]. Chien, J.-T., Lee, C.-H., & Tan, Z.-H. (2018). Latent Dirichlet mixture model. Neurocomputing, 278, 12–22.
[21]. Shiguihara, P., Lopes, A. D. A., & Mauricio, D. (2021). Dynamic Bayesian network modeling, learning, and inference: A survey. IEEE Access, 9, 117639–117648.
[22]. Liang, H., Ganeshbabu, U., & Thorne, T. (2020). A dynamic Bayesian network approach for analysing topic-sentiment evolution. IEEE Access, 8, 54164–54174.
Cite this article
Zheng,C. (2025). A comprehensive review of probabilistic and statistical methods in social network sentiment analysis. Advances in Engineering Innovation,16(3),38-43.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title:
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).