Research Article
Open access
Published on 20 March 2025
Download pdf
Export citation

A Review of Bayes Machine Learning for Spam Filtering Applications

Zehao Song *,1,
  • 1 School of Mathematics, Shanghai University of Finance and Economics, Shanghai, China, 200433

* Author to whom correspondence should be addressed.

https://doi.org/10.54254/2755-2721/2025.KL21549

Abstract

The Naive Bayes algorithm uses the theorem of Bayes to filter spam emails, achieving good filtering results. The improved Bayes algorithm addresses the assumption of "feature independence given the class" in Naive Bayes algorithm, allowing for a broader application range. This paper reviews the main content and representative achievements of both the Naive Bayes algorithm and the improved Bayes algorithm, and analyzes the advantages and disadvantages of each method. This study finds that the Naive Bayes algorithm has a limited application range due to the assumption of "feature independence given the class" while the improved Bayes algorithm effectively solves this problem and it has better applicability. This paper aims to help researchers engaged in spam filtering better understand and leverage the potential of the theorem of Bayes in spam filtering, providing a summary reference to promote technological innovation in related fields and better problem-solving, as well as facilitating the understanding of other readers and the application of Bayes filtering methods.

Keywords

Bayes, machine learning, spam, filtering

[1]. Qin Zhiguang, Luo Qin, Zhang Fengli. Research on a Hybrid Spam Filtering Algorithm [J]. Journal of University of Electronic Science and Technology of China, 2007, (03): 485-488.

[2]. Fan Shilun, Xue Tianjun, Xia Wei. Design and Implementation of a Spam Filtering System Based on Bayesian Algorithm and Fisher Algorithm [J]. Information Network Security, 2012, (09): 18-22.

[3]. Peng Ge. A Review of Research on Naive Bayes Algorithm in Spam Filtering [J]. Computer Knowledge and Technology, 2020, 16 (14): 244-245+247. DOI:10.14004/j.cnki.ckt.2020.1577

[4]. Zhao Zhiguo, Tan Minsheng, Li Zhimin. A Review of Improved Bayesian Spam Filtering Algorithms [J]. Journal of Nanhua University (Natural Science Edition), 2006, (01): 33-38. DOI:10.19431/j.cnki.1673-0062.2006.01.009.

[5]. Friedman, Goldszmidt. Building Classifiers using Bayesian Networks [A]. Proceedings of National Conference on Artificial Intelligence [C]. Menlo Park: CA: AAAI Press, 1996.

[6]. Xing Yongkang, Shen Yidong. Credibility Network Classifier [J]. Journal of Chongqing University (Natural Science Edition), 2000, 23(5): 49-52.

[7]. Rennie J. iFile: An application of machine learning to e-mail filtering[C]. Boston: Proceedings of KDD-2000 Text Mining Workshop, 2000.Rennie J. iFile

[8]. Lin Y P, Chen Z P, Yang X L, et al. Mail filtering based on the risk minimization Bayesian algorithm. 6th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2002)[J]. Proceedings-Industrial Systems and Engineering III, 2002, 17(3): 282-285.

[9]. Cai Lijun, Shi Ronghua. Design of a New Email Filtering System Model [J]. Computer Engineering, 2003, 29(16): 167-169.

[10]. Li Huijuan, Gao Feng, Guan Xiaohong, et al. A Spam Filtering Method Based on Bayesian Neural Networks [J]. Microelectronics and Computer, 2005, 22 (4): 107-111.

Cite this article

Song,Z. (2025). A Review of Bayes Machine Learning for Spam Filtering Applications. Applied and Computational Engineering,142,34-39.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of MSS 2025 Symposium: Automation and Smart Technologies in Petroleum Engineering

Conference website: https://2025.confmss.org/
ISBN:978-1-83558-999-1(Print) / 978-1-80590-000-9(Online)
Conference date: 16 June 2025
Editor:Mian Umer Shafiq
Series: Applied and Computational Engineering
Volume number: Vol.142
ISSN:2755-2721(Print) / 2755-273X(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).