Desta, Elifenesh Yitagesu and Gurmessa, Tekalign Tujo (2019) Analysis and result of classification algorithm on email classification. International Journal of Computer Engineering Research, 8 (1). pp. 1-9.
Abstract
Electronic mail (e-mail) is now one of the most common and fastest forms of communication. However, the growth in the number of e-mail users has led to a dramatic increase in spam over the past few years. Spam is the use of electronic messaging systems to send messages in bulk. In this paper, e-mail data were classified as ham or spam using supervised learning algorithms. Three classifiers were used: Naïve Bayes (NB), K-nearest neighbor (KNN), and Support Vector Machine (SVM). The experiment was run with and without a filtering algorithm, and the results show how each classifier performs before and after filtering. To assess the performance of the three classifiers, true positives, false positives, precision, recall, and F-measure were evaluated. The algorithms also differed in the time they required. The SVM was trained with sequential minimal optimization (SMO), an algorithm for solving the quadratic programming (QP) problem that arises during SVM training. Before the filtering algorithm was applied, KNN and SMO were the strongest of the three classifiers. After the filtering algorithm was applied, SMO was the best classifier. The experiments were carried out with the WEKA data mining tool.
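As an illustration of the evaluation measures named in the abstract, the following minimal Python sketch computes precision, recall, and F-measure from predicted and true labels. It is not the authors' WEKA workflow: the function names `confusion_counts` and `precision_recall_f1`, the label strings, and the toy data are all assumptions made for this example, with spam treated as the positive class.

```python
def confusion_counts(y_true, y_pred, positive="spam"):
    """Count true positives, false positives, and false negatives.

    `positive` is the label treated as the positive class (an assumption:
    spam is positive, ham is negative).
    """
    tp = fp = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive and truth != positive:
            fp += 1
        elif pred != positive and truth == positive:
            fn += 1
    return tp, fp, fn


def precision_recall_f1(tp, fp, fn):
    """Return (precision, recall, F-measure); each is 0.0 if undefined."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy labels, invented for illustration only (not data from the paper).
y_true = ["spam", "spam", "ham", "ham", "spam"]
y_pred = ["spam", "ham", "ham", "spam", "spam"]

tp, fp, fn = confusion_counts(y_true, y_pred)
precision, recall, f1 = precision_recall_f1(tp, fp, fn)
print(f"precision={precision:.3f} recall={recall:.3f} F-measure={f1:.3f}")
```

The F-measure here is the balanced harmonic mean of precision and recall; a weighted variant would need an extra parameter.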
Item Type: | Article |
---|---|
Subjects: | Eurolib Press > Computer Science |
Depositing User: | Managing Editor |
Date Deposited: | 01 Feb 2023 06:36 |
Last Modified: | 23 Mar 2024 04:14 |
URI: | http://info.submit4journal.com/id/eprint/1207 |