Persian text classification using naive bayes algorithms and support vector machine algorithm

Rezaeian, N.; Novikova, G.

Text classification, the automatic assignment of documents to predefined categories, is one of the primary steps toward knowledge extraction from raw textual data. In such tasks, words are treated as a set of features. Because traditional feature selection methods yield high-dimensional, sparse feature vectors, most text classification methods proposed for this purpose lack performance and accuracy. Many algorithms have been applied to the problem of automatic text categorization; for that reason, we tried to use newer methods such as information extraction, natural language processing, and machine learning. This paper proposes an innovative approach to improve classification performance on Persian text. Naive Bayes classifiers, which are widely used for text classification in machine learning, are based on conditional probability. We compared the Gaussian, Multinomial, and Bernoulli variants of the naive Bayes algorithm with the SVM algorithm. For statistical text representation, TF, TF-IDF, and character-level 3-grams [1,2] were used. Finally, experimental results on 10 newsgroups are reported. © 2018 Institute of Advanced Engineering and Science.
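The representation and classifier named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes raw counts for TF, an unsmoothed idf = log(N / df), and a multinomial naive Bayes with Laplace smoothing (alpha = 1). All function names and the four toy Persian phrases are made up for illustration; the Gaussian and Bernoulli variants and the SVM are omitted.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Return the overlapping character n-grams of a string."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def tf(text, n=3):
    """Raw term frequency: count of each character n-gram in one document."""
    return Counter(char_ngrams(text, n))


def tf_idf(docs, n=3):
    """TF-IDF weights per document, assuming idf = log(N / df)."""
    counts = [tf(d, n) for d in docs]
    df = Counter(g for c in counts for g in c)  # document frequency per n-gram
    total = len(docs)
    return [{g: c[g] * math.log(total / df[g]) for g in c} for c in counts]


def train_multinomial_nb(docs, labels, n=3, alpha=1.0):
    """Fit multinomial naive Bayes on n-gram counts with Laplace smoothing."""
    class_counts = Counter(labels)
    gram_counts = {c: Counter() for c in class_counts}
    for doc, label in zip(docs, labels):
        gram_counts[label].update(tf(doc, n))
    vocab = {g for c in gram_counts.values() for g in c}
    model = {}
    for c in class_counts:
        denom = sum(gram_counts[c].values()) + alpha * len(vocab)
        log_prior = math.log(class_counts[c] / len(docs))
        log_like = {g: math.log((gram_counts[c][g] + alpha) / denom) for g in vocab}
        model[c] = (log_prior, log_like)
    return model


def predict(model, text, n=3):
    """Return the class with the highest posterior log-probability."""
    grams = tf(text, n)

    def score(c):
        log_prior, log_like = model[c]
        # n-grams never seen in training are ignored
        return log_prior + sum(k * log_like[g] for g, k in grams.items() if g in log_like)

    return max(model, key=score)


# Hypothetical toy corpus: two "sport" and two "economy" phrases.
docs = ["تیم فوتبال برد", "بازی فوتبال امروز", "بازار بورس امروز", "قیمت بورس و ارز"]
labels = ["sport", "sport", "economy", "economy"]
model = train_multinomial_nb(docs, labels)
print(predict(model, "فوتبال"))
print(predict(model, "بورس"))
```

Character n-grams avoid the need for a Persian tokenizer or stemmer, which is one plausible reason to pair them with TF and TF-IDF; the abstract itself does not state the motivation.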

Authors

Rezaeian N. ¹ , Novikova G. ¹

Journal

Indonesian Journal of Electrical Engineering and Informatics

Publisher

Institute of Advanced Engineering and Science

Issue number

Language

English

Pages

178-188

State

Published

DOI

10.11591/IJEEI.V8I1.1696

Volume

Year

2020

Organizations

¹ Information Technologies Department, Peoples' Friendship University of Russia (RUDN University), Moscow, Russian Federation

Keywords

Bernoulli Naive Bayes; Gaussian Naive Bayes; Multinomial Naive Bayes; SVM; Text classifications; TF-IDF
