A Review of Machine Learning Algorithms for Text-Documents Classification

Aurangzeb Khan1, Baharum Baharudin1, Lam Hong Lee2, and Khairullah Khan1

1. Department of Computer and Information Science, Universiti Teknologi PETRONAS, Tronoh, Malaysia
2. Faculty of Science, Engineering and Technology, Universiti Tunku Abdul Rahman, Perak Campus, Kampar, Malaysia

Abstract: With the increasing availability of electronic documents and the rapid growth of the World Wide Web, automatic document categorization has become a key method for organizing information and supporting knowledge discovery. Proper classification of e-documents, online news, blogs, e-mails, and digital libraries requires text mining, machine learning, and natural language processing techniques to extract meaningful knowledge. This paper highlights the important techniques and methodologies employed in text document classification and raises awareness of some of the interesting challenges that remain to be solved, focusing mainly on text representation and machine learning techniques. It reviews the theory and methods of document classification and text mining, drawing on the existing literature.

Index Terms: Text mining, Web mining, Document classification, Information retrieval.

Cite: Aurangzeb Khan, Baharum Baharudin, Lam Hong Lee, and Khairullah Khan, "A Review of Machine Learning Algorithms for Text-Documents Classification," Journal of Advances in Information Technology, Vol. 1, No. 1, pp. 4-20, February 2010. doi:10.4304/jait.1.1.4-20
