Home > Published Issues > 2022 > Volume 13, No. 6, December 2022 >
JAIT 2022 Vol.13(6): 624-631
doi: 10.12720/jait.13.6.624-631

Dark Web Text Classification by Learning through SVM Optimization

Ch A. S. Murty 1 and Parag H. Rughani 2
1. Centre for Development of Advanced Computing (C-DAC), Hyderabad, India
2. National Forensic Sciences University, India

Abstract—The Darkweb has become the largest repository of unauthorized information compared to the surface web because of its benefit of anonymity and privacy. With these anonymity and privacy features, the dark web is also becoming a safe place for illegal activities and hence an increase of dark web usage and size of the onion-based URLs. With the increasing use of dark web users, it is the need for cybercrime investigators across the globe to classify dark web data for understanding various illegal activities to control and categorize URLs hosting such illicit activities with feature engineering. In this research, the Support Vector Machines (SVM) algorithm is used to understand the algorithm’s efficiency for a proposed model to classify dark web data with optimization techniques. Text-based keywords from more than 1800 websites were collected by applying feature engineering techniques and the system’s performance was evaluated with the SVM approach. The results are very encouraging as the Precision, Recall, and F-measure values are 0.83, 0.90 & 0.96 achieved with a dataset of 1800 URLs. 
Index Terms—Darkweb, SVM, classification, Darkweb content classification 
Cite: Ch A. S. Murty and Parag H. Rughani, "Dark Web Text Classification by Learning through SVM Optimization," Journal of Advances in Information Technology, Vol. 13, No. 6, pp. 624-631, December 2022.

Copyright © 2022 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.