Home > Published Issues > 2024 > Volume 15, No. 4, 2024 >
JAIT 2024 Vol.15(4): 480-491
doi: 10.12720/jait.15.4.480-491

A Hybrid Feature Extraction and Feature Selection Mechanism to Predict Disease in Plant Leaves

Abisha A. * and Bharathi N.
Department of Computer Science and Engineering, SRM Institute of Science and Technology, College of Engineering and Technology, Vadapalani Campus, Chennai, India
Email: aa7111@srmist.edu.in (A.A.); bharathn2@srmist.edu.in (B.N.)
*Corresponding author

Manuscript received July 19, 2023; revised September 6, 2023; accepted October 27, 2023; published April 9, 2024.

Abstract—The health of the plants is vital to meet the demands of the food cycle. As the symptoms of disease or infection are most commonly seen in plant leaves, selecting features from plant leaves that are highly impacting plant health is crucial. Plant health is a global imperative for food security and ecological balance and must be treated as the top priority. Feature Extraction (FE) and Feature Selection (FS) are significant in Deep Learning (DL) and Machine Learning (ML) models, which are used for classification and prediction. Xception-based feature extraction and random forest classification yield accurate predictions, offering interpretability and adaptability across diverse plant diseases and datasets, benefiting agriculture. In this article, FE is performed using an Xception pre-trained model and the extracted features are sent for FS. Further, six FS methods such as ANOVA, chi-square, Sequential Forward Selection (SFS), Sequential Backward Selection (SBS), Lasso and Ridge, have been deployed and compared with machine learning algorithms such as Logistic Regression (LR), K Nearest Neighbours (KNN), Decision-Trees (DT), Random Forest (RF), Support Vector Machine (SVM), Naive-Bayes (NB) for classification. The article also proposes an Ensemble Feature Selection (EFS)-RF method, which combines feature sets from six feature selection algorithms and classifies based on majority voting. The methodology section details criteria for selecting FE and FS methods, utilizing an ensemble to maximize their respective benefits. The paper contributes to agriculture by employing a hybrid approach, integrating DL (Xception-based FE) and ML (RF-based Classification), utilizing an ensemble of FS methods to identify and assign higher weightage to features prevalent across subsets. The proposed method has outperformed other algorithms for both datasets with 98 % accuracy and 0.02 Mean Squared Error (MSE) for dataset I and 98.125 % accuracy and 0.01875 MSE for dataset II.
Keywords—feature extraction, feature selection, filter, wrapper, embedded, machine learning, ensemble

Cite: Abisha A. and Bharathi N., "A Hybrid Feature Extraction and Feature Selection Mechanism to Predict Disease in Plant Leaves," Journal of Advances in Information Technology, Vol. 15, No. 4, pp. 480-491, 2024.

Copyright © 2024 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.