Introduction

‘Rust’ fungi (order Pucciniales, division Basidiomycota) are a major group of fungal plant pathogens that can affect the yield and quality of many field crops, including sunflower, soybean, field pea, dry bean, wheat, and barley. The level of yield loss caused by rust fungi varies among host species. For instance, wheat leaf rust (Puccinia triticina Eriks.) can cause yield losses ranging from 3 to 50%, depending on the geographical location1. Soybean rust (Phakopsora pachyrhizi Syd. & P. Syd.) usually causes sporadic yield losses in the United States; however, according to risk analyses, the pathogen can result in yield losses greater than 10% in any soybean-growing region in the US, and losses can reach up to 50% in southeastern states2. Sunflower rust (Puccinia helianthi Schwein.) is one of the important diseases that limit sunflower yield. A comprehensive study conducted by Friskop et al. indicated that every 1% of disease severity could result in a 6.6% yield reduction; yield losses of up to 80% have been reported under high disease severity3.

Three important rust diseases of field crops grown in the US Northern Great Plains are sunflower rust, common bean rust (Uromyces appendiculatus F. Strauss), and field pea rust (Uromyces viciae-fabae (Pers.) de Bary), which affect sunflower (Helianthus annuus L.), dry bean (Phaseolus vulgaris L.), and field pea (Pisum sativum L.), respectively. These three rust pathogens have five spore stages (macrocyclic rusts); while some of the spore stages are morphologically similar among the hosts, there are differences in the signs of each pathogen and the symptoms of the disease on each host. In the first stage, basidiospores infect the plant and produce pycnia; these spores are not visible with the unaided eye, so the earliest visible stage is the pycnial stage. On sunflower, pycnia appear as small yellow-orange spots on the top side of lower leaves and cotyledons. In the next stage, aecia form in clusters of orange cups (approximately 0.5 cm in size) on the underside of the leaf. The most common and repeating stage of rust is the uredinial stage, which occurs after aecia. Uredinia are small pustules (approximately 0.15 cm) filled with cinnamon-brown spores (urediniospores) that appear on the upper or undersides of leaves. On sunflower, urediniospores can infect stems, bracts, leaves, and petioles. These pustules can be rubbed off easily and may be surrounded by a chlorotic halo. Lastly, uredinia turn black and form black structures (telia) that can survive under unfavorable environmental conditions4. On dry bean, yellow to yellow-brown pycnia form on the upper surface of leaves, and white aecia appear on the underside of leaves. Similar to sunflower rust, pycnia and aecia are difficult to detect and last only a few days. The visible symptoms start with small white or yellow raised spots on the upper and/or undersides of leaves. These spots enlarge and form reddish-brown uredinia, about 0.1–0.3 cm in diameter, that are filled with dusty cinnamon-brown spores. Pustules may form on green pods, and occasionally on branches and stems, and they may be surrounded by chlorotic halos. Premature leaf drop may be observed under high disease pressure5. On field pea, small whitish aecia (0.3–0.4 mm in diameter) may be scattered on tissue or grouped around the pycnia. Aecia enlarge and rupture the epidermis to produce uredinia (0.5–1+ mm in diameter) filled with cinnamon-brown urediniospores, which can form on leaves, petioles, stems, and pods6. Knowledge of the morphology of each rust pathogen on its specific host plays a significant role in early detection and management of an epidemic, which could prevent yield losses on several host species.

An integrated pest management (IPM) approach is commonly recommended for managing rust diseases. This includes planting a cultivar with genetics conferring resistance to the rust pathogen(s), timely application of an efficacious fungicide, crop rotation, and excellent control of volunteer and/or wild host species7,8. Accurate identification and timely detection of the disease can increase the efficiency of management practices. Growers need ample tools and knowledge to identify a disease, which is not always possible. Further, detecting all the infested areas in a field is not practical; therefore, growers tend to spray the whole field, regardless of the distribution and spread of the disease (personal communication with growers). Considering this challenge, tools that make disease identification and detection easier would be valuable for plant disease management. Automatic detection of diseases through machine learning could provide timely and accurate detection of plant diseases and could be used to spot the infected areas in the field and apply pesticides only to those areas9. Precise spraying could significantly reduce unnecessary pesticide applications.

Numerous studies have verified the efficiency of machine learning analyses in plant pathology. These studies either focus on training disease prediction models using environmental factors (e.g., temperature, humidity, and wetness duration)9,10,11,12 or on detecting plant diseases using image processing and machine learning13. The high accuracy of conventional machine learning algorithms such as artificial neural networks (ANN), random forests (RF), support vector regression (SVR), multi-layer perceptrons (MLP), extreme learning machines (ELM), and logistic regression (LR) in distinguishing healthy and infected samples has been reported repeatedly9. For example, Zhu et al.14 assessed the accuracy of different machine learning algorithms, including a back-propagation neural network (BPNN), ELM, and a least squares support vector machine (LS-SVM), in the detection of Tobacco mosaic virus using hyperspectral imaging. The majority of these models showed prediction accuracies over 85%.

In an earlier study conducted by Rumpf et al.15, the accuracy of ANN, support vector machine (SVM), and decision tree (DT) models in classifying healthy sugar beet leaves and leaves inoculated with three diseases (Cercospora leaf spot, rust, and powdery mildew) was evaluated. Results of this work revealed that SVM accuracy in the detection of Cercospora leaf spot increased from 65% at 1–2% disease severity to 100% at 10% disease severity. The range of accuracy was similarly high for the other two tested diseases. While the classification accuracy of conventional machine learning methods is promising, they require extracting meaningful information (feature extraction) from the input data. Feature extraction is an extra computational step, and the performance of the machine learning models depends on the type of extracted information. Convolutional neural network (CNN) models developed in recent years are capable of processing raw data directly and extracting efficient features automatically16,17. Further, CNNs can achieve higher classification accuracies than traditional machine learning algorithms18.

The efficiency of CNN models has been reported in different fields such as medical sciences19,20,21, the food industry22, the construction industry23, weather prediction24, advertisement25, and hydrology26. The application of CNN models to the detection of plant diseases has been studied to some extent. A review paper published by Boulent et al.27 reported the results of several studies in which the accuracy of CNNs and traditional image processing methods in the prediction of plant diseases was compared. As a general trend, CNNs outperformed models such as SVM and radial basis function (RBF) networks, with differences in accuracy ranging between 3 and 29%28,29. Another significant strength of CNNs is their high generalization capacity (how accurately a model can classify or predict previously unseen data), which results in increased robustness even when the data are heterogeneous, the image-capturing conditions differ, and there is variability among classes. However, acquiring this robustness requires a large-scale training dataset27, which is not always available when researchers use their own datasets. Transfer learning from pre-trained models has become a reliable alternative to training CNNs from scratch, regardless of the training dataset size.

Pre-trained models such as EfficientNet and MobileNet, with varying depths, have been used in the classification of plant diseases30,31,32,33,34,35,36,37. Wang et al.36 compared the accuracy of shallow networks and three deep models in the classification of apple black rot images extracted from the PlantVillage dataset. The results indicated that VGG16 (VGGNet with 16 weight layers) outperformed the other tested models, with an overall accuracy of 90.4% on the test dataset. In another study, conducted by Zhang et al.37, the accuracy of three models, AlexNet, GoogLeNet, and ResNet, with different optimization methods (i.e., Stochastic Gradient Descent (SGD) and Adam) was evaluated. Most of the tested models showed accuracies greater than 94%, with ResNet_SGD achieving the highest accuracy of 97.28%. These studies and several other reports have verified the efficiency of transfer learning in the detection of plant diseases. However, the limited studies conducted on rust disease(s) have mostly used images from large datasets such as PlantVillage, which are commonly captured under controlled conditions, including homogeneous backgrounds and fixed light intensities and tissue/camera positions38,39. In other studies, models were developed to detect rust on only one host crop40,41, which could limit model generalization when used for disease detection in other crops. The present study was designed to fill these gaps and demonstrate the application of transfer learning in the detection of rust disease using real-life images taken under field conditions. Supplementary Table S1 compares the methodologies and results of our work with those of previously published studies reporting top classifiers and CNN models for binary (healthy vs. infected) and multiple-class (several diseases) datasets. The objectives were (i) to evaluate the accuracy of four deep CNN models, including Xception, ResNet50, EfficientNet, and MobileNet, in classifying images taken under field and greenhouse conditions into healthy and infected leaves (displaying uredinia pustules of Puccinia sp. or Uromyces spp.; hereafter, “detection of rust disease” refers to “detection of uredinia pustules”) of three economically important field crops (i.e., sunflower, dry bean, and field pea), and (ii) to assess the role of different optimizers and learning rates in the performance of the models.

Results and discussion

The results of the transfer learning analyses conducted with different hyperparameters using images captured in greenhouse and field conditions are presented and discussed in this section.

The performance of four pre-trained models, ResNet50, Xception, EfficientNetB4, and MobileNetV2, in the detection of rust disease on three hosts was evaluated using four commonly used optimizers and three different learning rates. In general, EfficientNetB4, with an average accuracy of 94.29% across all learning rates and optimizers, was the most efficient model for distinguishing healthy and infected leaf tissues in all three hosts. The average accuracy of the ResNet50 model across all hyperparameters was only 0.77% less than that of EfficientNetB4, making it the second-best model (average accuracy = 93.52%). The average accuracies of MobileNetV2 and Xception across the hyperparameters were 87.67% and 83.20%, respectively (Tables 1 and 2). The higher accuracy of the EfficientNetB4 model could be due to its architecture, which not only balances the network dimensions in terms of depth, width, and input-image resolution but also uses squeeze-and-excitation blocks that enhance the representational power of the network42. The application and high efficiency of the EfficientNet model in the detection of other plant diseases have been reported previously31,38,43.
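
For readers unfamiliar with squeeze-and-excitation, the following Keras sketch illustrates the idea of reweighting feature channels by globally pooled context. It is a simplified, dense-layer variant for illustration only, not EfficientNet's exact implementation (which uses 1 × 1 convolutions and swish activations inside each mobile inverted bottleneck block).

```python
import tensorflow as tf
from tensorflow.keras import layers

def se_block(x, reduction=4):
    """Squeeze-and-excitation: reweight feature channels using global context."""
    channels = x.shape[-1]
    s = layers.GlobalAveragePooling2D()(x)                         # squeeze: H x W x C -> C
    s = layers.Dense(channels // reduction, activation="relu")(s)  # bottleneck
    s = layers.Dense(channels, activation="sigmoid")(s)            # per-channel gates in [0, 1]
    s = layers.Reshape((1, 1, channels))(s)
    return layers.Multiply()([x, s])                               # excitation: rescale channels
```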

Table 1 Statistical fitness metrics of the residual network (ResNet), Xception, EfficientNetB4, and MobileNetV2 pre-trained convolutional neural network (CNN) models in the detection of rust disease on sunflower, dry bean, and field pea using Adaptive Moment Estimation (Adam), Follow The Regularized Leader (Ftrl), Stochastic Gradient Descent (SGD), and Root Mean Square Propagation (RMSprop) optimizers.
Table 2 Statistical fitness metrics of the residual network (ResNet), Xception, EfficientNetB4, and MobileNetV2 pre-trained convolutional neural network (CNN) models in the detection of rust disease on sunflower, dry bean, and field pea using 0.01, 0.001, and 0.0001 learning rates.

In a study conducted by Atila et al.38, the performance of several EfficientNet models, ResNet50, AlexNet, VGG16, and InceptionV3 on the PlantVillage dataset was assessed. The learning rate was set to 0.001 and 0.01 for the Adam and SGD optimizers, respectively. The results indicated that EfficientNetB4 was the most accurate model in detecting disease on the augmented data, with an accuracy of 99.97%, and that EfficientNetB5 outperformed all other models (average accuracy = 99.91%) when the original dataset was used. However, the accuracy of the other models in their study was very close to the top models, ranging from 99.45% for AlexNet on the original dataset to 99.88% for ResNet50 on the augmented dataset. Previous studies reporting the high efficiency of EfficientNet models in the detection of plant diseases have mostly used PlantVillage, a large, publicly available dataset of edited images with similar backgrounds and magnifications. The results of the present study indicated that EfficientNet could be a good choice even for small datasets containing a variety of images with different backgrounds, ambient light intensities, angles, and plant ages (for sunflower). Identifying models that perform well with these types of images is essential, since scientists normally encounter such datasets in real-life situations, where acquired photos have complex background noise that makes the data analysis more challenging18.

A review of the literature indicates that ResNet50 is among the most accurate and frequently used models in the detection of plant diseases. Several studies have reported the high efficiency of ResNet models where EfficientNet was not among the tested models34,37,44,45,46. Our results indicated that ResNet50 was the second-best model in the detection of rust disease, with slightly lower accuracy than EfficientNet. Therefore, it could be concluded that EfficientNet and ResNet are two of the strongest pre-trained CNN models for the detection of plant diseases and pests. However, as previously reported37,38,43, hyperparameters such as the optimizer, learning rate, and batch size strongly determine model performance, and appropriate selection of these components during training is required to achieve the highest accuracy.

Considering the significant role of the optimizer in training deep learning models47, the effect of four optimizers on the performance of the models was evaluated. Fitness metrics of the ResNet50, Xception, EfficientNetB4, and MobileNetV2 models were tested using four widely used optimizers: Adam48, Follow The Regularized Leader (Ftrl)49, SGD, and Root Mean Square Propagation (RMSprop)50. The results indicated that, in general, model accuracy was higher with the Adam optimizer, ranging between 84.84 and 95.56%. EfficientNetB4 with the Adam optimizer and a learning rate of 0.001 achieved an accuracy of 95.56%, precision of 96.86%, true positive rate of 93.65%, F-score of 95.18%, true negative rate of 97.25%, and an area under the receiver operating characteristic curve (AUC-ROC) of 96.92%, outperforming the other models. Stochastic gradient descent was the second-best optimizer (after Adam), resulting in an accuracy of 94.49% for EfficientNetB4. Root Mean Square Propagation and Ftrl were the least efficient optimizers and generated similar results for ResNet50, Xception, and EfficientNetB4; however, the accuracy of MobileNetV2 decreased by only 0.3% when SGD was replaced with Ftrl. The details of the models’ fitness metrics are presented in Table 1. While EfficientNetB4 with the Adam optimizer outperformed ResNet50 and the other models, the ResNet50 model was more stable, with less fluctuation than EfficientNetB4 regardless of epoch number (Fig. 1).

Figure 1

The effect of Adaptive Moment Estimation (Adam), Follow The Regularized Leader (Ftrl), Stochastic Gradient Descent (SGD), and Root Mean Square Propagation (RMSprop) optimizers with the learning rate of 0.001 on the accuracy of the residual network (ResNet), Xception, EfficientNetB4, and MobileNetV2 pre-trained convolutional neural network (CNN) models using different epoch numbers.

In a study conducted by Zhang et al.37, the SGD optimizer showed better performance than Adam when AlexNet, GoogLeNet, and ResNet models were used. While the difference in accuracy between the optimizers was only 1.60 and 2.12% for GoogLeNet and ResNet, respectively, the accuracy declined by 81.97% when SGD was substituted with Adam in AlexNet. In the present study, Adam was the better optimizer for all models. However, only one model (ResNet) was common between the studies, and for that model the difference in accuracy between the two optimizers was less than 2.6% in either study. This suggests that the efficiency of optimizers could be model- and hyperparameter-dependent to some degree. Further, the epoch numbers differed between the studies: Zhang et al.37 trained their models for 6240 epochs while we set the epoch number to 100, and SGD usually converges better than Adam given a longer training time51.

The learning rate is considered one of the most important hyperparameters affecting the performance of CNN models52, and defining the optimum learning rate yields the highest performance. Very small learning rates apply smaller changes to the weights and produce a smoother decrease in the model loss function; however, the model needs more epochs to learn the task. Selecting a high learning rate, on the other hand, speeds up the training process but can produce unwanted divergent behavior in the loss function53. Therefore, in this study, three mid-range learning rates (0.01, 0.001, and 0.0001) were used in the training and testing of the models, and their effects on model performance were evaluated. Considering that Adam was the best optimizer for all models, it was used in the analyses conducted to test the learning rates. The results indicated that a learning rate of 0.001 was optimum for all four pre-trained CNN models regardless of epoch number (Table 2 and Fig. 2). EfficientNetB4 achieved the highest accuracy (average accuracy = 94.70%) across all learning rates, where the learning rate of 0.001 resulted in the maximum accuracy of 95.56%, precision of 96.86%, true positive rate of 93.65%, and true negative rate of 97.25%. ResNet50 was the second-best model regardless of learning rate (average accuracy = 93.71%). The average accuracies of Xception and MobileNetV2 were 82.5% and 87.91%, respectively, across all learning rates (Table 2).

Figure 2

The effect of different learning rates (0.01, 0.001, and 0.0001) on the accuracy of the residual network (ResNet), Xception, EfficientNetB4, and MobileNetV2 pre-trained convolutional neural network (CNN) models using different epoch numbers, where Adaptive Moment Estimation (Adam) was used as the optimizer of the models.

A closer look at Fig. 2 reveals that the optimum learning rate of 0.001 resulted in the highest accuracy compared with the other two learning rates, and the fluctuation in accuracy is minimized after 20 epochs at this rate. Increasing or decreasing the learning rate led to weaker model performance. The smallest learning rate (0.0001 in this study) resulted in slightly better performance than the highest value (0.01) for Xception and MobileNetV2 (at 100 epochs); however, the learning rate of 0.01 resulted in negligibly higher accuracy than 0.0001 for the ResNet50 and EfficientNetB4 models. In general, EfficientNetB4 and ResNet50 were more robust to changes in the learning rate, whereas Xception and MobileNetV2 were more sensitive to either decreasing or increasing it.

The role of different learning rates in the performance of CNN models has been studied to some extent. For instance, Hassan et al.43 tested a range of learning rates (0.01 to 0.0001) for their efficiency in training several models, including EfficientNetB0, MobileNetV2, InceptionV3, and InceptionResNetV2, on the PlantVillage dataset; however, the results of that study did not report the optimum learning rate for each model. In another study, conducted on the PlantVillage and Nepal datasets, testing three learning rates (0.001, 0.0001, and 0.00001) resulted in accuracies ranging from 99 to 100% for a CNN model and from 81 to 100% for a capsule neural network, with 0.0001 reported as the optimum learning rate54. Our results did not agree with this finding, as our optimum rate was 0.001. However, several factors, such as model architecture, dataset size and type, number of disease classes, and other hyperparameters, determine the performance of the models; these factors were not all similar between the two studies and are potential reasons for this discrepancy. Therefore, evaluating a range of hyperparameters, such as the learning rate and optimizer, is an important task for tuning model parameters and obtaining the maximum accuracy on the desired dataset.

Convolutional neural network models are becoming popular tools for the detection of plant diseases. The results of the present study indicated that EfficientNetB4, with an average accuracy of 94.29%, outperformed the other models. ResNet50 (average accuracy of 93.52%) was the second-best architecture in the detection of rust disease on sunflower, dry bean, and field pea, where a small dataset with uneven backgrounds and magnifications was used. Further, four different optimizers and three learning rates were tested across all architectures; the Adam optimizer with a learning rate of 0.001 consistently performed best in all evaluated architectures. Moreover, k-fold cross-validation (tenfold) of EfficientNetB4 with the Adam optimizer and a learning rate of 0.001 was used to assess the effect of bias caused by the random train-test split. The model achieved an average accuracy of 94.34% ± 0.010, precision of 96.09% ± 0.006, true positive rate of 93.37% ± 0.018, true negative rate of 97.22% ± 0.005, and F-score of 94.81% ± 0.011. As the average accuracies and standard deviations (the values after the ± signs) show, the model was highly accurate and stable.
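
A minimal sketch of how such a tenfold cross-validation could be run, assuming the images and labels are already loaded as arrays; `build_model()` is a hypothetical helper that returns a freshly compiled, fine-tuned EfficientNetB4 (Adam optimizer, learning rate 0.001), along the lines sketched in the Methods.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold
from tensorflow.keras.utils import to_categorical

def cross_validate(X, y, build_model, n_splits=10, epochs=100):
    """X: images (N, 224, 224, 3); y: integer labels (0 = healthy, 1 = infected)."""
    y_onehot = to_categorical(y, 2)  # one-hot labels for the two-unit output head
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=42)
    scores = []
    for train_idx, test_idx in skf.split(X, y):
        model = build_model()        # fresh weights each fold to avoid leakage
        model.fit(X[train_idx], y_onehot[train_idx],
                  epochs=epochs, batch_size=32, verbose=0)
        _, acc = model.evaluate(X[test_idx], y_onehot[test_idx], verbose=0)
        scores.append(acc)
    return np.mean(scores), np.std(scores)  # reported as mean ± s.d. across folds
```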

Visual demonstrations were generated using Gradient-weighted Class Activation Mapping (Grad-CAM) with EfficientNetB4 as the base model to indicate how the deep learning algorithm makes decisions when differentiating rust-infested from healthy tissue (Fig. 3). The red areas mark the most important discriminative regions, where the model pays the most attention, and the blue areas the least critical. As Fig. 3A indicates, the model focuses mainly on rust pustules and not on the background or other injuries, such as the one in the top right corner of the leaf. Figure 3B shows the precision of the model in focusing on the leaf with rust pustules while it is surrounded by other healthy leaves. Most importantly, the model also focuses on the rust pustules located at the leaf tip, not on the insect injuries at the bottom of the leaf tissue. Figure 3C,D also verify that the rust-infested areas on the leaves are correctly recognized as discriminative regions.

Figure 3
figure 3

Heat maps generated using the Gradient-weighted Class Activation Mapping (Grad-CAM) method. The red and blue regions highlight the most and the least discriminative regions, respectively. (A) field pea, (B) and (C) dry bean, and (D) sunflower.

As a future direction of this study, the rust disease dataset could be expanded by adding more images of the tested crops and of additional field crops such as wheat and corn. A larger dataset would allow validation of the top architectures and, subsequently, development of tools and/or mobile applications that assist growers and plant pathologists in fast and cost-effective plant disease diagnosis. Although diseases such as rust are detectable by the naked eye, it is not practical to screen the whole field for the presence of the disease; therefore, farmers typically spray the entire field when some symptoms occur. Large-scale pesticide applications are costly, labor-intensive, time-consuming, and, most importantly, endanger environmental health and safety. The application of technology and remote sensing is becoming more common in agriculture, with the goal of making precision agriculture accessible to the majority of farmers. The first step toward achieving this goal is the development and validation of accurate machine learning models with a high level of generalization. In this study, our aim was to develop a reliable model for the detection of rust disease on several hosts that could be incorporated into drones and/or handheld devices to facilitate precision spraying.

Conclusion

Image processing and deep learning algorithms have demonstrated encouraging results in differentiating healthy and infected plants at different stages of disease progress. In this study, the ability of four pre-trained CNN models, Xception, ResNet50, EfficientNetB4, and MobileNetV2, to detect rust disease on three commercially important field crops was evaluated. Images from the greenhouse and field were used in training and testing the models to represent the variation of natural conditions. The performance of the models was evaluated using two important hyperparameters, i.e., the learning rate and the optimization algorithm. EfficientNetB4 trained with the Adam optimizer and a learning rate of 0.001 was the most accurate model for discriminating healthy and rust-infested tissues, with an average accuracy of 94.29%. These results demonstrate that EfficientNetB4 could be a reliable model for detecting rust on several host species and could therefore be incorporated into tools and devices used in precision management of the disease, such as pesticide-spraying drones and robots. Intelligent spray systems are a major component of precision agriculture that reduce pesticide applications and protect environmental and human health.

Materials and methods

Image dataset

Three crops that are susceptible to rust pathogens, sunflower, dry bean, and field pea, were used in this study. Images with different backgrounds and magnifications were taken primarily in North Dakota, USA, between 2007 and 2020 under greenhouse and agricultural field environments. The presence of the three rust species, Puccinia helianthi on sunflower, Uromyces appendiculatus on dry bean, and U. viciae-fabae on field pea, on infected plants was visually verified. A total of 1764 images (907 healthy and 857 infected) from the three crops were collected and pooled for data analysis, with 70% of the photos used for training. A subsample of healthy and infected plants of the three crops is presented in Fig. 4.
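
As a minimal sketch, such a pooled dataset could be loaded and split 70/30 with the Keras utilities available in the TensorFlow version used here (v2.6). The directory name and layout below are assumptions for illustration; `label_mode="categorical"` produces the one-hot labels that match the two-unit output head described in the next section.

```python
import tensorflow as tf

# Assumed layout: rust_dataset/healthy/*.jpg and rust_dataset/infected/*.jpg,
# pooled across sunflower, dry bean, and field pea.
common = dict(
    directory="rust_dataset",
    labels="inferred",
    label_mode="categorical",   # one-hot labels for the two-unit output head
    image_size=(224, 224),
    batch_size=32,
    validation_split=0.30,      # 70% training / 30% testing
    seed=42,                    # fixed seed so the two subsets do not overlap
)
train_ds = tf.keras.preprocessing.image_dataset_from_directory(subset="training", **common)
test_ds = tf.keras.preprocessing.image_dataset_from_directory(subset="validation", **common)
```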

Figure 4
figure 4

Examples of images used for training the models: (A) from left to right: healthy dry bean, field pea, and sunflower leaves, (B) from left to right: slightly infected dry bean, field pea, and sunflower leaves, (C) from left to right: highly infected dry bean, field pea, and sunflower leaves.

All methods were performed in accordance with the relevant guidelines, regulations, and legislation. No animals or human participants were used in this study. Further, plant materials were not collected/planted; only images were taken under field and greenhouse conditions. All study methods followed North Dakota State University guidelines.

Pre-trained CNN models

In this study, a transfer learning approach using four pre-trained CNN architectures, Xception, ResNet50, EfficientNetB4, and MobileNetV2, was adopted for the detection of rust disease on sunflower, field pea, and dry bean. In transfer learning, a model can use previously learned knowledge from other tasks to solve a new problem55. Since the models are already trained on a massive dataset, less data are required compared with training a model from scratch, training time is saved, and model performance can improve. In this study, all the models were fine-tuned to diagnose rust disease using weights pre-trained on the ImageNet dataset, which has approximately 1.4 million images in 1000 classes56. Further, to adjust the models to our task, which is a binary classification (healthy versus infected), the fully connected output layer of each pre-trained CNN model was changed to two units, representing the dimensionality of the output space. The sigmoid function was chosen as the activation function, and binary cross-entropy was selected as the loss function.
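
The following is a minimal Keras sketch of this fine-tuning setup, using EfficientNetB4 as the base model (the other three architectures are swapped in the same way); the global-average-pooling head is an assumption, as the exact head layers are not detailed here.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Base network with ImageNet weights; the original 1000-class head is dropped.
base = tf.keras.applications.EfficientNetB4(
    include_top=False, weights="imagenet", input_shape=(224, 224, 3))

# New binary head: two output units (healthy vs. infected) with sigmoid
# activation, trained with binary cross-entropy, as described above.
x = layers.GlobalAveragePooling2D()(base.output)
outputs = layers.Dense(2, activation="sigmoid")(x)
model = models.Model(base.input, outputs)

# All layers remain trainable, so the ImageNet weights are fine-tuned
# rather than frozen.
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
    loss="binary_crossentropy",
    metrics=["accuracy"],
)
```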

The architecture of ResNet50, with a total of 16 residual blocks, is represented in Fig. 5. The architecture of the Xception model is shown in Fig. 6. The architecture of EfficientNetB4, with input-image dimensions of 224 × 224, three channels, and an initial 3 × 3 convolution, is shown in Fig. 7. The architecture of MobileNetV2, which includes an initial full convolution layer with 32 filters followed by 19 residual bottleneck layers, is represented in Fig. 8. Details of the tested pre-trained CNN architectures can be found in Supplementary Note 1.

Figure 5

Residual networks (ResNet-50) architecture with a sample input image and 16 residual blocks.

Figure 6

The architecture of the Xception model with a sample input image and 14 modules.

Figure 7

EfficientNet basic architecture with a sample input image and 18 convolution layers.

Figure 8

The architecture of the MobileNetV2 with a sample input image and 19 residual bottleneck layers.

Further, Grad-CAM57 was used to identify the regions of an input image that have the greatest effect on the classification score. The Grad-CAM method uses the gradients of the classification score with respect to the final convolutional feature map. Locations where this gradient is large are those on which the final score depends most.
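
A minimal sketch of this computation, assuming a Keras model whose last convolutional layer is named "top_conv" (the name used by Keras' EfficientNet implementations; other architectures require the corresponding layer name).

```python
import numpy as np
import tensorflow as tf

def grad_cam(model, image, conv_layer_name="top_conv"):
    """Grad-CAM heat map: pooled gradients of the class score weight
    the final convolutional feature map."""
    grad_model = tf.keras.models.Model(
        inputs=model.inputs,
        outputs=[model.get_layer(conv_layer_name).output, model.output])
    with tf.GradientTape() as tape:
        conv_out, preds = grad_model(image[np.newaxis, ...])
        idx = int(tf.argmax(preds[0]))          # predicted class
        class_score = preds[:, idx]
    grads = tape.gradient(class_score, conv_out)   # d(score)/d(feature map)
    weights = tf.reduce_mean(grads, axis=(0, 1, 2))  # global-average-pool the gradients
    cam = tf.reduce_sum(conv_out[0] * weights, axis=-1)  # weighted feature map
    cam = tf.nn.relu(cam)                          # keep only positive evidence
    return (cam / (tf.reduce_max(cam) + 1e-8)).numpy()   # normalize to [0, 1]
```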

Model evaluation metrics

The performance of the four deep CNN models was evaluated using well-known metrics, including accuracy, true negative rate, precision, true positive rate, F-score, and the AUC-ROC score. The confusion matrix was used to calculate these performance metrics; their definitions are provided below.

$$\text{Accuracy} = \frac{\text{TP} + \text{TN}}{\text{TP} + \text{TN} + \text{FP} + \text{FN}}$$
(1)
$$\text{True negative rate} = \frac{\text{TN}}{\text{TN} + \text{FP}}$$
(2)
$$\text{Precision} = \frac{\text{TP}}{\text{TP} + \text{FP}}$$
(3)
$$\text{True positive rate} = \frac{\text{TP}}{\text{TP} + \text{FN}}$$
(4)
$$F_{\text{score}} = 2 \times \frac{\text{Precision} \times \text{True positive rate}}{\text{Precision} + \text{True positive rate}}$$
(5)

where TP, TN, FP, and FN represent true positives, true negatives, false positives, and false negatives, respectively. Based on Eqs. (1–5), the accuracy of a model is defined as the number of correct predictions divided by the total number of predictions. True negative rate, or specificity, is the proportion of negative instances classified correctly, and precision, also known as positive predictive value, is the proportion of predicted positive observations that are truly positive. True positive rate, also called recall or sensitivity, is the proportion of actual positive instances classified correctly, and the F-score, or F1 score, is the harmonic mean of precision and true positive rate and can be an indicator of model robustness. In addition, the ROC curve represents the diagnostic ability of the model across classification thresholds. It has been reported that the ROC score can be a better comparison measure than the F-score, specifically where the class distribution is unbalanced, since the latter might become skewed towards the positive class. Lastly, AUC-ROC indicates the ability of the model to separate the two classes (infected and healthy, in this case).
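
For illustration, these metrics can be computed from the confusion matrix as follows; `y_score` below denotes the model's predicted probability for the infected class, and scikit-learn is used only as a convenience.

```python
import numpy as np
from sklearn.metrics import confusion_matrix, roc_auc_score

def fitness_metrics(y_true, y_score, threshold=0.5):
    """Compute the metrics of Eqs. (1)-(5) plus AUC-ROC from predictions."""
    y_pred = (np.asarray(y_score) >= threshold).astype(int)
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    precision = tp / (tp + fp)
    tpr = tp / (tp + fn)                      # true positive rate (recall)
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "true_negative_rate": tn / (tn + fp),  # specificity
        "precision": precision,
        "true_positive_rate": tpr,
        "f_score": 2 * precision * tpr / (precision + tpr),
        "auc_roc": roc_auc_score(y_true, y_score),  # threshold-free measure
    }
```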

Experiment setup

To evaluate the efficiency of deep learning models for detecting rust disease, the experiment was conducted using three crops (sunflower, dry bean, and field pea), and the performance of four well-known pre-trained CNN architectures was compared. All models mentioned above were trained and tested on 70% and 30% of the dataset, respectively. The experiment was implemented on Windows 10 using the Keras framework with TensorFlow-GPU v2.6.0 as the backend on a GPU-enabled workstation with an NVIDIA GeForce GTX 1080 (8 GB GDDR5).

Considering that the optimizer plays a crucial role in changing the attributes of a model, the pre-trained CNN models were trained and tested with four different optimizers: Adam, SGD, RMSprop, and Ftrl. Moreover, since the learning rate is one of the most important hyperparameters52 to tune for optimum performance, we evaluated the performance of the models using three different learning rates: 0.01, 0.001, and 0.0001. Supplementary Note 2 provides information and related references about the tested hyperparameters.
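
A hedged sketch of the resulting comparison loop; `build_model`, `train_ds`, and `test_ds` are the hypothetical helper and datasets from the earlier sketches, and the full optimizer × learning-rate grid is shown for simplicity, whereas the learning-rate comparison in this study was paired with the Adam optimizer only.

```python
import tensorflow as tf

optimizer_classes = {
    "Adam": tf.keras.optimizers.Adam,
    "SGD": tf.keras.optimizers.SGD,
    "RMSprop": tf.keras.optimizers.RMSprop,
    "Ftrl": tf.keras.optimizers.Ftrl,
}
learning_rates = [0.01, 0.001, 0.0001]

results = {}
for opt_name, opt_cls in optimizer_classes.items():
    for lr in learning_rates:
        model = build_model()  # hypothetical helper: fresh pre-trained model + binary head
        model.compile(optimizer=opt_cls(learning_rate=lr),
                      loss="binary_crossentropy", metrics=["accuracy"])
        model.fit(train_ds, epochs=100, verbose=0)
        _, accuracy = model.evaluate(test_ds, verbose=0)
        results[(opt_name, lr)] = accuracy  # compare fitness across the grid
```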

Other hyperparameters were not benchmarked for the following reasons. Batch size: the NVIDIA GTX 1080 GPU used in this study could support a batch size of up to only 32. Epoch number: the top models were largely stable across the tested epoch numbers; therefore, higher epoch numbers were not tested (Figs. 1, 2). Early stopping: to verify the stability of the models across a range of epoch numbers, early stopping was not activated, as it stops training when there is no improvement at a specific epoch. Image size: smaller images were not used because of the small size of the rust pustules, and 224 × 224 was the maximum image size our hardware could support. Depth of model: the network depth is fixed for pre-trained models.