Visible and near-infrared spectroscopy and deep learning application for the qualitative and quantitative investigation of nitrogen status in cotton leaves

Xiao, Qinlin; Wu, Na; Tang, Wentan; Zhang, Chu; Feng, Lei; Zhou, Lei; Shen, Jianxun; Zhang, Ze; Gao, Pan; He, Yong

doi:10.3389/fpls.2022.1080745

ORIGINAL RESEARCH article

Front. Plant Sci., 20 December 2022

Sec. Technical Advances in Plant Science

Volume 13 - 2022 | https://doi.org/10.3389/fpls.2022.1080745

This article is part of the Research Topic Spectroscopy, Imaging and Machine Learning for Crop Stress View all 13 articles

Visible and near-infrared spectroscopy and deep learning application for the qualitative and quantitative investigation of nitrogen status in cotton leaves

Qinlin Xiao^1,2†

Na Wu^3†‡

Wentan Tang^1,2

Chu Zhang⁴

Lei Feng^1,2

Lei Zhou⁵

Jianxun Shen⁶

Ze Zhang⁷

Pan Gao⁸

Yong He^1,2*‡

¹College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, China
²Key Laboratory of Spectroscopy Sensing, Ministry of Agriculture and Rural Affairs, Hangzhou, China
³School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Huzhou, China
⁴School of Information Engineering, Huzhou University, Huzhou, China
⁵College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing, China
⁶Hangzhou Raw Seed Growing Farm, Hangzhou, China
⁷Key Laboratory of Oasis Eco-Agriculture, College of Agriculture, Shihezi University, Shihezi, China
⁸College of Information Science and Technology, Shihezi University, Shihezi, China

Leaf nitrogen concentration (LNC) is a critical indicator of crop nutrient status. In this study, the feasibility of using visible and near-infrared spectroscopy combined with deep learning to estimate LNC in cotton leaves was explored. The samples were collected from cotton’s whole growth cycle, and the spectra were from different measurement environments. The random frog (RF), weighted partial least squares regression (WPLS), and saliency map were used for characteristic wavelength selection. Qualitative models (partial least squares discriminant analysis (PLS-DA), support vector machine for classification (SVC), convolutional neural network classification (CNNC) and quantitative models (partial least squares regression (PLSR), support vector machine for regression (SVR), convolutional neural network regression (CNNR)) were established based on the full spectra and characteristic wavelengths. Satisfactory results were obtained by models based on CNN. The classification accuracy of leaves in three different LNC ranges was up to 83.34%, and the root mean square error of prediction (RMSEP) of quantitative prediction models of cotton leaves was as low as 3.36. In addition, the identification of cotton leaves based on the predicted LNC also achieved good results. These results indicated that the nitrogen content of cotton leaves could be effectively detected by deep learning and visible and near-infrared spectroscopy, which has great potential for real-world application.

1 Introduction

Cotton (Gossypium spp), as one of the important economic crops in the world, is widely used in the textile industry because of its excellent natural properties. Nitrogen is an essential plant macronutrient, taking an important part in crop photosynthesis, which provides necessary nutritional support for the growth and development of crops (Ma et al., 2022). Observations have shown that nitrogen fertilization has an important effect on cotton yield. Rational nitrogen fertilization is beneficial to increase cotton yield, while both deficit and excessive nitrogen fertilization have a negative impact on cotton growth and development (Liu et al., 2010; Gospodinova and Panayotova, 2019). Gospodinova and Panayotova (2019) summarized the research on the effects of mineral fertilization on cotton yield and concluded that nitrogen should be applied at different development stages as needed. Optimizing the nitrogen fertilizer application scheme is conducive to improving nitrogen utilization efficiency and cotton yield. Knowing the nutritional status of cotton is the prerequisite to realizing on-demand nitrogen application. Therefore, rapid and accurate evaluation and detection of cotton nitrogen is of great significance for monitoring plant nutrition status, as well as making fertilization decisions.

Leaf nitrogen concentration (LNC), a critical indicator of nitrogen nutrient status, is widely used in crop nutrient status evaluation (Wan et al., 2022). A study conducted by Kergoat et al. (2008) has shown that LNC is an essential factor affecting canopy light utilization efficiency and photosynthetic rate. Generally, LNC is determined by destructive analysis methods, such as the Kjeldahl-digestion method. Although the destructive approaches are objective, they have disadvantages such as being time-consuming, labor-intensive, high cost and strong destructiveness. It is also difficult to meet the actual needs of rapid and real-time detection and diagnosis of LNC in a wide range. In recent years, non-destructive techniques, such as visible and near-infrared (VNIR) spectroscopy (Mishra et al., 2021) and multi-spectral and hyperspectral imaging (Tahmasbian et al., 2021; Guo et al., 2022), have been developed to detect crop nutrition status. Multi-spectral and hyperspectral images usually carry more information than spectra data. However, the acquisition of spectral images generally requires expensive and bulky sensors, the amount of data is enormous, and there is more information redundancy, which requires more storage space and tedious data processing. Multi-spectral and hyperspectral imaging are not economically feasible when many samples need to be examined and evaluated. VNIR allows rapid acquisition of spectral information related to samples’ physiological state and internal components at a relatively low cost. In the past few years, VNIR has attracted extensive attention and has been used in qualitative and quantitative research in plants (Zhang et al., 2020a; Xia et al., 2021; Luo et al., 2022).

For the studies aiming at employing VNIR for nitrogen detection, Mishra (Mishra et al., 2021) et al. demonstrated the feasibility of using VNIR to quantitatively predict the nitrogen and potassium concentration in bell pepper leaves. The results showed that VNIR allowed accurate prediction of nitrogen with an RMSEP of 0.44%. Sun et al. (2013) used VNIR to identify the fertilized nitrogen level of lettuce leaves and achieved a high classification accuracy of 100%. Zhang et al. (2022a) explored the performance of using spectra of different ranges to estimate nitrogen content in cotton leaves and obtained a R²_c = 0.794~ 0.909 and R²_P = 0.774 ~ 0.899. Relationships between cotton leaf spectra curves (380-700 nm, 700-1300 nm, and 1300-2500 nm) and nitrogen content contributed to satisfactory predictions for nitrogen content detection. There are indeed many researches on the detection of LNC (Sun et al., 2013; Wang et al., 2018; Gao et al., 2022; Pourdarbani et al., 2022; Tang et al., 2022; Zhang et al., 2022b). Although the studies focusing on the LNC classification achieved good results (Sun et al., 2013; Wang et al., 2018; Pourdarbani et al., 2022), the samples in these studies were classified according to different nitrogen fertilization levels or different nitrogen fertilization days. It should be noted that there is a large difference between the fertilization of nitrogen and its actual uptake for the plant. Therefore, the adaptability of the classification models according to the nitrogen fertilization division is greatly limited by the uncertainty of the actual LNC. What’s more, in practice, it is always hard to get accurate fertilization data and estimate the fertilization condition. In addition, the studies focusing on LNC prediction are mainly for a specific cultivar or a specific spectral data collection environment (Rotbart et al., 2013), which may limit the scope of the applicability of the established models.

Deep learning is a method that simulates the human brain for analysis and learning. It forms abstract features to represent the data distribution. Deep learning has the advantages of strong self-learning and feature-extraction ability and great capability of processing spectra data (Xiao et al., 2020). In recent years, deep learning has been applied to conduct various tasks in spectral and image data processing (Steinbrener et al., 2019; Zhou et al., 2019). Convolutional neural network (CNN) is one of the typical deep learning models. CNN has been proven effective in processing spectra data and establishing classification and regression models for various agricultural tasks (Zhang et al., 2020b; Zhang et al., 2020c; Gai et al., 2022).

The objective of the present study was to explore the feasibility of qualitative diagnosis and quantitative detection of LNC based on VNIR combined with deep learning. The goals include (1) exploring the laws of the spectra of leaves with different LNC; (2) classifying nitrogen levels according to the measured LNC; (3) detecting LNC for two cotton cultivars under the condition that the spectra were collected in different measurement environments. The specific content includes (1) extracting characteristic wavelengths by random frog (RF), weighted partial least squares regression (WPLS), and saliency map for qualitative discrimination and quantitative detection tasks, respectively; (2) building partial least squares discriminant analysis (PLS-DA), support vector machine for classification (SVC), and convolutional neural network classification (CNNC) models based on full spectra and characteristic wavelengths to identify cotton leaves with different LNC qualitatively; (3) developing partial least squares regression (PLSR), support vector machine for regression (SVR), convolutional neural network regression (CNNR) models to quantitatively detect LNC in cotton leaves.

2 Materials and methods

2.1 Sample preparation

Cotton was planted in an experimental field at the Hangzhou Raw Seed Growing Farm (30°22’58.85” N, 119°56’7.80” E), Hangzhou, Zhejiang province, China. Cotton cultivars Lumianyan 24 (LMY24) and Xinluzao 53 (XLZ53) were sampled in this experiment. Thirty-six experimental plots of 4×2 m were used with six nitrogen rates (0, 120, 240, 360, 480, 278 kg/hm²). Each nitrogen level was set with three replicates. Leaf sampling was conducted during the whole growth stage. Leaves at different leaf positions were selected from the experimental plots. Finally, a total of 1400 leaves were acquired. It is worth mentioning that the spectra of 648 leaves were collected in the laboratory, and the spectra of the remaining leaves were collected in the field. For the samples measured in the laboratory, the leaves were cut, placed in the black bags and stored in a cooler with a temperature of about 4°C. These samples were transported to the laboratory immediately. The time of transit was within one hour.

2.2 Spectra acquisition

Leaf spectra acquisition was conducted by a spectroradiometer (Fieldspec4, Analytical Spectral Devices - ASD, Boulder, CO, USA) system. This spectroradiometer consists of a leaf clip, which provides a light source. During the measurement, the leaves were clamped up for spectra acquisition. Three different positions of each leaf were measured, and five scans were conducted for each measurement. The spectra of five scans were averaged as the spectra of the leaf region, and the average spectra of three leaf regions were taken as the spectra of each leaf. The regions of spectra acquisition for each leaf is shown in Figure 1. The collected spectra cover the visible and near-infrared region (400 ~ 1000 nm) and the short-wave near-infrared region (1000 ~ 2500 nm), and the spectral resolution are 3nm and 8nm, respectively. Considering the noise in the beginning, spectra between 430-2500 nm were used in this study.

FIGURE 1

Figure 1 The regions of spectra acquisition for cotton leaves.

2.3 Measurements of leave nitrogen concentration

After finishing the spectra collection, the leaves were placed in an oven, dried at 105°C for half an hour, and then cooled to 80°C until the sample weight recorded a constant weight. Then, the dried leaves were ground into a fine powder and sieved through a 40-mesh. A uniform dry leaf sample of fixed mass was taken, and the nitrogen concentration was determined by the Kjeldahl method after acid digestion (Kjeldahl, 1883). According to the measured LNC (mg/g), cotton leaf samples were divided into three categories: low-level LNC, medium-level LNC, and high-level LNC. The detailed statistical information on sample composition is presented in Table 1. Cotton leaves with different LNC levels are shown in Figure 2. It can be seen that the leave with high-level LNC has a deeper green color.

TABLE 1

Table 1 Statistical information of composition of the cotton leaves.

FIGURE 2

Figure 2 Cotton leaves with different LNC levels: (A) low-level LNC, (B) medium-level LNC, (C) high-level LNC.

2.4 Data analysis methods

2.4.1 Convolutional neural network

In this study, two self-developed CNN architectures were applied for building classification and regression models, and their structures are shown in Figures 3A, B, respectively. For the classification task, two convolution layers were set, both followed by a max pooling layer and a batch normalization layer. The number of filters, kernel size, and strides of the two convolution layers were both set as 16, 3, and 1, respectively. The rectified linear unit (ReLU) was used as the activation for computing the outputs of the convolutional layers. The max pooling layer served as down-sampling and dimensionality reduction to form the features of the next layer. Then, a fully connected network with 64 neurons was added, followed by a batch normalization layer. The dropout layer was used to avoid overfitting. The fully connected layer at the end was used for output. For the regression task, two batch normalization layers followed by convolution layers were employed. The number of filters, kernel size, and strides of the two convolution layers were both set as 32, 3, and 1, respectively. Same as the proposed CNN for classification, the rectified linear unit (ReLU) was used as the activation. A batch normalization layer was added before the features were outputting to the fully connected layer. In the end, two fully connected layers with 64 and 16 neurons were used for building non-linear regression models to predict the LNC of different leaves. The fully connected layer at the end was used for output.

FIGURE 3

Figure 3 The CNN structure for classification model (A) and regression model (B).

For the training phase, the Softmax cross-entropy loss function combined with stochastic gradient descent (SGD) optimizer was applied to train the CNN developed for the classification task. The L1 loss function and the adaptive moment estimation (Adam) optimizer were used for the regression task. The detailed information about SGD optimizer and Adam optimizer could be found on the website https://pytorch.org/docs/stable/optim.html . For both training tasks, the batch size was set as 64, and a scheduled learning rate was used. In the beginning, the learning rate was set to 0.05. The learning rate was reduced ten times after every 200 epochs. According to this rule, the training phase was terminated when the loss was stable.

2.4.2 Conventional models

PLS-DA and SVC models were established to classify cotton leaves with different LNC. PLS-DA is a linear discriminant algorithm developed from PLSR (Yuan et al., 2021). PLS-DA algorithm can effectively extract the variables helpful for classification and realize data recognition. PLS-DA can deal with irreversible matrices and select the number of latent variables so that the model achieves the best balance between underfitting and overfitting (Li et al., 2021). SVC is a pattern recognition algorithm based on a support vector machine (SVM) for classification. It achieves the classification goal by exploring the hyperplane that maximizes the distance between different classes (Xiao et al., 2020). In this study, the radial basis function (RBF) was used as the kernel function. The regularization parameter c and kernel function parameter g were determined through a grid-search procedure. The search range of c and g were both assigned as 2^-8 to 2⁸. PLSR and SVR models were used to establish the quantitative analysis model of LNC. Detailed information on PLS and SVR details can be found in our previous article (Xiao et al., 2022). For both qualitative and quantitative analysis models, five-fold cross-validation was adopted.

2.4.3 Wavelengths selection

Hyperspectral data contains massive amounts of information, which also exists information redundancy, collinearity, and noise that are not conducive to data processing. To make effective use of data, characteristic wavelengths extraction is a common strategy. In this study, RF, WPLS and saliency map were used to extract characteristic wavelengths. For the RF algorithm, based on the idea of inversible-jump Markov Monte Carlo, PLS-DA and PLSR are selected as modeling methods for classification and regression, respectively. Models are established by constantly updating the subset of variables according to the defined criteria. The frequency of each variable selected in the modeling subset is calculated after reaching the number of iterations (Yun et al., 2013; Sun et al., 2021). The top 40 wavelengths with the highest frequency were selected as the characteristic wavelengths. When using WPLS for wavelength selection, a PLS regression model is first established, and each variable’s regression coefficient was calculated. The wavelengths with the larger absolute value of the regression coefficient at the crest and trough were selected (Mehmood et al., 2012). Saliency map is a popular method for computing the contribution of each variable to the model performance. In this study, for classification tasks, CNNC model was first established and calculated the saliency based on the method proposed in Feng’s study (Feng et al., 2021). Similarly, as for regression tasks, CNNR model was first established and saliency map was applied following the way in our previous study (Xiao et al., 2022). The first 40 critical wavelengths with the highest frequency for both tasks were selected as the characteristic wavelengths.

2.4.4 Software and model evaluation

For model establishment, PLS-DA and PLSR were performed in R2019b (The MathWorks, Natick, MA, USA). SVC, and SVR were conducted in the scikit-learn 0.23.1 (Anaconda, Austin, TX, USA) using python 3.1. The CNN models were conducted in MXNet 1.4.0 (MXNetAmazon, Seattle, WA, USA). For feature selection, RF was performed in R2019b (The MathWorks, Natick, MA, USA). WPLS was carried out in the Unscrambler X 10.1 (Camo AS, Oslo, Norway). Saliency map was conducted in MXNet1.4.0 (MXNetAmazon, Seattle, WA, USA).

It is critical to evaluate the model performance with appropriate indicators. Classification accuracy is used for assessing the qualitative analysis models. Classification accuracy is calculated as the ratio of correctly classified samples to the total number of samples. The closer it is to 100%, the better the model’s performance. The coefficients of determination (R²) and root mean square error (RMSE) of calibration, validation, and prediction set were applied to assess the performance of quantitative analysis models. The closer R² of the model is to 1, the closer RMSE is to 0, indicating that the model performance is more satisfactory.

3 Results

3.1 Spectra features

The spectra of all the cotton leaves and leaves with different LNC are shown in Figures 4A, B. As shown in Figure 4A, the spectra of all the leaves present a consistent change tendency. Four peaks (550, 1650, 1820, and 2225 nm) and three valleys (670, 1432, and 1950 nm) were observed. The analysis of the chemical bonds which may be assigned to the peaks and valleys can be found in our previous study (Xiao et al., 2022). Figure 4B presents the spectral curves of cotton leaves with different LNC. It can be seen that the reflectance of leaves with high LNC in the range of 430 ~ 520 nm is slightly higher than that of leaves with low LNC. There is a slight increase in the reflectance between 520 ~ 610 nm, and the reflectance of leaves with low LNC becomes higher. After 700 nm, the reflectance curves increase sharply and form a high reflectivity plateau between 775 and 1300 nm, between which the reflectance of leaves with high LNC is larger. The variation trend of the reflectance with LNC between the range of 1400 ~ 1900 nm and 2000 ~ 2500 nm is the opposite from that in the range of 775-1300 nm. The variation of reflectance in different spectral intervals makes it possible to identify leaves with different LNC content. As the results demonstrated in our previous study (Xiao et al., 2022), the model based on the spectra processed by first derivative (FD) and standard normal variate transformation (SNV) demonstrated great generalization ability. Therefore, the method of FD+SNV was used to preprocess the spectra and the processed spectra were used for subsequent modelling. The transformed curves are shown in Figure 4C.

FIGURE 4

Figure 4 Spectra of cotton leaves: (A) the spectra of all the cotton leaves, (B) the spectra of leaves with different LNC, (C) the spectra transformed by FD +SNV.

3.2 Wavelengths selection

Hyperspectral data contains amounts of information such as redundancy, collinearity, background, and other information unrelated to LNC detection. The irrelevant information will significantly increase the burden of data processing, affect the analysis and extraction of effective data, and directly affect the model’s performance. Therefore, in this study, RF, WPLS, and saliency map were used to select the characteristic wavelengths.

The optimal wavelengths selected by different methods for regression models and classification models are shown in Tables 2 and 3, respectively. It can be seen that the number and the location of the selected wavelengths on spectral curves varied from different methods. For the wavelengths selected for the classification model, compared with full spectra, the number of variables chosen by RF, WPLS, and saliency map was reduced by 98.07%, 98.02%, and 98.07%, respectively. For the wavelengths selected for the regression model, the number of variables selected by RF, WPLS, and saliency map was reduced by 98.07%, 97.73%, and 98.07%, respectively. Obviously, wavelengths selection significantly reduces data computation and alleviates the model dependence on high-performance computing instruments, which will contribute to the popularization and application of the model.

TABLE 2

Table 2 The optimal wavelengths selected by RF, WPLS, and saliency map for regression models.

TABLE 3

Table 3 The optimal wavelengths selected by RF, WPLS, and saliency map for classification models.

The position of the selected wavelengths in the spectral curve is displayed in Figure 5. For specific wavelength selection methods, the position of characteristic wavelength selected for classification and regression is largely coincidental. It indicated that the characteristic wavelengths related to nitrogen detection selected by wavelength selection method were consistent even in the tasks with different purposes. The specific number of selected variables might be related to the calculation protocol of wavelength selection method. Although there are differences in the number and location of wavelengths selected by RF, WPLS, and saliency map, some bands were chosen as the optimal wavelengths for LNC detection by more than one method, such as the bands around 554 nm, 595 nm, 1179nm, 1490 nm, 1671nm, 1673 nm, 1746 nm, 2046 nm, 2154 nm, 2230 nm, and 2459 nm. These bands are likely to have a strong correlation with nitrogen detection. Among the wavelengths selected by more than one algorithm, the bands in visible range was related with the color of the leaves (Malacara, 2011). The spectral response near 1490 nm was associated with N-H amide with N-R group, which can be connected with protein content (Salzer, 2008). The reflectance around 1673 nm and 1746 nm were associated with C-H methyl (Salzer, 2008). The bands around 2046 nm was due to symmetrical NH stretching and amide II (Salzer, 2008).

FIGURE 5

Figure 5 The position of the optimal wavelengths selected for classification and regression models (-C means for classification tasks, -R means for regression tasks).

3.3 Classification models

3.3.1 Classification models using full spectra

In this study, PLS-DA, SVC, and CNNC models were built using full spectra. The results are shown in Table 4. All the models obtained decent results, with the accuracy of the prediction set exceeding 82%. Compared with PLS-DA and SVC models, the CNNC model achieved a more satisfactory result. The accuracy of the prediction set was 84.70%, which illustrates the good performance of CNNC model. The confusion matrix for all datasets of the CNNC model is displayed in Figure 6. In three data sets, about 19-28% of leaf samples with low LNC were easily confused with leaves with medium LNC. Leaves with high LNC were easily confused with those with medium LNC, and the proportion of misclassified leaves was 8%-11%. Overall, leaves with high LNC and leaves with low LNC can be accurately separated, as only one or two leaves with low LNC were misclassified as high LNC, and none samples with high LNC were distinguished as low LNC.

TABLE 4

Table 4 The results of the classification models based on full spectra.

FIGURE 6

Figure 6 Confusion matrix of CNN model using full spectra. (Notes: Number 0, 1, 2 means leaf samples with low-level LNC, medium-level LNC and high-level LNC, respectively.).

3.3.2 Classification models using optimal wavelengths

Table 5 demonstrates the classification results of the PLS-DA, SVC, and CNNC models based on the optimal wavelengths selected by RF, WPLS, and saliency map. The overall performance of the models based on wavelengths selected by RF was superior to the performance of the models constructed on the wavelengths selected by WPLS and saliency map, as the accuracy for the prediction set was slightly higher. For the models based on the optimal wavelengths selected by RF and WPLS, the performance of the CNNC model was superior to that of the PLS-DA and SVC model, accomplishing an accuracy of 84.34% and 83.27% for the prediction set. Regarding the models based on the wavelengths selected by saliency map, SVC performed better than the PLS-DA and CNNC model, with the accuracy of the prediction set reaching 79.36%. Although the CNNC model based on full spectra achieved the best classification accuracy of 84.70% for the prediction set, the CNNC model based on the optimal wavelengths chosen by RF obtained quite similar results. Considering the number of variables used in modeling, the results of CNNC models constructed on the optimal wavelengths selected by RF were reasonably acceptable, which realized comparable performance with the model based on full spectra with less computation.

TABLE 5

Table 5 The results of the classification models based on optimal wavelengths.

3.4 Regression models

3.4.1 Regression models using full spectra

Figure 7 shows the results of different regression models using full spectra for the nitrogen detection of cotton leaves. All the models obtained satisfactory performance, with R²_c (coefficients of determination of calibration set), R²_v (coefficients of determination of validation set), and R²_p (coefficients of determination of prediction set) all exceeding 0.75. Compared with PLSR and SVR models, the CNNR model performed slightly better, achieving the smallest RMSE for the prediction set. These results indicated that VNIR combined with the CNNR model was conducive to effectively characterizing the LNC of cotton leaves.

FIGURE 7

Figure 7 The results of PLSR, SVR, CNNR models based on full spectra for LNC detection.

3.4.2 Regression models using optimal wavelengths

Different regression models were constructed based on the optimal wavelengths for LNC estimation. The results are shown in Table 6. It can be found that the performance of the model established using the optimal wavelengths selected by different methods showed a difference. The models built on the optimal wavelengths selected by WPLS performed slightly better than the models based on the optimal wavelengths selected by RF and saliency map, with higher R²_P and lower RMSEP. In addition, it can be found that the overall performance of CNNR models was better than that of PLS and SVR models. All the CNNR model achieved good results, with R²_c, R²_v, and R²_p were over 0.779, 0.724, and 0.711, indicating the robustness of the CNNR model based on optimal wavelengths. Specifically, the CNNR model based on the wavelengths chosen by WPLS obtained the best result. The R²_p and RMSEP were 0.766 and 3.389, respectively. Besides, a comparison was made between the models based on full spectra and those using the selected optimal wavelengths. Overall, the models established on the chosen variables performed less well than those based on full spectra. The performance of the CNNR model based on optimal wavelengths selected by WPLS was quite close to that based on full spectra. To some extent, the reduced computation compensates for the slight performance deficit, indicating that the CNNR model equipped with optimal wavelength selection methods is effective for cotton LNC estimation.

TABLE 6

Table 6 The results of the regression models based on optimal wavelengths.

3.4.3 Identification of leaf nitrogen status based on the predicted LNC

The predicted LNC values of all samples were calculated by regression model and then classified according to the rules mentioned in section 2.3. Then, the identification accuracy was calculated by comparing the categories corresponding to the predicted and actual values to evaluate the model’s effectiveness. The results are shown in Table 7. It can be seen that the identification of cotton samples based on the predicted value could achieve good results, similar to that of the results based on classification models, which in turn reflects the effectiveness of the regression model.

TABLE 7

Table 7 The identification results of cotton leaves based on the predicted LNC.

4 Discussion

Visible and near-infrared spectral techniques combined with deep learning can be used for nitrogen-level estimation. Some previous studies addressing the nitrogen-level classification of plant leaves were discussed and compared with our results with other spectral imaging works. It must be noted a rough comparison is not rigorous as the papers relate to different plants, techniques, and datasets. Pourdarbani et al. (2022) investigated the feasibility of using hyperspectral imaging to detect excess nitrogen content in tomato plants. Artificial neural networks and the particle swarm optimization algorithm were proposed and achieved a satisfactory classification accuracy of 92.6% for leaves at different nitrogen levels. The leaves in this work (Pourdarbani et al., 2022) were classified according to different days of nitrogen application. Sun et al. (2013) used VNIR to identify the nitrogen level of lettuce leaves. Adaptive boosting was applied with K nearest neighbor and SVM, which could achieve a high classification accuracy of 100%. The samples in this study were divided according to the fertilized nitrogen level. Wang et al. (2018) employed hyperspectral imaging to discriminate nitrogen fertilizer levels of the tea plant. The leaves from three nitrogen fertilizer levels were sampled, and up to 100% accuracy was achieved by the SVM model based on spectral data and textural data. The excellent performance might benefit from the texture information provided by the image. Although the methods mentioned above achieved good results, the samples in these studies were classified according to different nitrogen fertilization levels or different nitrogen fertilization days (Sun et al., 2013; Wang et al., 2018; Pourdarbani et al., 2022). There is a large difference between the fertilization of nitrogen and its actual uptake for the plant. Therefore, the adaptability of the classification models according to the nitrogen fertilization division is greatly limited by the uncertainty of the actual LNC. Thus, in this study, the chemical analysis for nitrogen measurement was conducted, and the leaves were divided into three categories according to the true LNC value of three ranges. Among similar studies which also measured actual LNC, Nativ et al. (Rotbart et al., 2013) used VNIR to estimate the nitrogen concentration in olive leaves. The leaves were divided into three groups according to the measured nitrogen content, and an overall accuracy of 83% was obtained. Du et al. (2016) explored the feasibility of using hyperspectral LiDAR to detect nitrogen content in rice leaves. The accuracy of 83% was obtained when 32 wavelengths were considered. The results in above studies (Rotbart et al., 2013) (Du et al., 2016) are slightly lower than the accuracy of 84.342% achieved by RF-CNN model in this study. It can be observed that although a perfect classification is not achieved, the method used in this study has a relatively higher accuracy of 84.342% in the best case. The performance is quite close to and even higher than the result obtained by other existing methods, which demonstrates that it is feasible to classify cotton leaves with different LNC by VNIR and deep learning algorithm.

Regarding the regression task, in a similar study on the LNC prediction of cotton leaves, Zhang et al. (2022a) explored the potential of using spectra of different ranges to estimate nitrogen content in cotton leaves, and obtained a R²_c = 0.794~ 0.909 and R²_P = 0.774 ~ 0.899. The prediction results of the best model are better than those in this research. The possible reason was that the samples used in this study (Zhang et al., 2022) were acquired at the flower and boll stage of cotton, which only covered two growing stages. The leaves used in our study cover the whole growing stage. Different thicknesses and textures of leaf samples would also cause spectral differences, which may affect the accuracy of nitrogen detection. Besides, the spectra in this paper were collected under two environmental conditions, covering the laboratory environment and the field environment. The difference in the measurement environment would also lead to the difference in the spectra. When the measurement was conducted in the field, there were more interference factors, which was also the reason for the relatively less satisfactory results. However, in practice, due to the diversity of application scenarios and the need for nutrient monitoring over the whole growth cycle of plants, it is critical to develop the models presented in this paper to enhance their applicability.

Besides, deep learning with VNIR performed well in estimating LNC in plant leaves. Table 6 and Figure 7 show that CNNR outperformed PLSR and SVR models, achieving a relatively lower RMSE for the prediction set. The study demonstrated that the CNN model established for regression tasks could achieve good results, which previous studies have confirmed. Weng et al. (2022) combined CNNR and visible and near-infrared reflectance spectroscopy to determine the behenic acid in edible vegetable oils, with R²_P = 0.8843 and RMSEP = 0.1182, outperforming PLSR and SVR model. Wu et al. (2022) applied CNNR and Raman spectroscopy to identify the amount of olive oil in a corn-olive oil blend, with R²_P = 0.9908 and RMSEP = 0.7183. In addition, one-dimension deep learning regression models based on spectral data are performed well in soluble solid content estimation in cherry tomato (Xiang et al., 2022) and oil content prediction of single maize kernel (Zhang et al., 2022c). Hence, the spectral analysis model developed by CNN can be expected to provide a simple, rapid, and accurate analysis of LNC in cotton leaves.

5 Conclusion

In this study, visible and near-infrared spectroscopy combined with deep learning was used to detect LNC in cotton leaves qualitatively and quantitatively. RF, WPLS, and saliency map were used to extract characteristic wavelengths, classification models (PLS-DA, SVC, CNNC) and regression models (PLSR, SVR, CNNR) were established based on full spectra and characteristic wavelengths, respectively. Overall, the models based on CNN architecture performed better than other models for both classification and regression tasks. For the classification task, CNNC model based on full spectra performed best, with the classification accuracy reaching 84.70%. For the regression task, the performance of CNNR model developed on full spectra was superior, achieving an R²_P of 0.77 and an RMSEP of 3.36. The good performance of visible and near-infrared spectroscopy assisted by deep learning demonstrated its effectiveness for nitrogen content prediction of cotton leaves. This approach is helpful for farmers to accurately identify the nutritional status of cotton plants in the field and make reasonable fertilization decisions in time.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

YH, LF, and NW, funding acquisition, conceptualization, and supervision. QX, WT, and NW, data curation and validation. CZ, LF, LZ, ZZ, and PG, formal analysis, and writing, project administration review and editing. CZ and QX, investigation, methodology, and software. JS, resources. QX, NW, and CZ, visualization and writing, original draft. All authors contributed to the article and approved the submitted version.

Funding

This research was supported by XPCC Science and Technology Projects of Key Areas (2020AB005) and National Natural Science Foundation of China (61965014), and the Shenzhen Science and Technology Projects (CJGJZD20210408092401004).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Du, L., Gong, W., Shi, S., Yang, P., Sun, J., Zhu, B., et al. (2016). Estimation of rice leaf nitrogen contents based on hyperspectral lidar. Int. J. Appl. Earth Observation Geoinformation 44, 136–143. doi: 10.1016/j.jag.2015.08.008

CrossRef Full Text | Google Scholar

Feng, L., Wu, B., He, Y., Zhang, C. (2021). Hyperspectral imaging combined with deep transfer learning for rice disease detection. Front. Plant Sci. 12. doi: 10.3389/fpls.2021.693521

CrossRef Full Text | Google Scholar

Gai, Z., Sun, L., Bai, H., Li, X., Wang, J., Bai, S. (2022). Convolutional neural network for apple bruise detection based on hyperspectral. Spectrochimica Acta Part a-Molecular Biomolecular Spectrosc. 279, 121432. doi: 10.1016/j.saa.2022.121432

CrossRef Full Text | Google Scholar

Gao, Z., Luo, N., Yang, B., Zhu, Y. (2022). Estimating leaf nitrogen content in wheat using multimodal features extracted from canopy spectra. Agronomy-Basel 12 (8), 1915. doi: 10.3390/agronomy12081915

CrossRef Full Text | Google Scholar

Gospodinova, G., Panayotova, G. (2019). Strategies for nitrogen fertilization of cotton (gossypium hirsutum l.). a review. Bulgarian J. Agric. Sci. 25, 59–67.

Google Scholar

Guo, Y., Chen, S., Li, X., Cunha, M., Jayavelu, S., Cammarano, D., et al. (2022). Machine learning-based approaches for predicting spad values of maize using multi-spectral images. Remote Sens. 14 (6), 1337. doi: 10.3390/rs14061337

CrossRef Full Text | Google Scholar

Kjeldahl, J. (1883). A new method for the estimation of nitrogen in organic compounds. Analytical Chem. 22, 366–382. doi: 10.1007/BF01338151

CrossRef Full Text | Google Scholar

Kergoat, L., Lafont, S., Arneth, A., Le Dantec, V., Saugier, B. (2008). Nitrogen controls plant canopy light-use efficiency in temperate and boreal ecosystems. J. Geophys. Res. Biogeosci. 113 (G4), G04017. doi: 10.1029/2007JG000676

CrossRef Full Text | Google Scholar

Li, X., Jiang, H., Jiang, X., Shi, M. (2021). Identification of geographical origin of chinese chestnuts using hyperspectral imaging with 1d-cnn algorithm. Agriculture-Basel 11 (12), 1274. doi: 10.3390/agriculture11121274

CrossRef Full Text | Google Scholar

Liu, S., Zhang, S., Zhang, L. (2010). Above-ground dry matter accumulation of cotton genetics at different nitrogen applications. Cotton Sci. 22 (1), 77–82.

Google Scholar

Luo, W., Tian, P., Fan, G., Dong, W., Zhang, H., Liu, X. J. I. P., et al. (2022). Non-destructive determination of four tea polyphenols in fresh tea using visible and near-infrared spectroscopy. Infrared Phys. Technol. 123, 104037. doi: 10.1016/j.infrared.2022.104037

CrossRef Full Text | Google Scholar

Ma, L., Chen, X., Zhang, Q., Lin, J., Yin, C., Ma, Y., et al. (2022). Estimation of nitrogen content based on the hyperspectral vegetation indexes of interannual and multi-temporal in cotton. Agronomy-Basel 12 (6), 1319. doi: 10.3390/agronomy12061319

CrossRef Full Text | Google Scholar

Malacara, D. (2011). Color vision and colorimetry: Theory and applications. 2nd ed (Bellingham, WA, USA: SPIE Press).

Google Scholar

Mehmood, T., Liland, K. H., Snipen, L., Saebo, S. (2012). A review of variable selection methods in partial least squares regression. Chemometrics Intelligent Lab. Syst. 118, 62–69. doi: 10.1016/j.chemolab.2012.07.010

CrossRef Full Text | Google Scholar

Mishra, P., Herrmann, I., Angileri, M. J. T. (2021). Improved prediction of potassium and nitrogen in dried bell pepper leaves with visible and near-infrared spectroscopy utilising wavelength selection techniques. Talanta 225, 121971. doi: 10.1016/j.talanta.2020.121971

PubMed Abstract | CrossRef Full Text | Google Scholar

Pourdarbani, R., Sabzi, S., Rohban, M. H., Garcia-Mateos, G., Molina-Martinez, J. M., Paliwal, J., et al. (2022). Metaheuristic algorithms in visible and near infrared spectra to detect excess nitrogen content in tomato plants. J. near Infrared Spectrosc. 30, 197–207. doi: 10.1177/09670335221098527

CrossRef Full Text | Google Scholar

Rotbart, N., Schmilovitch, Z., Cohen, Y., Alchanatis, V., Erel, R., Ignat, T., et al. (2013). Estimating olive leaf nitrogen concentration using visible and near-infrared spectral reflectance. Biosyst. Eng. 114, 426–434. doi: 10.1016/j.biosystemseng.2012.09.005

CrossRef Full Text | Google Scholar

Salzer, R. (2008). Practical guide to interpretive near-infrared spectroscopy Vol. 47 (Boca Raton, FL, USA: CRC Press).

Google Scholar

Steinbrener, J., Posch, K., Leitner, R. (2019). Hyperspectral fruit and vegetable classification using convolutional neural networks. Comput. Electron. Agric. 162, 364–372. doi: 10.1016/j.compag.2019.04.019

CrossRef Full Text | Google Scholar

Sun, H., Feng, M., Xiao, L., Yang, W., Ding, G., Wang, C., et al. (2021). Potential of multivariate statistical technique based on the effective spectra bands to estimate the plant water content of wheat under different irrigation regimes. Front. Plant Sci. 12. doi: 10.3389/fpls.2021.631573

CrossRef Full Text | Google Scholar

Sun, J., Jin, X., Mao, H., Wu, X., Tang, K., Zhang, X. (2013). Identification of lettuce leaf nitrogen level based on adaboost and hyperspectrum. Spectrosc. Spectral Anal. 33, 3372–3376. doi: 10.3964/j.issn.1000-0593(2013)12-3372-05

CrossRef Full Text | Google Scholar

Tahmasbian, I., Morgan, N. K., Bai, S. H., Dunlop, M. W., Moss, A. F. (2021). Comparison of hyperspectral imaging and near-infrared spectroscopy to determine nitrogen and carbon concentrations in wheat. Remote Sens. 13 (6), 1128. doi: 10.3390/rs13061128

CrossRef Full Text | Google Scholar

Tang, R., Luo, X., Li, C., Zhong, S. (2022). A study on nitrogen concentration detection model of rubber leaf based on spatial-spectral information with nir hyperspectral data. Infrared Phys. Technol. 122, 104094. doi: 10.1016/j.infrared.2022.104094

CrossRef Full Text | Google Scholar

Wang, Y., Hu, X., Hou, Z., Ning, J., Zhang, Z. (2018). Discrimination of nitrogen fertilizer levels of tea plant (camellia sinensis) based on hyperspectral imaging. J. Sci. Food Agric. 98, 4659–4664. doi: 10.1002/jsfa.8996

PubMed Abstract | CrossRef Full Text | Google Scholar

Wan, L., Zhou, W., He, Y., Wanger, T. C., Cen, H. (2022). Combining transfer learning and hyperspectral reflectance analysis to assess leaf nitrogen concentration across different plant species datasets. Remote Sens. Environ. 269, 112826. doi: 10.1016/j.rse.2021.112826

CrossRef Full Text | Google Scholar

Weng, S., Chu, Z., Wang, M., Han, K., Zhu, G., Liu, C., et al. (2022). Reflectance spectroscopy with operator difference for determination of behenic acid in edible vegetable oils by using convolutional neural network and polynomial correction. Food Chem. 367, 130668. doi: 10.1016/j.foodchem.2021.130668

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, X., Gao, S., Niu, Y., Zhao, Z., Ma, R., Xu, B., et al. (2022). Quantitative analysis of blended corn-olive oil based on raman spectroscopy and one-dimensional convolutional neural network. Food Chem. 385, 132655. doi: 10.1016/j.foodchem.2022.132655

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiang, Y., Chen, Q., Su, Z., Zhang, L., Chen, Z., Zhou, G., et al. (2022). Deep learning and hyperspectral images based tomato soluble solids content and firmness estimation. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.860656

CrossRef Full Text | Google Scholar

Xiao, Q., Bai, X., Gao, P., He, Y. (2020). Application of convolutional neural network-based feature extraction and data fusion for geographical origin identification of radix astragali by visible/short-wave near-infrared and near infrared hyperspectral imaging. Sensors 20, 4940. doi: 10.3390/s20174940

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiao, Q., Tang, W., Zhang, C., Zhou, L., Feng, L., Shen, J., et al. (2022). Spectral preprocessing combined with deep transfer learning to evaluate chlorophyll content in cotton leaves. Plant Phenomics 2022, 9813841. doi: 10.34133/2022/9813841

PubMed Abstract | CrossRef Full Text | Google Scholar

Xia, J., Zhang, W., Zhang, W., Yang, Y., Hu, G., Ge, D., et al. (2021). A cloud computing-based approach using the visible near-infrared spectrum to classify greenhouse tomato plants under water stress. Comput. Electron. Agric. 181, 105966. doi: 10.1016/j.compag.2020.105966

CrossRef Full Text | Google Scholar

Yuan, R., Liu, G., He, J., Wan, G., Fan, N., Li, Y., et al. (2021). Classification of lingwu long jujube internal bruise over time based on visible near-infrared hyperspectral imaging combined with partial least squares-discriminant analysis. Comput. Electron. Agric. 182, 106043. doi: 10.1016/j.compag.2021.106043

CrossRef Full Text | Google Scholar

Yun, Y., Li, H., Wood, L. R. E., Fan, W., Wang, J., Cao, D., et al. (2013). An efficient method of wavelength interval selection based on random frog for multivariate spectral calibration. Spectrochimica Acta Part a-Molecular Biomolecular Spectrosc. 111, 31–36. doi: 10.1016/j.saa.2013.03.083

CrossRef Full Text | Google Scholar

Zhang, D., Chen, G., Yin, X., Hu, R., Gu, C., Pan, Z., et al. (2020c). Integrating spectral and image data to detect fusarium head blight of wheat. Comput. Electron. Agric. 175, 105588. doi: 10.1016/j.compag.2020.105588

CrossRef Full Text | Google Scholar

Zhang, J., Cheng, T., Shi, L., Wang, W., Niu, Z., Guo, W., et al. (2022b). Combining spectral and texture features of uav hyperspectral images for leaf nitrogen content monitoring in winter wheat. Int. J. Remote Sens. 43, 2335–2356. doi: 10.1080/01431161.2021.2019847

CrossRef Full Text | Google Scholar

Zhang, J., Liu, Y., He, Y., Hu, G., Bai, N. (2020a). Characterization of deep green infection in tobacco leaves using a hand-held digital light projection based near-infrared spectrometer and an extreme learning machine algorithm. Analytical Lett. 53 (14), 2266–2277. doi: 10.1080/00032719.2020.1738452

CrossRef Full Text | Google Scholar

Zhang, J., Yang, Y., Feng, X., Xu, H., Chen, J., He, Y. (2020b). Identification of bacterial blight resistant rice seeds using terahertz imaging and hyperspectral imaging combined with convolutional neural network. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.00821

CrossRef Full Text | Google Scholar

Zhang, L., Wang, Y., Wei, Y., An, D. (2022c). Near-infrared hyperspectral imaging technology combined with deep convolutional generative adversarial network to predict oil content of single maize kernel. Food Chem. 370, 131047. doi: 10.1016/j.foodchem.2021.131047

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Q., Ma, L., Chen, X., Lin, J., Yin, C., Yao, Q., et al. (2022). Estimation of nitrogen in cotton leaves using different hyperspectral region data. Notulae Botanicae Horti Agrobotanici Cluj-Napoca 50 (1), 12595. doi: 10.15835/nbha50112595

CrossRef Full Text | Google Scholar

Zhou, L., Zhang, C., Liu, F., Qiu, Z., He, Y. (2019). Application of deep learning in food: A review. Compr. Rev. Food Sci. Food Saf. 18, 1793–1811. doi: 10.1111/1541-4337.12492

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: cotton, leaf nitrogen content, spectra, deep learning, visible and near-infrared spectroscopy (Vis-NIR)

Citation: Xiao Q, Wu N, Tang W, Zhang C, Feng L, Zhou L, Shen J, Zhang Z, Gao P and He Y (2022) Visible and near-infrared spectroscopy and deep learning application for the qualitative and quantitative investigation of nitrogen status in cotton leaves. Front. Plant Sci. 13:1080745. doi: 10.3389/fpls.2022.1080745

Received: 26 October 2022; Accepted: 24 November 2022;
Published: 20 December 2022.

Edited by:

Baohua Zhang, Nanjing Agricultural University, China

Reviewed by:

Zichen Huang, Kyoto University, Japan
Jiangbo Li, Beijing Academy of Agriculture and Forestry Sciences, China
Cedric Okinda, Masinde Muliro University of Science and Technology, Kenya

Copyright © 2022 Xiao, Wu, Tang, Zhang, Feng, Zhou, Shen, Zhang, Gao and He. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yong He, yhe@zju.edu.cn

^†These two authors contribute equally to this manuscript

^‡ORCID: Na Wu, orcid.org/0000-0002-2654-6605
Yong He, cnorcid.org/0000-0001-6752-1757

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.