Comparison between ParSketch-PLSDA and PLSDA in a context of large amounts of spectral data for sunflower genotype discrimination

Maxime Ryckewaert; Maxime Metz; Belal Gaci; Aldrig Courand; Daphné Heran; Ryad Bendoula

Conference paper. Year: 2021


Maxime Ryckewaert

Role: Author
PersonId: 736123
IdHAL: maxime-ryckewaert
ORCID: 0000-0002-9494-797X
IdRef: 253121094

Technologies et Méthodes pour les Agricultures de demain

Maxime Metz

Role: Author
PersonId: 1243639
IdRef: 268867356

Belal Gaci

Role: Author
PersonId: 1140382
ORCID: 0000-0003-4986-7038

Aldrig Courand

Role: Author

Daphné Heran

Role: Author
PersonId: 743541
IdHAL: daphne-heran
ORCID: 0000-0002-1014-7976

Technologies et Méthodes pour les Agricultures de demain

Ryad Bendoula

Role: Author
PersonId: 735869
IdHAL: ryad-bendoula
ORCID: 0000-0002-2794-1252
IdRef: 098465252

Technologies et Méthodes pour les Agricultures de demain

Abstract

In recent years, high-throughput plant phenotyping (HTPP) platforms have multiplied, both in the laboratory and directly in the field. Optical instruments such as NIR spectroscopy and hyperspectral imaging are increasingly used on these platforms, making it possible to generate large quantities of data automatically, which is of potential value for plant breeding. However, such large volumes of data, often called massive data, are difficult to manage and analyze. Partial Least Squares (PLS) is the most widely used method for predicting biochemical variables from spectra, and it assumes a linear relationship between the two. In large spectral datasets, complex structures and non-linear relationships appear, which can compromise linear regression approaches. A method called parSketch-PLS was recently proposed (Metz et al. 2020) to apply a local approach to large volumes of spectral data. It combines a fast neighborhood search method (parSketch) with PLS. The method can also predict categorical variables by adding a Discriminant Analysis (DA) step, in the same way that PLS is extended to PLS-DA. In this presentation, we compare parSketch-PLS-DA with the reference method PLS-DA in the context of varietal discrimination. For this purpose, a spectral database was built by collecting 1,300,000 spectra from hyperspectral images of leaves of four sunflower genotypes. The PLS-DA prediction model has a classification error close to 23% on average across all genotypes. parSketch-PLS-DA outperforms PLS-DA, improving prediction quality by 10%. These results are encouraging and help anticipate the future bottleneck created by the large amounts of data generated by phenotyping, with their complex structures and non-linear relationships.
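The local strategy described in the abstract (find the neighbors of each new spectrum, then fit a PLS-DA model on those neighbors only) can be illustrated with a minimal Python sketch. This is not the authors' implementation. Several things here are assumptions made for illustration: an exact brute-force Euclidean neighbor search stands in for parSketch, the DA step is taken to be an argmax over one-hot PLS scores, and the function names (pls_fit, plsda_predict, local_plsda_predict), the toy two-class data and all parameter values are hypothetical.

```python
import numpy as np


def pls_fit(X, Y, n_components):
    """Fit a PLS2 model; returns (x_mean, y_mean, B) such that
    Y is approximated by (X - x_mean) @ B + y_mean."""
    x_mean, y_mean = X.mean(axis=0), Y.mean(axis=0)
    Xr, Yr = X - x_mean, Y - y_mean
    W, P, Q = [], [], []
    for _ in range(n_components):
        # Weight vector: dominant left singular vector of the X-Y cross-product.
        u, s, _vt = np.linalg.svd(Xr.T @ Yr, full_matrices=False)
        if s[0] < 1e-12:  # no covariance left (e.g. a single class locally)
            break
        w = u[:, 0]
        t = Xr @ w                  # scores
        tt = t @ t
        p = Xr.T @ t / tt           # X loadings
        q = Yr.T @ t / tt           # Y loadings
        Xr = Xr - np.outer(t, p)    # deflation
        Yr = Yr - np.outer(t, q)
        W.append(w)
        P.append(p)
        Q.append(q)
    if not W:  # constant response: predict the mean of Y
        return x_mean, y_mean, np.zeros((X.shape[1], Y.shape[1]))
    W, P, Q = np.array(W).T, np.array(P).T, np.array(Q).T
    B = W @ np.linalg.pinv(P.T @ W) @ Q.T
    return x_mean, y_mean, B


def plsda_predict(X_train, y_train, X_new, n_classes, n_components):
    """Global PLS-DA: PLS on one-hot labels, then argmax (assumed DA rule)."""
    Y = np.eye(n_classes)[y_train]
    x_mean, y_mean, B = pls_fit(X_train, Y, n_components)
    return np.argmax((X_new - x_mean) @ B + y_mean, axis=1)


def local_plsda_predict(X_train, y_train, X_new, n_classes, n_components, k):
    """Local PLS-DA: one model per query, fitted on its k nearest neighbors.
    Brute-force search is used here as a stand-in for parSketch."""
    preds = np.empty(len(X_new), dtype=int)
    for i, x in enumerate(X_new):
        dist2 = np.sum((X_train - x) ** 2, axis=1)
        idx = np.argsort(dist2)[:k]
        preds[i] = plsda_predict(X_train[idx], y_train[idx], x[None, :],
                                 n_classes, n_components)[0]
    return preds


# Toy data (hypothetical): 30-variable "spectra" driven by two latent factors,
# with an XOR-like class rule that no single linear boundary can separate.
rng = np.random.default_rng(0)
loadings = rng.normal(size=(2, 30))
z = rng.uniform(-1, 1, size=(500, 2))
y = (z[:, 0] * z[:, 1] > 0).astype(int)
X = z @ loadings + 0.01 * rng.normal(size=(500, 30))
X_tr, y_tr, X_te, y_te = X[:400], y[:400], X[400:], y[400:]

err_global = np.mean(plsda_predict(X_tr, y_tr, X_te, 2, 2) != y_te)
err_local = np.mean(local_plsda_predict(X_tr, y_tr, X_te, 2, 2, k=30) != y_te)
print(f"global PLS-DA error: {err_global:.2f}")
print(f"local PLS-DA error:  {err_local:.2f}")
```

The brute-force search costs one pass over the whole training set per query, which is exactly what becomes impractical at the scale of 1,300,000 spectra and what a fast neighborhood search is meant to avoid. The error rates printed on this toy set say nothing about the results reported for the sunflower database.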

Keywords

Massive data; Spectroscopy; Digital agriculture; Plant breeding; Chemometrics

Domains

Statistics [math.ST]; Plant breeding


https://hal.inrae.fr/hal-03783378

Submitted on: Thursday, September 22, 2022, 10:44:24

Last modified on: Thursday, March 14, 2024, 03:15:27

Dates and versions

hal-03783378, version 1 (22-09-2022)

Identifiers

HAL Id: hal-03783378, version 1

Cite

Maxime Ryckewaert, Maxime Metz, Belal Gaci, Aldrig Courand, Daphné Heran, et al. Comparison between ParSketch-PLSDA and PLSDA in a context of large amounts of spectral data for sunflower genotype discrimination. NIR2021 - 20th International Conference on Near Infrared Spectroscopy, Oct 2021, Beijing, China. ⟨hal-03783378⟩


Collections

ITAP INSTITUT-AGRO-MONTPELLIER INRAE INRAEOCCITANIEMONTPELLIER MATHNUM RESEAU-EAU
