The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue Scientific Reports Année : 2022

The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species

Résumé

Identification of genetic variations is a central part of population and quantitative genomics studies based on high-throughput sequencing data. Even though popular variant callers such as Bcftools mpileup and GATK HaplotypeCaller were developed nearly 10 years ago, their performance is still largely unknown for non-human species. Here, we showed by benchmark analyses with a simulated insect population that Bcftools mpileup performs better than GATK HaplotypeCaller in terms of recovery rate and accuracy regardless of mapping software. The vast majority of false positives were observed from repeats, especially for GATK HaplotypeCaller. Variant scores calculated by GATK did not clearly distinguish true positives from false positives in the vast majority of cases, implying that hard-filtering with GATK could be challenging. These results suggest that Bcftools mpileup may be the first choice for non-human studies and that variants within repeats might have to be excluded for downstream analyses.
Fichier principal
Vignette du fichier
2022_Lefouili_Scientific-Reports.pdf (1.35 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03737834 , version 1 (25-07-2022)

Licence

Paternité

Identifiants

Citer

Messaoud Lefouili, Kiwong Nam. The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species. Scientific Reports, 2022, 12 (1), ⟨10.1038/s41598-022-15563-2⟩. ⟨hal-03737834⟩
58 Consultations
78 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More