The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species
Journal article, Scientific Reports, 2022

Messaoud Lefouili, Kiwong Nam

Abstract

Identification of genetic variations is a central part of population and quantitative genomics studies based on high-throughput sequencing data. Even though popular variant callers such as Bcftools mpileup and GATK HaplotypeCaller were developed nearly 10 years ago, their performance is still largely unknown for non-human species. Here, we showed by benchmark analyses with a simulated insect population that Bcftools mpileup performs better than GATK HaplotypeCaller in terms of recovery rate and accuracy regardless of mapping software. The vast majority of false positives were observed from repeats, especially for GATK HaplotypeCaller. Variant scores calculated by GATK did not clearly distinguish true positives from false positives in the vast majority of cases, implying that hard-filtering with GATK could be challenging. These results suggest that Bcftools mpileup may be the first choice for non-human studies and that variants within repeats might have to be excluded for downstream analyses.
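The two benchmark metrics named in the abstract can be illustrated with a minimal sketch. The abstract does not define them, so this assumes that recovery rate is the fraction of simulated (true) variants that a caller reports, and that accuracy is the fraction of reported variants that are true. The `Variant` type, the function names, and the toy coordinates are hypothetical and are not taken from the paper; the repeat filter is only one simple way to exclude variants within repeats.

```python
from typing import NamedTuple


class Variant(NamedTuple):
    """Hypothetical minimal variant record (not the paper's data model)."""
    chrom: str
    pos: int  # 1-based position
    ref: str
    alt: str


def recovery_and_accuracy(truth, called):
    """Return (recovery rate, accuracy) under the assumed definitions.

    recovery rate = true positives / simulated variants
    accuracy      = true positives / called variants
    """
    true_positives = truth & called
    recovery = len(true_positives) / len(truth) if truth else 0.0
    accuracy = len(true_positives) / len(called) if called else 0.0
    return recovery, accuracy


def exclude_repeats(variants, repeats):
    """Drop variants that fall inside annotated repeat intervals.

    `repeats` maps a chromosome name to a list of (start, end) pairs,
    1-based and inclusive (an assumed convention for this sketch).
    """
    return {
        v for v in variants
        if not any(start <= v.pos <= end
                   for start, end in repeats.get(v.chrom, []))
    }


# Toy data: three simulated variants, four calls, one repeat region.
truth = {
    Variant("chr1", 100, "A", "G"),
    Variant("chr1", 250, "C", "T"),
    Variant("chr2", 40, "G", "A"),
}
called = {
    Variant("chr1", 100, "A", "G"),
    Variant("chr1", 250, "C", "T"),
    Variant("chr1", 900, "T", "C"),  # false positive inside the repeat
    Variant("chr1", 905, "G", "C"),  # false positive inside the repeat
}
repeats = {"chr1": [(880, 950)]}

recovery_all, accuracy_all = recovery_and_accuracy(truth, called)
filtered = exclude_repeats(called, repeats)
recovery_filtered, accuracy_filtered = recovery_and_accuracy(truth, filtered)
```

In this toy case both false positives sit inside the repeat interval, so removing calls within repeats raises the accuracy without changing the recovery rate. Real call sets would normally be compared from VCF files with a dedicated tool rather than with in-memory sets.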
Main file
2022_Lefouili_Scientific-Reports.pdf (1.35 MB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-03737834, version 1 (25-07-2022)

Licence

Attribution - CC BY 4.0

Identifiers

HAL Id: hal-03737834
DOI: 10.1038/s41598-022-15563-2

Cite

Messaoud Lefouili, Kiwong Nam. The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species. Scientific Reports, 2022, 12 (1), ⟨10.1038/s41598-022-15563-2⟩. ⟨hal-03737834⟩