Journal article, Frontiers in Genetics, Year: 2021

Watch out for a second SNP: focus on multi-nucleotide variants in coding regions and rescued stop-gained

Abstract

Most single-nucleotide polymorphisms (SNPs) are located in non-coding regions, but the fraction usually studied lies in protein-coding regions, because potential impacts on proteins are relatively easy to predict with popular tools such as the Variant Effect Predictor. These tools annotate variants independently, without considering the potential effect of grouped or haplotypic variations, often called “multi-nucleotide variants” (MNVs). Here, we surveyed MNVs using a large RNA-seq dataset comprising 382 chicken samples originating from 11 populations; this dataset was analyzed in the companion paper, in which 9.5M SNPs (including 3.3M SNPs with reliable genotypes) were detected. We focused our study on in-codon MNVs and evaluated their potential mis-annotation. Using GATK HaplotypeCaller read-based phasing results, we identified 2,965 MNVs, each observed in at least five individuals, located in 1,792 genes. We found that 41.1% of them had a novel impact when compared with the effects of their constituent SNPs analyzed separately. The largest shift in annotation concerns consequences originally annotated as stop-gained, of which around 95% were rescued; it is followed by missense consequences, of which 37% were reannotated with a different amino acid. We then present the rescued stop-gained MNVs in more depth and give an illustration in the SLC27A4 gene. As previously shown in human datasets, our results in chicken demonstrate the value of haplotype-aware variant annotation and of considering MNVs in coding regions, particularly when searching for severe functional consequences such as stop-gained variants.
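
To make the idea of a "rescued" stop-gained variant concrete, here is a minimal Python sketch that annotates two SNPs in the same codon first separately and then jointly. It is a toy illustration, not the authors' pipeline: the function names (`apply_snps`, `consequence`), the example codon and the two substitutions are hypothetical, and the sketch assumes a forward-strand codon, the standard genetic code, and that both alternate alleles are carried on the same haplotype.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def apply_snps(codon, snps):
    """Return the codon after applying (position, alt_base) substitutions."""
    bases = list(codon)
    for position, alt_base in snps:
        bases[position] = alt_base
    return "".join(bases)


def consequence(ref_codon, alt_codon):
    """Classify the change from ref_codon to alt_codon (simplified terms)."""
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if alt_aa == ref_aa:
        return "synonymous"
    if alt_aa == "*":
        return "stop_gained"
    if ref_aa == "*":
        return "stop_lost"
    return "missense"


# Hypothetical example: reference codon CAG (Gln) with two in-codon SNPs.
ref = "CAG"
snp_a = (0, "T")  # alone: CAG -> TAG, a stop codon
snp_b = (2, "C")  # alone: CAG -> CAC, His

for label, snps in [("SNP A alone", [snp_a]),
                    ("SNP B alone", [snp_b]),
                    ("A and B together (MNV)", [snp_a, snp_b])]:
    alt = apply_snps(ref, snps)
    print(label, alt, CODON_TABLE[alt], consequence(ref, alt))
```

Annotated on its own, SNP A looks like a stop-gained variant, but on a haplotype that also carries SNP B the codon becomes TAC (Tyr), so the joint consequence is missense. This is the kind of reannotation the abstract describes; a real analysis would also need phasing information, strand handling and transcript coordinates, which this sketch leaves out.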
Main file
fgene-12-659287.pdf (1.83 MB)
Origin: Publisher files allowed on an open archive
Licence: CC BY - Attribution

Dates and versions

hal-03280040, version 1 (25-07-2023)

Cite

Fabien Degalez, Frédéric Jehl, Kévin Muret, Maria Bernard, Frédéric Lecerf, et al. Watch out for a second SNP: focus on multi-nucleotide variants in coding regions and rescued stop-gained. Frontiers in Genetics, 2021, 12, pp. 659287. ⟨10.3389/fgene.2021.659287⟩. ⟨hal-03280040⟩