Characterization of 43 Non-Protein-Coding mRNA Genes in Arabidopsis, Including the MIR162a -Derived Transcripts
Résumé
Abstract Messenger RNAs that do not contain a long open reading frame (ORF) or non-protein-coding RNAs (npcRNAs) are an emerging novel class of transcripts. Their functions may involve the RNA molecule itself and/or short ORF-encoded peptides. npcRNA genes are difficult to identify using standard gene prediction programs that rely on the presence of relatively long ORFs. Here, we used detailed bioinformatic analyses of expressed sequence tag/cDNA databases to detect a restricted set of npcRNAs in the Arabidopsis (Arabidopsis thaliana) genome and further characterized these transcripts using a combination of bioinformatic and molecular approaches. Compositional analyses revealed strong nucleotide strand asymmetries in the npcRNAs, as well as a biased GC content, suggesting the existence of functional constraints on these RNAs. Thirteen of these transcripts display tissue-specific expression patterns, and three are regulated in conditions affecting root architecture. The npcRNA 78 gene contains the miR162 sequence in an alternative intron and corresponds to the MIR162a locus. Although DICER-LIKE 1 (DCL1) mRNA is known to be regulated by miR162-guided cleavage, its level does not change in a mir162a mutant. Alternative splicing of npcRNA 78 leads to several transcript isoforms, which all accumulate in a dcl1 mutant. This suggests that npcRNA 78 is a genuine substrate of DCL1 and that splicing of this microRNA primary transcript and miR162 processing are competitive nuclear events. Our results provide new insights into Arabidopsis npcRNA biology and the potential roles of these genes.