Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli - Archive ouverte HAL Access content directly
Journal Articles Nature Biotechnology Year : 2018

Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli

(1, 2) , (2, 3) ,
1
2
3

Abstract

Comparative analyses of natural and mutated sequences have been used to probe mechanisms of gene expression, but small sample sizes may produce biased outcomes. We applied an unbiased design-of-experiments approach to disentangle factors suspected to affect translation efficiency in E. coli. We precisely designed 244,000 DNA sequences implementing 56 replicates of a full factorial design to evaluate nucleotide, secondary structure, codon and amino acid properties in combination. For each sequence, we measured reporter transcript abundance and decay, polysome profiles, protein production and growth rates. Associations between designed sequences properties and these consequent phenotypes were dominated by secondary structures and their interactions within transcripts. We confirmed that transcript structure generally limits translation initiation and demonstrated its physiological cost using an epigenetic assay. Codon composition has a sizable impact on translatability, but only in comparatively rare elongation-limited transcripts. We propose a set of design principles to improve translation efficiency that would benefit from more accurate prediction of secondary structures in vivo.

Dates and versions

hal-01925637 , version 1 (16-11-2018)

Identifiers

Cite

Guillaume Cambray, Joao Guimarães, Adam P. Arkin. Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli. Nature Biotechnology, 2018, 36 (10), 20 p. ⟨10.1038/nbt.4238⟩. ⟨hal-01925637⟩
70 View
0 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More