Smiles2Monomers: a link between chemical and biological structures for polymers - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Access content directly
Journal Articles Journal of Cheminformatics Year : 2015

Smiles2Monomers: a link between chemical and biological structures for polymers


Background: The monomeric composition of polymers is powerful for structure comparison and synthetic biology , among others. Many databases give access to the atomic structure of compounds but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the tool Smiles2Monomers (s2m), to infer efficiently and accurately the monomeric structure of a polymer from its chemical structure. Results: Our strategy is divided into two steps: first, monomers are mapped on the atomic structure by an efficient subgraph-isomorphism algorithm ; second, the best tiling is computed so that non-overlapping monomers cover all the structure of the target polymer. The mapping is based on a Markovian index built by a dynamic programming algorithm. The index enables s2m to search quickly all the given monomers on a target polymer. After, a greedy algorithm combines the mapped monomers into a consistent monomeric structure. Finally, a local branch and cut algorithm refines the structure. We tested this method on two manually annotated databases of polymers and reconstructed the structures de novo with a sensitivity over 90 %. The average computation time per polymer is 2 s. Conclusion: s2m automatically creates de novo monomeric annotations for polymers, efficiently in terms of time computation and sensitivity. s2m allowed us to detect annotation errors in the tested databases and to easily find the accurate structures. So, s2m could be integrated into the curation process of databases of small compounds to verify the current entries and accelerate the annotation of new polymers. The full method can be downloaded or accessed via a website for peptide-like polymers at
Fichier principal
Vignette du fichier
s13321-015-0111-5.pdf (1.4 Mo) Télécharger le fichier
Origin : Publisher files allowed on an open archive

Dates and versions

hal-01250619 , version 1 (05-01-2016)



Yoann Dufresne, Laurent Noé, Valérie Leclère, Maude Pupin. Smiles2Monomers: a link between chemical and biological structures for polymers. Journal of Cheminformatics, 2015, 7, ⟨10.1186/s13321-015-0111-5⟩. ⟨hal-01250619⟩
729 View
142 Download



Gmail Facebook X LinkedIn More