C. Blaschke, M. A. Andrade, C. Ouzounis, and A. Valencia, Automatic Extraction of biological information from scientific text: protein-protein interactions, Proc. of ISMB'99, 1999.

N. Collier, C. Nobata, and J. Tsujii, Extracting the names of genes and gene products with a hidden Markov model, Proc. COLING'2000, Saarbrücken, 2000.

M. Craven and J. Kumlien, Constructing Biological Knowledge Bases by Extracting Information from Text Sources, Proc. of ISMB'99, 1999.

P. Domingos and M. Pazzani, Beyond independence: conditions for the optimality of the simple Bayesian classifier, Proc. of ICML'96, pp.105-112, 1996.

K. Fukuda, T. Tsunoda, A. Tamura, and T. Takagi, Toward Information Extraction: Identifying protein names from biological papers, Proc. PSB'98, 1998.

K. Humphreys, G. Demetriou, and R. Gaizauskas, Two applications of information extraction to biological science article: enzyme interaction and protein structure, Proc. of PSB'2000, vol.5, pp.502-513, 2000.

G. John and R. Kohavi, Wrappers for feature subset selection, Artificial Intelligence Journal, 1997.

P. Langley and S. Sage, Induction of selective Bayesian classifiers, Proc. of UAI'94, pp.399-406, 1994.

T. M. Mitchell, Machine Learning, 1997.

Proceedings of the Message Understanding Conference, 1992-98.

T. Ono, H. Hishigaki, A. Tanigami, and T. Takagi, Automated extraction of information on protein-protein interactions from the biological literature, Bioinformatics, vol.17, pp.155-161, 2001.

V. Pillet, Méthodologie d'extraction automatique d'information à partir de la littérature scientifique en vue d'alimenter un nouveau système d'information, thèse de l'Université de droit, d'économie et des sciences d, 2000.

D. Proux, F. Rechenmann, L. Julliard, V. Pillet, and B. Jacq, Detecting Gene Symbols and Names in Biological Texts: A First Step toward Pertinent Information Extraction, Genome Informatics, pp.72-80, 1998.

J. R. Quinlan, C4.5: Programs for Machine Learning, 1992.

E. Riloff, Automatically constructing a Dictionary for Information Extraction Tasks, Proc. of AAAI-93, pp.811-816, 1993.

S. Soderland, Learning Information Extraction Rules for Semi-Structured and Free Text, Machine Learning Journal, vol.34, 1999.

B. J. Stapley and G. Benoit, Biobibliometrics: Information Retrieval and Visualization from co-occurrence of gene names in MedLine abstracts, Proc. of PSB'2000, 2000.

J. Thomas, D. Milward, C. Ouzounis, S. Pulman, and M. Carroll, Automatic Extraction of Protein Interactions from Scientific Abstracts, Proc. of PSB'2000, vol.5, pp.502-513, 2000.

Y. Yang and J. Pedersen, A comparative study on feature selection in text categorization, Proc. of ICML'97, 1997.