Genomes-on-line; The Gene Ontology Consortium, Gene ontology: tool for the unification of biology, Nat Genet, vol.25, pp.25-29, 2000.

E. Kretschmann, W. Fleischmann, and R. Apweiler, Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT, Bioinformatics, vol.17, pp.920-926, 2001.

A. Vinayagam, C. Val, F. Schubert, R. Eils, K. Glatting et al., GOPET: a tool for automated predictions of Gene Ontology terms, BMC Bioinformatics, vol.7, p.161, 2006.

J. R. Quinlan, C4.5: Programs for Machine Learning, Morgan Kaufmann, 1993.

N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-based Learning Methods, Cambridge University Press, 2000.

O. Troyanskaya, K. Dolinski, A. Owen, R. Altman, and D. Botstein, A Bayesian framework for combining heterogeneous data sources for gene function prediction, Proc Natl Acad Sci, vol.100, issue.14, pp.8348-8353, 2003.

Z. Barutcuoglu, R. Schapire, and O. Troyanskaya, Hierarchical multilabel prediction of gene function, Bioinformatics, vol.22, pp.830-836, 2006.

E. Levy, C. Ouzounis, W. Gilks, and B. Audit, Probabilistic annotation of protein sequences based on functional classifications, BMC Bioinformatics, vol.6, p.302, 2005.
URL : https://hal.archives-ouvertes.fr/ensl-00175314

RAFALE: French national project RAFALE

A. Gattiker, K. Michoud, C. Rivoire, A. H. Auchincloss, E. Coudert et al., Automated annotation of microbial proteomes in SWISS-PROT, Computational Biology and Chemistry, vol.27, pp.49-58, 2003.

A. Clare and R. King, Machine learning of functional class from phenotype data, Bioinformatics, vol.18, pp.160-166, 2002.

H. Blockeel and L. De Raedt, Top-Down Induction of First-Order Logical Decision Trees, Artificial Intelligence, vol.101, issue.1-2, pp.285-297, 1998.

H. Blockeel, L. Schietgat, J. Struyf, S. Dzeroski, and A. Clare, Decision Trees for Hierarchical Multilabel Classification: A Case Study in Functional Genomics, Principles and Practice of Knowledge Discovery in Databases (PKDD'06), pp.18-29, 2006.

K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou et al., AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system, Nucleic Acids Res, vol.34, issue.12, pp.3533-3578, 2006.
URL : https://hal.archives-ouvertes.fr/hal-02665192

S. Chaillou, M. C. Champomier-Vergès, M. Cornet, A. Coq, A. M. Dudez et al., The complete genome sequence of the meat-borne lactic acid bacterium Lactobacillus sakei 23K, Nature Biotechnology, vol.23, pp.1527-1560, 2005.
URL : https://hal.archives-ouvertes.fr/hal-02683108

M. van de Guchte, S. Penaud, C. Grimaldi, V. Barbe, K. Bryson et al., The complete genome sequence of Lactobacillus bulgaricus reveals extensive and ongoing reductive evolution, Proc Natl Acad Sci, vol.103, pp.9274-9279, 2006.

I. Moszer, L. Jones, S. Moreira, C. Fabry, and A. Danchin, SubtiList: the reference database for the Bacillus subtilis genome, Nucleic Acids Res, vol.30, pp.62-67, 2002.

S. F. Altschul, T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman, Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res, vol.25, issue.17, pp.3389-3402, 1997.

A. Bairoch, R. Apweiler, C. H. Wu, W. C. Barker, B. Boeckmann et al., The Universal Protein Resource (UniProt), Nucleic Acids Res, vol.33, pp.D154-D159, 2005.

K. Eilbeck, S. E. Lewis, C. J. Mungall, M. Yandell, L. Stein et al., The Sequence Ontology: a tool for the unification of genome annotations, Genome Biology, vol.6, issue.5, p.44, 2005.

A. Bairoch and R. Apweiler, The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000, Nucleic Acids Res, vol.28, pp.45-53, 2000.

E. M. Zdobnov and R. Apweiler, InterProScan-an integration platform for the signature-recognition methods in InterPro, Bioinformatics, vol.17, issue.9, pp.847-855, 2001.

GeneOntology: Gene Ontology, 2007.

A. Clare, Machine learning and data mining for yeast functional genomics, PhD thesis, 2003.

S. Kiritchenko, S. Matwin, R. Nock, and A. F. Famili, Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization, Canadian Conference on Artificial Intelligence, pp.395-406, 2006.

I. Tetko, I. Rodchenkov, M. Walter, T. Rattei, and H. Mewes, Beyond the best match: machine learning annotation of protein sequences by integration of different sources of information, Bioinformatics, vol.24, pp.621-629, 2008.