B. Lamiroy, D. Lopresti, H. Korth, and J. Heflin, How Carefully Designed Open Resource Sharing Can Help and Expand Document Analysis Research, Document Recognition and Retrieval XVIII - DRR 2011, vol.7874, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00537035

B. Lamiroy and D. Lopresti, An Open Architecture for End-to-End Document Analysis Benchmarking, 11th International Conference on Document Analysis and Recognition - ICDAR 2011, pp.42-47, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00598907

B. Lamiroy and D. Lopresti, A Platform for Storing, Visualizing, and Interpreting Collections of Noisy Documents, Fourth Workshop on Analytics for Noisy Unstructured Text Data - AND'10, ACM International Conference Proceeding Series, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00516678

G. Salton and C. Yang, On the specification of term values in automatic indexing, 1973.

A. Sieg, B. Mobasher, S. Lytinen, and R. Burke, Using concept hierarchies to enhance user queries in web-based information retrieval, Proceedings of the International Conference on Artificial Intelligence and Applications, 2004.

A. Pretschner and S. Gauch, Ontology based personalized search, ICTAI '99: Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence, p.391, 1999.

X. Li, Y. Wang, and A. Acero, Learning query intent from regularized click graphs, SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, pp.339-346, 2008.

J. Arguello, F. Diaz, J. Callan, and J. Crespo, Sources of evidence for vertical selection, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, SIGIR '09, pp.315-322, 2009.

Author-produced version of the paper presented at 6ème atelier Recherche d'Information SEmantique RISE, SDNRI, 2014.

A. Sutcliffe and M. Ennis, Towards a cognitive theory of information retrieval, HCI and Information Retrieval, vol.10, issue.3, pp.321-351, 1998.

P. N. Bennett, R. W. White, W. Chu, S. T. Dumais, P. Bailey et al., Modeling the impact of short- and long-term behavior on search personalization, Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '12, pp.185-194, 2012.

D. J. Liebling, P. N. Bennett, and R. W. White, Anticipatory search: using context to initiate search, SIGIR, pp.1035-1036, 2012.

D. Milne and I. H. Witten, Learning to link with Wikipedia, Proceeding of the 17th ACM conference on Information and knowledge management, CIKM, 2008.

P. Resnik, Using information content to evaluate semantic similarity in a taxonomy, Proceedings of the 14th international joint conference on Artificial intelligence, vol.1, p.29, 1995.

P. Wang and C. Domeniconi, Building semantic kernels for text classification using Wikipedia, 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp.713-721, 2008.

M. Mohler and R. Mihalcea, Text-to-text semantic similarity for automatic short answer grading, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, pp.567-575, 2009.

S. Albitar, S. Fournier, and B. Espinasse, Conceptualization Effects on MEDLINE Documents Classification Using Rocchio Method, Web Intelligence, pp.462-466, 2012.

A. Huang, Similarity measures for text document clustering, presented at the Sixth New Zealand Computer Science Research Student Conference, 2008.

W. Hersh, C. Buckley, T. J. Leone, and D. Hickam, OHSUMED: an interactive retrieval evaluation and new large test collection for research, 17th annual international ACM SIGIR conference on Research and development in information retrieval, pp.192-201, 1994.

UMLS®, Unified Medical Language System, 2013.

A. R. Aronson and F. M. Lang, An overview of MetaMap: historical perspective and recent advances, J Am Med Inform Assoc, vol.17, pp.229-236, 2010.

T. Pedersen, S. V. Pakhomov, S. Patwardhan, and C. G. Chute, Measures of semantic similarity and relatedness in the biomedical domain, J. of Biomedical Informatics, vol.40, pp.288-299, 2007.

J. E. Caviedes and J. J. Cimino, Towards the development of a conceptual distance metric for the UMLS, J. of Biomedical Informatics, vol.37, pp.77-85, 2004.

Z. Wu and M. Palmer, Verbs semantics and lexical selection, Proceedings of the 32nd annual meeting on Association for Computational Linguistics, 1994.

C. Leacock and M. Chodorow, Combining Local Context and WordNet Similarity for Word Sense Identification, WordNet: An Electronic Lexical Database, pp.265-283, 1998.

J. Zhong, H. Zhu, J. Li, and Y. Yu, Conceptual Graph Matching for Semantic Search, Proceedings of the 10th International Conference on Conceptual Structures: Integration and Interfaces, 2002.

H. Al-Mubaid and H. A. Nguyen, A Cluster-Based Approach for Semantic Similarity in the Biomedical Domain, Engineering in Medicine and Biology Society, 2006. EMBS '06. 28th Annual International Conference of the IEEE, pp.2713-2717, 2006.

T. Gruber, Toward principles for the design of ontologies used for knowledge sharing, International Journal of Human-Computer Studies, vol.43, pp.907-928, 1993.

D. Hérin, M. Sala, and P. Pompidor, Evaluating and Revising Courses from Learning Web Resources, 2002.

P. Pompidor, M. Sala, and D. Hérin, An incremental method for extraction of pedagogical knowledges on the web, SW-WL'2003 & EIAH'2003, 2003.

M. Sala, P. Pompidor, and D. Hérin, Aid to the Semantic Maintenance of the Web Site, IADIS WWW/Internet'03, 2003.
URL : https://hal.archives-ouvertes.fr/lirmm-00269659

R. Nkambou, C. Frasson, and G. Gauthier, CREAM-Tools: An Authoring Environment for Knowledge Engineering in Intelligent Tutoring Systems, Authoring Tools for Advanced Technology Learning Environments: Toward cost-effective, adaptive, interactive, and, pp.93-138, 2003.

N. Hernandez and J. Mothe, TtoO: une méthodologie de construction d'ontologie de domaine à partir d'un thésaurus et d'un corpus de référence, 2006.

P. Cimiano and J. Völker, Text2Onto - A Framework for Ontology Learning and Data-driven Change Discovery, Proceedings of the 10th International Conference on Applications of Natural Language to Information Systems (NLDB), pp.227-238, 2005.

A. Maedche and S. Staab, Ontology Learning for the Semantic Web, IEEE Intelligent Systems, Special, 2001.

H. Roitman and A. Gal, OntoBuilder: fully automatic extraction and consolidation of ontologies from web sources using sequence semantics, EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology, pp.573-576, 2006.

M. Ferdinand, C. Zirpins, and D. Trastour, Lifting XML Schema to OWL, Web Engineering Lecture Notes in Computer Science, vol.3140, pp.354-358, 2004.

H. Bohring and S. Auer, Mapping XML to OWL Ontologies, Leipziger Informatik-Tage, vol.72, pp.147-156, 2005.

R. Ghawi and N. Cullot, Building Ontologies from XML Data Sources, DEXA '09: 20th International Workshop on Database and Expert Systems Application, pp.480-484, 2009.

A. Gras-Velazquez, Teachers and content packaging standards. Initial conclusions from the ASPECT evaluation, retrieved from Adopting Standards and Specifications for Educational Content

R. Gómez-de-Regil, Retour d'expérience sur le pilote ASPECT, 2011.

C. Desmoulins, Construction avec des enseignants d'une ontologie des compétences en géométrie, 2010.

I. Bedini and B. Nguyen, Automatic Ontology Generation: State of the Art. University of Versailles Technical report, 2007.

B. S. Bloom, M. D. Engelhart, E. J. Furst, W. H. Hill, and D. R. Krathwohl, Taxonomy of educational objectives: Handbook I: Cognitive domain, 1956.

H. P. Luhn, The automatic creation of literature abstracts, IBM Journal on Research and Development, vol.2, issue.2, 1958.

K. Spärck Jones, A statistical interpretation of term specificity and its application in retrieval, Journal of Documentation, vol.28, 1972.

H. Schmid, Probabilistic part-of-speech tagging using decision trees, International Conference on New Methods in Language Processing, 1994.

P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer, DBpedia Spotlight: Shedding Light on the Web of Documents, Proceedings of the 7th International Conference on Semantic Systems (I-Semantics), 2011.

F. Breitling, A standard transformation from XML to RDF via XSLT, Astronomical Notes, pp.755-760, 2009.

J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas et al., DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia, Semantic Web Journal, 2013.

D. Torres, P. Molli, H. Skaf-Molli, and A. Diaz, Improving Wikipedia with DBpedia, Proceedings of the 21st international conference companion on World Wide Web, pp.1107-1112, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00688145

O. Collin, B. Gaillard, and J. Bouraoui, Constitution d'une ressource sémantique issue du treillis des catégories de Wikipedia, TALN 2010 - Session Posters, 2010.

I. Fernández-Tobías, M. Kaminskas, I. Cantador, and F. Ricci, A generic semantic-based framework for cross-domain recommendation, HetRec '11: Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems, pp.25-32, 2011.

F. Scharffe, G. Atemezing, R. Troncy, F. Gandon, S. Villata et al., Enabling linked-data publication with the datalift platform, Proc. AAAI workshop on semantic cities, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00768424

R. Mihalcea, C. Corley, and C. Strapparava, Corpus-based and knowledge-based measures of text semantic similarity, Proceedings of the 21st national conference on Artificial intelligence, vol.1, pp.775-780, 2006.

E. Gabrilovich and S. Markovitch, Computing semantic relatedness using Wikipedia-based explicit semantic analysis, Proceedings of the 20th international joint conference on Artificial intelligence, IJCAI'07, pp.1606-1611, 2007.

D. Bär, C. Biemann, I. Gurevych, and T. Zesch, UKP: Computing semantic textual similarity by combining multiple content similarity measures, Proceedings of the 6th International Workshop on Semantic Evaluation, pp.435-440, 2012.

D. Buscaldi, J. Le-Roux, J. J. Garcia-Flores, and A. Popescu, LIPN-CORE: Semantic text similarity using n-grams, WordNet, syntactic analysis, ESA and information retrieval based features, Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, vol.1, pp.162-168, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00825054

M. Baziz, M. Boughanem, and N. Aussenac-Gilles, Conceptual indexing based on document content representation, CoLIS'05, 2005.

M. Bendersky and W. Croft, Discovering key concepts in verbose queries, SIGIR '08, pp.491-498, 2008.

A. Berger and J. Lafferty, Information retrieval as statistical translation, SIGIR '99, pp.222-229, 1999.

J. Chevallet, X-IOTA: An open XML framework for IR experimentation, Lecture Notes in Computer Science, vol.3411, pp.263-280, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00953922

J. Chevallet, J. Lim, and D. Le, Domain knowledge conceptual inter-media indexing: Application to multilingual multimedia medical reports, CIKM '07, pp.495-504, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00953881

F. Crestani, Exploiting the similarity of non-matching terms at retrieval time, vol.2, pp.25-45, 2000.

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, pp.391-407, 1990.

Y. Jing and W. Croft, An association thesaurus for information retrieval, pp.146-160, 1994.

M. Karimzadehgan and C. Zhai, Estimation of statistical translation models based on mutual information for ad hoc information retrieval, pp.323-330, 2010.

R. Krovetz, Viewing morphology as an inference process, pp.191-202, 1993.

SIGIR '01, pp.120-127, 2001.

J. Lin and D. Demner-Fushman, The role of knowledge in conceptual retrieval: a study in the domain of clinical medicine, SIGIR '06, pp.99-106, 2006.

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, 2008.

F. Peng, N. Ahmed, X. Li, and Y. Lu, Context sensitive stemming for web search, SIGIR '07, pp.639-646, 2007.

J. M. Ponte and W. B. Croft, A language modeling approach to information retrieval, SIGIR '98, pp.275-281, 1998.

M. F. Porter, An algorithm for suffix stripping, Readings in Information Retrieval, pp.313-316, 1997.

G. Salton (ed.), The SMART Retrieval System - Experiments in Automatic Document Processing, 1971.

D. Widdows, Geometry and Meaning. Center for the Study of Language and Inf, 2004.

C. Zhai, Statistical Language Models for Information Retrieval, 2008.

C. Zhai and J. Lafferty, A study of smoothing methods for language models applied to information retrieval, vol.22, pp.179-214, 2004.
