A. Doucet and H. Ahonen-myka, « Naive Clustering of a Large XML Document Collection, pp.81-87, 2002.

L. Denoyer and P. Gallinari, Report on the XML mining track at INEX 2007 categorization and clustering of XML documents, ACM SIGIR Forum, vol.42, issue.1, pp.79-91, 2008.
DOI : 10.1145/1394251.1394255
URL : https://hal.archives-ouvertes.fr/hal-01172481

S. Ghosh and P. Mitra, « Combining Content and Structure Similarity for XML Document, pp.1-4, 2008.

T. Joachims, Making large-Scale SVM Learning Practical Advances in Kernel Methods, pp.169-184, 1999.

G. Salton, «Search and retrieval experiments in real-time information retrieval», pp.1082-1093, 1968.

K. «. Sauvagnat, Modèle flexible pour la recherche d'information dans des corpus de documents semi-structurés, Thèse de doctorat, 2005.

A. Vercoustre, M. Fegas, Y. Lechevallier, and T. Despeyroux, Classification de documents XML à partir d'une représentation linéaire des arbres de ces documents, EGC, pp.443-457, 2006.

G. Wisniewski, L. Denoyer, and P. Gallinari, « Classification automatique de documents structurés Application au corpus d'arbres étiquetés de type XML, pp.52-66, 2005.

J. Wu and J. Tang, « A bottom-up approach for XML documents, ACM International Conference Proceeding Series, pp.131-137, 2008.

J. Yang and S. Wang, « Extended VSM for XML Document Classification Using Frequent Subtrees». INEX, pp.441-448, 2010.

J. Yang and F. Zhang, XML Document Classification Using Extended VSM, pp.234-244, 2007.
DOI : 10.1007/978-3-540-85902-4_21

J. Yi and N. Sundaresan, A classifier for semi-structured documents, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '00, pp.340-344, 2000.
DOI : 10.1145/347090.347164