Tag Archives: document clustering
A Segment-based Approach To Clustering Multi-Topic Documents
A. Tagarelli, G. Karypis. A Segment-based Approach To Clustering Multi-Topic Documents. Knowledge and Information Systems (KAIS), 1-33. Online First: September 12, 2012.
XML Document Clustering Using Structure-Preserving Flat Representation of XML Content and Structure
F. Hadzic, M. Hecker, A. Tagarelli. XML Document Clustering Using Structure-Preserving Flat Representation of XML Content and Structure. 7th International Conference on Advanced Data Mining and Applications (ADMA’11), LNAI 7121 (Part 2), pp. 403-416. Beijing, China, December 17-19, 2011.
A Statistical Model for Topically Segmented Documents
G. Ponti, A. Tagarelli, G. Karypis. A Statistical Model for Topically Segmented Documents. Fourteenth International Conference on Discovery Science (DS’11), LNAI 6926, pp. 247-261. Espoo, Finland, October 5-7, 2011.
Collaborative Clustering of XML Documents
S. Greco, F. Gullo, G. Ponti, A. Tagarelli. Collaborative Clustering of XML Documents. Journal of Computer and System Sciences 77:988-1008, 2011.
Posted in Journals, News
Tagged document clustering, P2P, XML, XML content clustering, XML mining, XML structure clustering
Overview of the INEX 2010 XML Mining Track: Clustering and Classification of XML Documents
C. M. De Vries, R. Nayak, S. Kutty, S. Geva, A. Tagarelli. Overview of the INEX 2010 XML Mining Track: Clustering and Classification of XML Documents. Ninth International Workshop on the Initiative for the Evaluation of XML Retrieval (INEX 2010), LNCS …
Posted in Conference Proceedings, News
Tagged document clustering, INEX, Wikipedia, XML, XML content clustering, XML mining, XML structure clustering
Edited Book: XML Data Mining
Andrea is the editor of the upcoming book “XML Data Mining: Models, Methods, and Applications”, IGI Global, 538 pages. Copyright 2012. Release date: November 2011.
Collaborative XML Document Clustering
F. Gullo, G. Ponti, A. Tagarelli, S. Greco. Collaborative XML Document Clustering. 1st International Workshop on Distributed XML Processing. Vienna, Austria, September 22-25, 2009.
Posted in Conference Proceedings
Tagged document clustering, P2P, XML, XML mining, XML structure clustering
Topic-based Hard Clustering of Documents using Generative Models
G. Ponti, A. Tagarelli. Topic-based Hard Clustering of Documents using Generative Models. 18th International Symposium on Methodologies for Intelligent Systems (ISMIS ’09), LNAI 5722, pp. 231-240. Prague, Czech Republic, September 14-17, 2009.
A Segment-based Approach To Clustering Multi-Topic Documents
A. Tagarelli, G. Karypis. A Segment-based Approach To Clustering Multi-Topic Documents. Workshop on Text Mining, in conjunction with the 8th SIAM International Conference on Data Mining (SDM ’08). Atlanta, Georgia, USA, April 24-26, 2008.
Mining Categories for Emails via Clustering and Pattern Discovery
G. Manco, E. Masciari, A. Tagarelli. Mining Categories for Emails via Clustering and Pattern Discovery. Journal of Intelligent Information Systems 30(2):153-181, 2008.
Posted in Journals
Tagged document clustering, email mining, frequent pattern discovery
A Tree-based Approach to Clustering XML Documents by Structure
G. Costa, G. Manco, R. Ortale, A. Tagarelli. A Tree-based Approach to Clustering XML Documents by Structure. 8th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD ’04), LNAI 3202, pp. 137-148. Pisa, Italy, September 20-24, 2004.
Posted in Conference Proceedings
Tagged document clustering, XML, XML mining, XML structure clustering
Clustering of XML Documents by Structure based on Tree Matching and Merging
G. Costa, G. Manco, R. Ortale, A. Tagarelli. Clustering of XML Documents by Structure based on Tree Matching and Merging. 12th Italian Symposium on Advanced Database Systems (SEBD ’04), pp. 314-325. S. Margherita di Pula (Cagliari), Italy, June 21-23, 2004.
Posted in Conference Proceedings
Tagged document clustering, XML, XML mining, XML structure clustering
Distance-based Clustering of XML Documents
F. De Francesca, G. Gordano, R. Ortale, A. Tagarelli. Distance-based Clustering of XML Documents. 1st International Workshop on Mining Graphs, Trees and Sequences (MGTS’03) in conjunction with 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD ’03), pp. 75-78. …
Posted in Conference Proceedings
Tagged document clustering, XML, XML mining, XML structure clustering
A Framework for Adaptive Mail Classification
G. Manco, E. Masciari, A. Tagarelli. A Framework for Adaptive Mail Classification. 14th International Conference on Tools with Artificial Intelligence (ICTAI ’02), pp. 387-392. Washington DC, USA, November 4-6, 2002.
Towards an Adaptive Mail Classifier
G. Manco, E. Masciari, M. Ruffolo, A. Tagarelli. Towards an Adaptive Mail Classifier. Workshop su “Apprendimento Automatico: Metodi ed Applicazioni”, Ottavo Convegno dell’Associazione Nazionale per l’Intelligenza Artificiale (AI*IA ’02). Siena, Italy, September 10-13, 2002.