News & Highlights (this year’s items in bold)
- Metacluster-based Projective Clustering Ensembles
- ”Who’s out there?” Identifying and Ranking Lurkers in Social Networks
- Co-organized Workshop: MultiClust Workshop at ACM SIGKDD 2013
- Minimizing the variance of cluster mixture models for clustering uncertain objects
- A Segment-based Approach To Clustering Multi-Topic Documents
- Exploring Dictionary-based Semantic Relatedness in Labeled Tree Data
- Projective Clustering Ensembles
- Uncertain Centroid based Partitional Clustering of Uncertain Data
- XML Document Clustering Using Structure-Preserving Flat Representation of XML Content and Structure
- Co-organized Workshop: 3Clust Workshop at PAKDD 2012
- A Statistical Model for Topically Segmented Documents
- SIGIR Report on INEX 2010
Tags
classification clustering clustering ensembles document clustering DSA email mining fuzzy logics information extraction information networks linear programming mass spectrometry optimization PageRank PDF documents projective clustering semantic relatedness similarity detection subspace clustering time series uncertain data web content mining web personalization web usage mining web wrapping WordNet word sense disambiguation wrapping XML XML content clustering XML mining XML structure clustering
Author Archives: Andrea
Metacluster-based Projective Clustering Ensembles
F. Gullo, C. Domeniconi, A. Tagarelli. Metacluster-based Projective Clustering Ensembles. Machine Learning Journal (SI on MultiClust), Springer. Accepted: June 7, 2013.
”Who’s out there?” Identifying and Ranking Lurkers in Social Networks
A. Tagarelli, R. Interdonato. ”Who’s out there?” Identifying and Ranking Lurkers in Social Networks. To Appear as full paper in: Proc. of The 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013). Niagara Falls, Canada, … Continue reading
Posted in Conference Proceedings, News
Tagged Alpha-Centrality, information networks, lurking, PageRank, ranking, social networks, twitter
Leave a comment
Co-organized Workshop: MultiClust Workshop at ACM SIGKDD 2013
Andrea is Co-Organizer of the upcoming ACM SIGKDD 2013 Workshop on Multiple Clusterings, Multi-view Data, and Multi-source Knowledge-driven Clustering (MultiClust)
Minimizing the variance of cluster mixture models for clustering uncertain objects
F. Gullo, G. Ponti, A. Tagarelli. Minimizing the variance of cluster mixture models for clustering uncertain objects. Statistical Analysis and Data Mining (SAM) 6(2):116-135, 2013. Online First: November 19, 2012.
Posted in Journals, News
Tagged clustering, uncertain cluster prototype, uncertain data
Leave a comment
A Segment-based Approach To Clustering Multi-Topic Documents
A. Tagarelli, G. Karypis. A Segment-based Approach To Clustering Multi-Topic Documents. Knowledge and Information Systems (KAIS), 1-33. Online First: September 12, 2012.
Multiobjective Optimization of Co-Clustering Ensembles
F. Gullo, AKM Khaled Ahsan Talukder, S. Luke, C. Domeniconi, A. Tagarelli. Multiobjective Optimization of Co-Clustering Ensembles. Fourteenth International Conference on Genetic and Evolutionary Computation (GECCO), pp. 1495-1496. Philadelphia, USA, July 7-11, 2012.
Posted in Conference Proceedings, Publications
Tagged clustering, clustering ensembles, co-clustering, optimization
Leave a comment
Projective Clustering Ensembles
F. Gullo, C. Domeniconi, A. Tagarelli. Projective Clustering Ensembles. Data Mining and Knowledge Discovery (DAMI), Accepted April 3, 2012, Online First: May 3, 2012.
Posted in Journals, News, Publications
Tagged clustering, clustering ensembles, optimization, projective clustering
Leave a comment
Uncertain Centroid based Partitional Clustering of Uncertain Data
F. Gullo, A. Tagarelli. Uncertain Centroid based Partitional Clustering of Uncertain Data. Proceedings of the VLDB Endowment (ACM), 5(7):610-621, 2012. PDF
XML Document Clustering Using Structure-Preserving Flat Representation of XML Content and Structure
F. Hadzic, M. Hecker, A. Tagarelli. XML Document Clustering Using Structure-Preserving Flat Representation of XML Content and Structure. 7th International Conference on Advanced Data Mining and Applications (ADMA’11), LNAI 7121 (Part 2), pp. 403-416. Beijing, China, December 17-19, 2011. PDF
Co-organized Workshop: 3Clust Workshop at PAKDD 2012
Andrea is Co-Organizer of the upcoming PAKDD 2012 Workshop on Multi-view data, High-dimensionality, External Knowledge: Striving for a Unified Approach to Clustering (3Clust)
Posted in Events, News
Tagged alternative clustering, clustering, clustering ensembles, subspace clustering
Leave a comment
A Statistical Model for Topically Segmented Documents
G. Ponti, A. Tagarelli, G. Karypis. A Statistical Model for Topically Segmented Documents. Fourteenth International Conference on Discovery Science (DS’11), LNAI 6926, pp. 247-261. Espoo, Finland, October 5-7, 2011. PDF
A Time Series Approach for Clustering Mass Spectrometry Data
F. Gullo, G. Ponti, A. Tagarelli, G. Tradigo, P. Veltri. A Time Series Approach for Clustering Mass Spectrometry Data. Journal of Computational Science, 3:344-355, 2012. Accepted for publication: June 30, 2011. Available on-line: July 12, 2011
Posted in Journals, News
Tagged clinical data, clustering, mass spectrometry, proteomics, time series
Leave a comment
SIGIR Report on INEX 2010
D. Alexander, P. Arvola, T. Beckers, P. Bellot, T. Chappell, C. M. DeVries, A. Doucet, N. Fuhr, S. Geva, J. Kamps, G. Kazai, M. Koolen, S. Kutty, M. Landoni, V. Moriceau, R. Nayak, R. Nordlie, N. Pharo, E. SanJuan, R. … Continue reading
Posted in Journals, News
Tagged INEX, XML content clustering, XML mining, XML structure clustering
Leave a comment
Collaborative Clustering of XML Documents
S. Greco, F. Gullo, G. Ponti, A. Tagarelli. Collaborative Clustering of XML Documents. Journal of Computer and System Sciences 77:988-1008, 2011. PDF
Posted in Journals, News
Tagged document clustering, P2P, XML, XML content clustering, XML mining, XML structure clustering
Leave a comment
Overview of the INEX 2010 XML Mining Track: Clustering and Classification of XML Documents
C. M. De Vries, R. Nayak, S. Kutty, S. Geva, A. Tagarelli. Overview of the INEX 2010 XML Mining Track: Clustering and Classification of XML Documents. Ninth International Workshop on the Initiative for the Evaluation of XML Retrieval (INEX 2010), LNCS … Continue reading
Posted in Conference Proceedings, News
Tagged document clustering, INEX, Wikipedia, XML, XML content clustering, XML mining, XML structure clustering
Leave a comment
Co-organized Workshop: Knowledge Discovery in Health Care and Medicine (KD-HCM)
Andrea is Co-Organizer of the upcoming ECML PKDD 2011 Workshop on Knowledge Discovery in Health Care and Medicine (KD-HCM)
Posted in Events, News
Leave a comment
Edited Book: XML Data Mining
Andrea is Editor of the upcoming book “XML Data Mining: Models, Methods, and Applications”, IGI Global, 538 pages. Copyright 2012. Release date: November 2011. Publisher’s book page Book Brochure Book Brochure with TOC
Advancing Data Clustering via Projective Clustering Ensembles
F. Gullo, C. Domeniconi, A. Tagarelli. Advancing Data Clustering via Projective Clustering Ensembles. ACM International Conference on Management of Data (SIGMOD’11), pp. 733-744. Athens, Greece, June 12-16, 2011. PDF This paper has been awarded of the SIGMOD11 Repeatability/Workability Evaluation Test. SIGMOD has … Continue reading
Posted in Conference Proceedings, News
Tagged clustering, clustering ensembles, optimization, projective clustering
Leave a comment
Schema-based Web Wrapping
B. Fazzinga, S. Flesca, A. Tagarelli. Schema-based Web Wrapping. Knowledge and Information Systems, 26(1):127-173, 2011.
Posted in Journals, News
Tagged information extraction, schema extraction, web wrapping, wrapper generalization, XML
Leave a comment
Enhancing Single-Objective Projective Clustering Ensembles
F. Gullo, C. Domeniconi, A. Tagarelli. Enhancing Single-Objective Projective Clustering Ensembles. 10th IEEE International Conference on Data Mining (ICDM ’10), pp. 833-838. Sydney, Australia, December 14-17, 2010. PDF
Posted in Conference Proceedings, News
Tagged clustering, clustering ensembles, optimization, projective clustering
Leave a comment
Minimizing the Variance of Cluster Mixture Models for Clustering Uncertain Objects
F. Gullo, G. Ponti, A. Tagarelli. Minimizing the Variance of Cluster Mixture Models for Clustering Uncertain Objects. 10th IEEE International Conference on Data Mining (ICDM ’10), pp. 839-844. Sydney, Australia, December 14-17, 2010.
Posted in Conference Proceedings
Tagged clustering, uncertain data, variance of mixture models
Leave a comment
A Fuzzy Logic Approach to Wrapping PDF Documents
S. Flesca, E. Masciari, A. Tagarelli. A Fuzzy Logic Approach to Wrapping PDF Documents. IEEE Transactions on Knowledge and Data Engineering, 23(12):1826-1841, 2011. Support page for submission
Posted in Journals, News
Tagged fuzzy logics, information extraction, PDF documents, wrapping
Leave a comment
Projective Clustering Ensembles
F. Gullo, C. Domeniconi, A. Tagarelli. Projective Clustering Ensembles. 9th IEEE International Conference on Data Mining (ICDM ’09), pp. 794-799. Miami, Florida, USA, December 6-9, 2009. PDF
Posted in Conference Proceedings
Tagged clustering, clustering ensembles, optimization, projective clustering
Leave a comment
A Time Series Representation Model for Accurate and Fast Similarity Detection
F. Gullo, G. Ponti, A. Tagarelli, S. Greco. A Time Series Representation Model for Accurate and Fast Similarity Detection. Pattern Recognition 42(11):2998-3014, 2009.
Posted in Journals
Tagged classification, clustering, DSA, similarity detection, time series
Leave a comment
Collaborative XML Document Clustering
F. Gullo, G. Ponti, A. Tagarelli, S. Greco. Collaborative XML Document Clustering. 1th International Workshop on Distributed XML Processing. Vienna, Austria, September 22-25, 2009.
Posted in Conference Proceedings
Tagged document clustering, P2P, XML, XML mining, XML structure clustering
Leave a comment
Low-voltage Electricity Customer Profiling based on Load Data Clustering
F. Gullo, G. Ponti, A. Tagarelli, S. Iiritano, M. Ruffolo, D. Labate. Low-voltage Electricity Customer Profiling based on Load Data Clustering. 13th International Database Engineering and Applications Symposium (IDEAS ’09), pp. 330-333. Cetraro, Italy, September 16-18, 2009.
Topic-based Hard Clustering of Documents using Generative Models
G. Ponti, A. Tagarelli. Topic-based Hard Clustering of Documents using Generative Models. 18th International Symposium on Methodologies for Intelligent Systems (ISMIS ‘09), LNAI 5722, pp. 231-240. Prague, Czech Republic, September 14-17, 2009.