Producing Accurate Interpretable Clusters from High-Dimensional Data
Greene, Derek; Cunningham, Padraig
TCD-CS-2005-42
The primary goal of cluster analysis is to produce clusters that accurately reflect the natural groupings in the data. A second objective that is important for high-dimensional data is to identify features that are descriptive of the clusters. In addition to these requirements, we often wish to allow objects to be associated with more than one cluster. In this paper we present a technique, based on the spectral co-clustering model, that is effective in meeting these objectives. Our evaluation on a range of text clustering problems shows that the proposed method yields accuracy superior to that afforded by existing techniques, while producing cluster descriptions that are amenable to human interpretation.
Keyword(s): Computer Science
Publication Date: 2005
Type: Report
Peer-Reviewed: Unknown
Language(s): English
Institution: Trinity College Dublin
Citation(s): Greene, Derek; Cunningham, Padraig. 'Producing Accurate Interpretable Clusters from High-Dimensional Data'. Dublin: Trinity College Dublin, Department of Computer Science, TCD-CS-2005-42, 2005, 12 pp.
Publisher(s): Trinity College Dublin, Department of Computer Science
File Format(s): application/pdf
First Indexed: 2014-05-13 05:29:25
Last Updated: 2015-04-10 05:13:52