Building Fuzzy Thematic Clusters and Mapping Them to Higher Ranks in a Taxonomy
Boris Mirkin, Susana Nascimento, Trevor Fenner, Luis Moniz Pereira. Building Fuzzy Thematic Clusters and Mapping Them to Higher Ranks in a Taxonomy. International Journal of Software and Informatics, 2010, 4(3): 257-275
Abstract: We present a novel methodology for analyzing the activities undertaken in an organization, such as the research conducted in a university department, by mapping them to a related hierarchical taxonomy such as the Classification of Computer Subjects by ACM (ACM-CCS). We start by collecting data on the activities of the individual components of the organization and representing them as the components' fuzzy membership profiles over the subjects of the taxonomy. Our method generalizes the profiles in two steps. The first step finds fuzzy clusters of taxonomy subjects according to the workings of the organization. In the second step, each cluster is mapped to higher ranks of the taxonomy in a parsimonious way. Each step is formalized and solved in a novel way. We build fuzzy clusters of the taxonomy leaves according to the similarity between individual profiles, using a novel additive spectral fuzzy clustering method that, in contrast to other methods, involves a number of model-based stopping conditions. Because the clusters found are not necessarily consistent with the taxonomy, each is treated as a query set. To lift a query set to higher ranks of the taxonomy, we develop an original recursive algorithm for minimizing a penalty function that involves 'head subjects' on the higher ranks of the taxonomy together with their 'gaps' and 'offshoots'. The method is illustrated by applying it to real-world data.
Keywords: additive clustering; spectral clustering; fuzzy clustering; parsimonious lift; research activity structure
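The lifting step described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a crisp (non-fuzzy) query set, a toy taxonomy, and hypothetical penalty weights `head_w`, `gap_w` and `off_w`. It also assumes that a gap is a maximal subtree under a head subject containing no query leaves, and that an offshoot is a query leaf not covered by any head subject. The function name `parsimonious_lift` and all node names are made up for illustration.

```python
from typing import Dict, List, Set, Tuple

Event = Tuple[str, str]  # (kind, node), kind is "head", "gap" or "offshoot"


def parsimonious_lift(children: Dict[str, List[str]], root: str, query: Set[str],
                      head_w: float = 1.0, gap_w: float = 0.25,
                      off_w: float = 0.75) -> Tuple[float, List[Event]]:
    """Recursively choose head subjects minimizing an assumed penalty:
    head_w * #heads + gap_w * #gaps + off_w * #offshoots."""

    def visit(node: str):
        # Returns (subtree contains a query leaf,
        #          best (cost, events) if a head subject lies above this node,
        #          best (cost, events) if no head subject lies above it).
        kids = children.get(node, [])
        if not kids:  # leaf of the taxonomy
            if node in query:
                return True, (0.0, []), (off_w, [("offshoot", node)])
            return False, (gap_w, [("gap", node)]), (0.0, [])
        results = [visit(k) for k in kids]
        if not any(r[0] for r in results):
            # No query leaves below: one maximal gap if covered, free otherwise.
            return False, (gap_w, [("gap", node)]), (0.0, [])
        cov_cost = sum(r[1][0] for r in results)
        cov_events = [e for r in results for e in r[1][1]]
        free_cost = sum(r[2][0] for r in results)
        free_events = [e for r in results for e in r[2][1]]
        head_cost = head_w + cov_cost  # make this node a head subject
        if head_cost < free_cost:      # ties keep the lower-level solution
            free = (head_cost, [("head", node)] + cov_events)
        else:
            free = (free_cost, free_events)
        return True, (cov_cost, cov_events), free

    _, _, best = visit(root)
    return best


# Toy taxonomy (hypothetical): two subjects with three and five leaves.
children = {
    "root": ["A", "B"],
    "A": ["a1", "a2", "a3"],
    "B": ["b1", "b2", "b3", "b4", "b5"],
}
cost, events = parsimonious_lift(children, "root", {"a1", "a2", "b1"})
print(cost, events)
# 2.0 [('head', 'A'), ('gap', 'a3'), ('offshoot', 'b1')]
```

With these assumed weights, subject A is lifted to a head subject with one gap, while the single query leaf under B is cheaper to leave as an offshoot than to cover with a head subject that would incur four gaps.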
© Copyright by Institute of Software, the Chinese Academy of Sciences