2010 | OriginalPaper | Chapter
Automated Constraint Selection for Semi-supervised Clustering Algorithm
Authors : Carlos Ruiz, Carlos G. Vallejo, Myra Spiliopoulou, Ernestina Menasalvas
Published in: Current Topics in Artificial Intelligence
Publisher: Springer Berlin Heidelberg
Activate our intelligent search to find suitable subject content or patents.
Select sections of text to find matching patents with Artificial Intelligence. powered by
Select sections of text to find additional relevant content using AI-assisted search. powered by
The incorporation of background knowledge in unsupervised algorithms has been shown to yield performance improvements in terms of model quality and execution speed. However, performance is dependent on the quantity and quality of the background knowledge being exploited. In this work, we study the issue of selecting Must-Link and Cannot-Link constraints for semi-supervised clustering. We propose “
ConstraintSelector
”, an algorithm that takes as input a set of labeled data instances, from which constraints can be derived, ranks these instances on their usability and then derives constraints from the top-ranked instances only. Our experiments show that
ConstraintSelector
chooses, respectively reduces, the set of candidate constraints without compromising the quality of the derived model.