
Premium content
Access to this content requires a subscription. You must be a premium user to view this content.

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
keywords:
ml
clustering
Clustering traditionally aims to reveal a natural grouping structure model from unlabeled data. However, this model may not always be ideal for users' preference. In this paper, we propose a personalized clustering method which explicitly performs targeted representation learning by interacting with users via modicum task information (e.g., must-link or cannot-link pairs) to guide the clustering direction. First, we query users with the the most informative pairs, i.e., those pairs most hard to cluster and those most easy to miscluster, to facilitate the representation learning in terms of the clustering preference. And then, by exploiting attention mechanism the targeted representation is learned and augmented. By leveraging the targeted representation and constraint clustering loss as well, personalized clustering is obtained. Theoretically, we verify that the risk of personalized clustering is tightly bounded, that guarantees active queries to users do mitigate the clustering risk. Experimentally, extensive results show that our method performs well across different clustering tasks and datasets, even with a limited number of queries.