A Categorical Model for Discovering Latent Structure in Social Annotations
Keywords:Social, Annotation, Latent, Tags, Topic , Models
The advent of social tagging systems has enabled a new community-based view of the Web in which objects like images, videos, and Web pages are annotated by thousands of users. Understanding the emergent semantics inherent in the socially-generated collection of annotations has important research implications for information discovery and knowledge sharing. To this end, we propose a novel probabilistic generative model for discovering latent structure in large-scale social annotations. The generative model identifies latent community-based ``categories'' of interest that can be used to group semantically-related tags and to augment traditional content-based information search and discovery. We illustrate the proposed approach over large collections of Web objects annotated by the Flickr and Delicious communities. Additionally, we show how to integrate the annotation-based categorical model with traditional content-based approaches for the effective focused discovery and exploration of Web objects.