Department of Computer Science
Web page organization and visualization using generative topographic mapping: A pilot study
Automatic Web page organization and visualization is an effective way for foraging information in a Web structure. Web pages contain both text (content) and links (structure), implying that content and structure analysis techniques should be adopted and properly integrated. In this paper, we take the probabilistic model-based approach and extend a topographypreserving model known as Generative Topography Map (GTM). The extended GTM provides a principled way to integrate Web pages and hyperlinks and project them into a low-dimension latent space (2D in our case) for visualization. The proposed extension has been applied to the WebKB dataset. Based on the preliminary results obtained, we proposed several directions for future research.
Web page organization and visualization, Web content and structure analysis, Generative Topography Map
Source Publication Title
ICML 2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL 2004)
Link to Publisher's Edition
Zhang, Xiao-Feng, Chak-Man Lam, and William K. Cheung. "Web page organization and visualization using generative topographic mapping: A pilot study." ICML 2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL 2004) (2004): 126-131.