Document Type

Conference Paper

Department/Unit

Department of Computer Science

Title

Web page organization and visualization using generative topographic mapping: A pilot study

Language

English

Abstract

Automatic Web page organization and visualization is an effective way for foraging information in a Web structure. Web pages contain both text (content) and links (structure), implying that content and structure analysis techniques should be adopted and properly integrated. In this paper, we take the probabilistic model-based approach and extend a topographypreserving model known as Generative Topography Map (GTM). The extended GTM provides a principled way to integrate Web pages and hyperlinks and project them into a low-dimension latent space (2D in our case) for visualization. The proposed extension has been applied to the WebKB dataset. Based on the preliminary results obtained, we proposed several directions for future research.

Keywords

Web page organization and visualization, Web content and structure analysis, Generative Topography Map

Publication Date

7-2004

Source Publication Title

ICML 2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL 2004)

Start Page

126

End Page

131

Conference Location

Banff, Canada

This document is currently not available here.

Share

COinS