2006 | OriginalPaper | Chapter
Topic Structure Mining Using PageRank Without Hyperlinks
Authors : Hiroyuki Toda, Ko Fujimura, Ryoji Kataoka, Hiroyuki Kitagawa
Published in: Digital Libraries: Achievements, Challenges and Opportunities
Publisher: Springer Berlin Heidelberg
Activate our intelligent search to find suitable subject content or patents.
Select sections of text to find matching patents with Artificial Intelligence. powered by
Select sections of text to find additional relevant content using AI-assisted search. powered by
This paper proposes a novel text mining method for any given document set. It is based on PageRank-based centrality scores within the graph structure generated from the similarity of all document pairs. Evaluations using a newspaper collection show that the proposed approach yields much better performance in terms of main topic identification and topical clustering than the baseline method. Furthermore, we show an example of document set visualization that offers novel document browsing through the topic structure. Experiments show that our topic structure mining method is useful for user-oriented document selection.