Skip to main content

2001 | OriginalPaper | Buchkapitel

Automatic Web-Page Classification by Using Machine Learning Methods

verfasst von : Makoto Tsukada, Takashi Washio, Hiroshi Motoda

Erschienen in: Web Intelligence: Research and Development

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

This paper describes automatic Web-page classification by using machine learning methods. Recently, the importance of portal site services is increasing including the search engine function onWorld Wide Web. Especially, the portal site such as Yahoo! service, which hierarchically classifies Web-pages into many categories, is becoming popular. However, the classification of Web-page into each category relies on man power, which costs much time and care. To alleviate this problem, we propose techniques to generate attributes by using co-occurrence analysis and to classify Web-page automatically based on machine learning. We apply these techniques to Web-pages on Yahoo! JAPAN and construct decision trees, which determine appropriate category for each Web-page. The performance of this proposed method is evaluated in terms of error rate, recall, and precision. The experimental evaluation demonstrates that this method provides acceptable accuracy with the classification of Web-page into top level categories on Yahoo! JAPAN.

Metadaten
Titel
Automatic Web-Page Classification by Using Machine Learning Methods
verfasst von
Makoto Tsukada
Takashi Washio
Hiroshi Motoda
Copyright-Jahr
2001
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-45490-X_36

Premium Partner