Skip to main content
Top

2004 | OriginalPaper | Chapter

Automatic HTML to XML Conversion

Authors : Shijun Li, Mengchi Liu, Tok Wang Ling, Zhiyong Peng

Published in: Advances in Web-Age Information Management

Publisher: Springer Berlin Heidelberg

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

We present a new approach to automatically convert HTML documents into XML documents. It first captures the inter-blocks nested structure, then the intra-blocks nested structure, which consists of blocks including headings, lists, paragraphs and tables in HTML documents, by exploiting both formatting information and structural information implied by HTML tags.

Metadata
Title
Automatic HTML to XML Conversion
Authors
Shijun Li
Mengchi Liu
Tok Wang Ling
Zhiyong Peng
Copyright Year
2004
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-540-27772-9_78

Premium Partner