Skip to main content

2004 | OriginalPaper | Buchkapitel

Automatic HTML to XML Conversion

verfasst von : Shijun Li, Mengchi Liu, Tok Wang Ling, Zhiyong Peng

Erschienen in: Advances in Web-Age Information Management

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

We present a new approach to automatically convert HTML documents into XML documents. It first captures the inter-blocks nested structure, then the intra-blocks nested structure, which consists of blocks including headings, lists, paragraphs and tables in HTML documents, by exploiting both formatting information and structural information implied by HTML tags.

Metadaten
Titel
Automatic HTML to XML Conversion
verfasst von
Shijun Li
Mengchi Liu
Tok Wang Ling
Zhiyong Peng
Copyright-Jahr
2004
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-540-27772-9_78

Premium Partner