skip to main content
research-article

Generating XML structure using examples and constraints

Published:01 August 2008Publication History
Skip Abstract Section

Abstract

This paper presents a framework for automatically generating structural XML documents. The user provides a target DTD and an example of an XML document, called a Generate-XML-By-Example Document, or a GxBE document, for short. GxBE documents use a natural declarative syntax, which includes XPath expressions and the function count. Using GxBE documents, users can express important global and local characteristics for the desired target documents, and can require satisfaction of XPath expressions from a given workload. This paper explores the problem of efficiently generating a document that satisfies a given DTD and GxBE document.

References

  1. A. Aboulnaga, J. Naughton, and C. Zhang. Generating synthetic complex-structured XML data. In WebDB, 2001.Google ScholarGoogle Scholar
  2. D. Barbosa, A. Mendelzon, J. Keenleyside, and K. Lyons. ToXgene: an extensible template-based data generator for XML. In WebDB, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Benedikt, W. Fan, and F. Geerts. XPath satisfiability in the presence of dtds. In PODS, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. Binnig, D. Kossman, and E. Lo. Testing database applications. In SIGMOD, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Bruno and S. Chaudhuri. Flexible database generators. In VLDB, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Caron and D. Ziadi. Characterization of glushkov automata. Theoretical Computer Science, 233:75--90, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Cohen. Count-constraints for generating XML. In NGITS, Kibbutz Shefayim, Israel, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. K. Houkjaer, K. Torp, and R. Wind. Simple and realistic data generation. In VLDB, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Jerrum, L. G. Valiant, and V. V. Vazirani. Random generation of combinatorial structures from a uniform distribution. Theoretical Computer Science, 43:169--188, 1986. Google ScholarGoogle ScholarCross RefCross Ref
  10. P. Kilpeläinen and R. Tuhkanen. One-unambiguity of regular expressions with numeric occurrence indicators. Inf. Comput., 205(6):890--916, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Neufeld, G. Moerkotte, and P. C. Lockemann. Generating consistent test data for a variable set of general consistency constraints. The VLDB Journal, 2(2):173--213, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Runapongsa, J. Patel, H. Jagadish, Y. Chen, and S. Al-Khalifa. The Michigan benchmark: towards XML query performance diagnostics. Information Systems, 31(2):73--97, 206. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Schmidt, F. Waas, M. Kersten, M. Carey, I. Manolescu, and R. Busse. XMark: a benchmark for XML data management. In VLDB, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Y. Vardi. The complexity of relational query languages. In STOC, 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. B. Yao, M. Özsu, and N. Khandelwal. XBench benchmark and performance testing of XML DBMSs. In ICDE, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Generating XML structure using examples and constraints

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader