Abstract
This paper presents a framework for automatically generating structural XML documents. The user provides a target DTD and an example XML document, called a Generate-XML-By-Example document (GxBE document, for short). GxBE documents use a natural declarative syntax, which includes XPath expressions and the function count. Using GxBE documents, users can express important global and local characteristics of the desired target documents, and can require that XPath expressions from a given workload be satisfied. This paper explores the problem of efficiently generating a document that satisfies a given DTD and GxBE document.
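To make the idea of count-based requirements on path expressions concrete, the following is a minimal sketch that checks whether a candidate document meets a few such requirements. It is not the paper's generation algorithm and does not use GxBE syntax, which the abstract does not show. The element names, the `CONSTRAINTS` list, and the helpers `count_matches` and `violated` are all assumptions made for illustration. It relies on Python's `xml.etree.ElementTree`, which supports only a limited subset of XPath, so counting is done in Python rather than with the XPath `count` function.

```python
import xml.etree.ElementTree as ET

# Hypothetical candidate document; element names are invented for illustration.
CANDIDATE = """
<library>
  <book><title/><author/><author/></book>
  <book><title/><author/></book>
</library>
""".strip()

# Hypothetical requirements: (path, required number of matching elements).
# This tuple format is an assumption, not the GxBE syntax.
CONSTRAINTS = [
    ("./book", 2),
    ("./book/author", 3),
    ("./book/title", 2),
]


def count_matches(root, path):
    """Return how many elements the path selects, relative to root."""
    return len(root.findall(path))


def violated(root, constraints):
    """Return (path, required, actual) for each requirement that is not met."""
    failures = []
    for path, required in constraints:
        actual = count_matches(root, path)
        if actual != required:
            failures.append((path, required, actual))
    return failures


root = ET.fromstring(CANDIDATE)
print(violated(root, CONSTRAINTS))
```

A checker like this only validates a finished document. The problem the abstract describes is the harder direction: constructing a document that satisfies both the DTD and the stated requirements.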
Index Terms
- Generating XML structure using examples and constraints