Skip to main content
Top

2016 | OriginalPaper | Chapter

On the Complexity of Extracting Subtree with Keeping Distinguishability

Authors : Xianmin Liu, Zhipeng Cai, Dongjing Miao, Jianzhong Li

Published in: Combinatorial Optimization and Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We consider the problem of subtree extraction with guarantee of preserving distinguishability. Given a query q and a tree T, evaluating q on T will output q(T) which is a set of nodes of T. For two nodes a and b in T, they can be distinguished by some query q in T, iff exactly one of them belongs to q(T). Then, given a tree T, a query class \(\mathcal {L}\), and two disjoint node sets A and B of T, a subtree \(T'\) of T is called preserving distinguishability of T, iff (1) \(T'\) contains all nodes in \(A\cup B\), (2) for any node pair \((a,b)\in {A\times B}\), if a and b can be distinguished by some query in \(\mathcal {L}\) in T, they can also be distinguished by some query (not necessarily the same one) in \(\mathcal {L}\) in \(T'\), and (3) for any node pair \((a,b)\in {A\times B}\) and a query \(q\in \mathcal {L}\), if a and b can be distinguished by q in \(T'\), they can also be distinguished by q in T. The subtree extraction problem considered by this paper is to determine whether there is a small enough subtree \(T'\) of T, such that for query class \(\mathcal {L}\) and node sets A and B, \(T'\) preserves the distinguishability of T. In this paper, as an initial attempt of investigating this problem, fixing \(\mathcal {L}\) to be a specific part of tree pattern queries (introduced later), the subtree extraction problem is shown to be NP-complete.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Neven, F., Schwentick, T.: Expressive, efficient pattern languages for tree-structured data. In: PODS (2000) Neven, F., Schwentick, T.: Expressive, efficient pattern languages for tree-structured data. In: PODS (2000)
2.
go back to reference Gou, G., Chirkova, R.: Efficiently querying large XML data repositories: a survey. IEEE Trans. Knowl. Data Eng. 19(10), 1381–1403 (2007)CrossRef Gou, G., Chirkova, R.: Efficiently querying large XML data repositories: a survey. IEEE Trans. Knowl. Data Eng. 19(10), 1381–1403 (2007)CrossRef
3.
4.
go back to reference Bex, G.J., Neven, F., Schwentick, T., Vansummeren, S.: Inference of concise regular expressions and DTDs. ACM Trans. Database Syst. (TODS) 35(2), 11:1–11:47 (2010)CrossRef Bex, G.J., Neven, F., Schwentick, T., Vansummeren, S.: Inference of concise regular expressions and DTDs. ACM Trans. Database Syst. (TODS) 35(2), 11:1–11:47 (2010)CrossRef
5.
6.
go back to reference Raeymaekers, S., Bruynooghe, M., Bussche, J.: Learning (\(k, l\))-contextual tree languages for information extraction. Mach. Learn. 71(2–3), 155–183 (2008)CrossRef Raeymaekers, S., Bruynooghe, M., Bussche, J.: Learning (\(k, l\))-contextual tree languages for information extraction. Mach. Learn. 71(2–3), 155–183 (2008)CrossRef
7.
go back to reference Staworko, S., Wieczorek, P.: Learning twig, path queries. In: ICDT (2012) Staworko, S., Wieczorek, P.: Learning twig, path queries. In: ICDT (2012)
8.
go back to reference Carme, J., Gilleron, R., Lemay, A., Niehren, J.: Interactive learning of node selecting tree transducer. Mach. Learn. 66(1), 33–67 (2007)CrossRef Carme, J., Gilleron, R., Lemay, A., Niehren, J.: Interactive learning of node selecting tree transducer. Mach. Learn. 66(1), 33–67 (2007)CrossRef
9.
go back to reference Bex, G.J., Neven, F., Vansummeren, S.: Learning deterministic regular expressions for the inference of schemas from xml data. ACM Trans. Web 4(4), 1–32 (2010)CrossRef Bex, G.J., Neven, F., Vansummeren, S.: Learning deterministic regular expressions for the inference of schemas from xml data. ACM Trans. Web 4(4), 1–32 (2010)CrossRef
10.
go back to reference Liu, L., Pu, C., Han, W.: XWRAP: an XML-enabled wrapper construction system for web information sources. In: ICDE, pp. 611–621 (2000) Liu, L., Pu, C., Han, W.: XWRAP: an XML-enabled wrapper construction system for web information sources. In: ICDE, pp. 611–621 (2000)
11.
go back to reference Suzuki, Y., Shoudai, T., Uchida, T., Miyahara, T.: Ordered term tree languages which are polynomial time inductively inferable from positive data. Theoret. Comput. Sci. 350(1), 63–90 (2006)MathSciNetCrossRefMATH Suzuki, Y., Shoudai, T., Uchida, T., Miyahara, T.: Ordered term tree languages which are polynomial time inductively inferable from positive data. Theoret. Comput. Sci. 350(1), 63–90 (2006)MathSciNetCrossRefMATH
12.
go back to reference Amer-Yahia, S., Cho, S., Lakshmanan, L.V.S., Srivastava, D.: Tree pattern query minimization. VLDB J. 11(4), 315–331 (2002)CrossRefMATH Amer-Yahia, S., Cho, S., Lakshmanan, L.V.S., Srivastava, D.: Tree pattern query minimization. VLDB J. 11(4), 315–331 (2002)CrossRefMATH
14.
go back to reference Benedikt, M., Koch, C.: XPath leashed. ACM Comput. Surv. (CSUR) 41(1), 1–54 (2009)CrossRef Benedikt, M., Koch, C.: XPath leashed. ACM Comput. Surv. (CSUR) 41(1), 1–54 (2009)CrossRef
15.
go back to reference Papadimitriou, C.H.: Computational Complexity. Addison-Wesley, Boston (1994)MATH Papadimitriou, C.H.: Computational Complexity. Addison-Wesley, Boston (1994)MATH
Metadata
Title
On the Complexity of Extracting Subtree with Keeping Distinguishability
Authors
Xianmin Liu
Zhipeng Cai
Dongjing Miao
Jianzhong Li
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-48749-6_17

Premium Partner