Skip to main content
Erschienen in: Journal of Intelligent Information Systems 1/2014

01.02.2014

Hierarchical directory mapping for category-constrained meta-search

verfasst von: Jyh-Jong Tsay, Chi-Hsiang Lin

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 1/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Hierarchical category directories, in which categories are recursively partitioned into sub-categories, have been provided by many information sources, such as news, online stores and shopping websites. Such information sources categorize instances in their databases, and support category-constrained search in which one usually navigates along the category directory to select a category, and then submits a query to find objects in the selected category whose descriptions match the query. As more and more online sources are available, it is challenging to build a meta-search system which provides a unified directory and a meta-search capability to search and access all sources from different websites in one query submission. One of the fundamental problems in building such a meta-search system is category mapping which maps the selected category in the unified directory to categories provided by the information sources. In this paper, we develop an efficient algorithm for category mapping between hierarchical directories. Our algorithm is based on the following two techniques: consistency refinement and hierarchical substitution, which are developed with extensive use of hierarchical structures. Experiment shows that our approach substantially improves previous approaches, and can be used to implement automatic category mapping for meta-search systems which support category-constrained search.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Chang, C.-H., Kayed, M., Girgis, M.R., Shaalan, K. (2006). A survey of web information extraction systems. IEEE Transactions on Knowledge and Data Engineering, 18(10), 1411–1428.CrossRef Chang, C.-H., Kayed, M., Girgis, M.R., Shaalan, K. (2006). A survey of web information extraction systems. IEEE Transactions on Knowledge and Data Engineering, 18(10), 1411–1428.CrossRef
Zurück zum Zitat Chinchor, N. (1992) MUC-4 evaluation metrics. In Proc. of the Fourth Message Understanding Conference (pp. 22–29). McLean, Virginia, USA. Chinchor, N. (1992) MUC-4 evaluation metrics. In Proc. of the Fourth Message Understanding Conference (pp. 22–29). McLean, Virginia, USA.
Zurück zum Zitat Choi, N., Song, I.-Y., Han, H. (2006). A survey on ontology mapping. ACM SIGMOD Record, 35(3), 34–41.CrossRef Choi, N., Song, I.-Y., Han, H. (2006). A survey on ontology mapping. ACM SIGMOD Record, 35(3), 34–41.CrossRef
Zurück zum Zitat Chuang, S.-L., Chang, K.C.-C., Zhai, C. (2007). Collaborative wrapping: A turbo framework for web data extraction. In Proceedings of the IEEE 23rd international conference on data engineering (ICDE) (pp. 1261–1262). Istanbul, Turkey. Chuang, S.-L., Chang, K.C.-C., Zhai, C. (2007). Collaborative wrapping: A turbo framework for web data extraction. In Proceedings of the IEEE 23rd international conference on data engineering (ICDE) (pp. 1261–1262). Istanbul, Turkey.
Zurück zum Zitat Doan, A., Madhavan, J., Domingos, P., Halevy, A. (2004). Ontology matching: A machine learning approach. In S. Staab & R. Studer (Eds.), Handbook on ontologies in information systems (pp. 397–416). Springer-Verlag. Doan, A., Madhavan, J., Domingos, P., Halevy, A. (2004). Ontology matching: A machine learning approach. In S. Staab & R. Studer (Eds.), Handbook on ontologies in information systems (pp. 397–416). Springer-Verlag.
Zurück zum Zitat Duong, T.H., Nguyen, N.T., Jo, G.S. (2009). A Hybrid method for integrating multiple ontologies. Cybernetics and Systems, 40(2), 123–145.CrossRefMATH Duong, T.H., Nguyen, N.T., Jo, G.S. (2009). A Hybrid method for integrating multiple ontologies. Cybernetics and Systems, 40(2), 123–145.CrossRefMATH
Zurück zum Zitat Ehrig, M., & Staab, S. (2004). QOM—quick ontology mapping. In The 3rd international semantic web conference (pp. 683–697). Hiroshima, Japan. Ehrig, M., & Staab, S. (2004). QOM—quick ontology mapping. In The 3rd international semantic web conference (pp. 683–697). Hiroshima, Japan.
Zurück zum Zitat He, B., & Chang, K.C.-C. (2004). Automatic complex schema matching across web query interfaces: a correlation mining approach. ACM Transactions on Database Systems, 31(1), 346–395.CrossRef He, B., & Chang, K.C.-C. (2004). Automatic complex schema matching across web query interfaces: a correlation mining approach. ACM Transactions on Database Systems, 31(1), 346–395.CrossRef
Zurück zum Zitat Kang, C.-L. (2006). Design and development of an integrated product search system. Master’s Thesis, Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan, ROC. Kang, C.-L. (2006). Design and development of an integrated product search system. Master’s Thesis, Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan, ROC.
Zurück zum Zitat Kaza, S., & Chen, H. (2008). Evaluating ontology mapping techniques: an experiment in public safety information sharing. Decision Support Systems, 45(4), 714–728.CrossRef Kaza, S., & Chen, H. (2008). Evaluating ontology mapping techniques: an experiment in public safety information sharing. Decision Support Systems, 45(4), 714–728.CrossRef
Zurück zum Zitat Kushmerick, N., Weld, D.S., Doorenbos, R.B. (1997). Wrapper induction for information extraction. In: Proc. IJCAI. Nagoya, Aichi, Japan. Kushmerick, N., Weld, D.S., Doorenbos, R.B. (1997). Wrapper induction for information extraction. In: Proc. IJCAI. Nagoya, Aichi, Japan.
Zurück zum Zitat Lewis, D.D., Schapire, R.E., Callan, J.P., Papka, R. (1996). Training algorithms for linear text classifiers. In Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval (pp. 298–306). Zurich, Switzerland. Lewis, D.D., Schapire, R.E., Callan, J.P., Papka, R. (1996). Training algorithms for linear text classifiers. In Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval (pp. 298–306). Zurich, Switzerland.
Zurück zum Zitat Su, W., Wang, J., Lochovsky, F. (2006). Holistic schema matching for web query interface. In Proceedings of EDBT 2006, LNCS 3896 (pp. 77–94). Munich, Germany. Su, W., Wang, J., Lochovsky, F. (2006). Holistic schema matching for web query interface. In Proceedings of EDBT 2006, LNCS 3896 (pp. 77–94). Munich, Germany.
Zurück zum Zitat Tsay, J.-J., & Wang, J.-D. (2004). Improving linear classifier for chinese text categorization. Information Processing and Management: An International Journal, 40(2), 223–237.CrossRefMATH Tsay, J.-J., & Wang, J.-D. (2004). Improving linear classifier for chinese text categorization. Information Processing and Management: An International Journal, 40(2), 223–237.CrossRefMATH
Zurück zum Zitat Tsay, J.-J., Lin, C.-H., Chen, T.-B. (2010). Category mapping for the automatic integration of category-constrained web search. International Journal of Business Intelligence and Data Mining, 5, 43–55.CrossRef Tsay, J.-J., Lin, C.-H., Chen, T.-B. (2010). Category mapping for the automatic integration of category-constrained web search. International Journal of Business Intelligence and Data Mining, 5, 43–55.CrossRef
Zurück zum Zitat Tsay, J.-J., & Tsay, C.-W. (2010). Visual content structures for wrapper induction in building metasearch systems. Toronto, Canada: Web Intelligence 2010. Tsay, J.-J., & Tsay, C.-W. (2010). Visual content structures for wrapper induction in building metasearch systems. Toronto, Canada: Web Intelligence 2010.
Zurück zum Zitat Zhang, Z., He, B., Chang, K.C.-C. (2004). On-the-fly constraint mapping across web query interfaces. In Proceedings of the VLDB workshop on information integration on the web. Toronto, Ontario, Canada. Zhang, Z., He, B., Chang, K.C.-C. (2004). On-the-fly constraint mapping across web query interfaces. In Proceedings of the VLDB workshop on information integration on the web. Toronto, Ontario, Canada.
Zurück zum Zitat Zhou, N. (2003). A study on automatic ontology mapping of categorical information. In Proceedings of the 2003 annual national conference on digital government research (pp. 1–4). Boston, Massachusetts, USA. Zhou, N. (2003). A study on automatic ontology mapping of categorical information. In Proceedings of the 2003 annual national conference on digital government research (pp. 1–4). Boston, Massachusetts, USA.
Metadaten
Titel
Hierarchical directory mapping for category-constrained meta-search
verfasst von
Jyh-Jong Tsay
Chi-Hsiang Lin
Publikationsdatum
01.02.2014
Verlag
Springer US
Erschienen in
Journal of Intelligent Information Systems / Ausgabe 1/2014
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-013-0256-5

Weitere Artikel der Ausgabe 1/2014

Journal of Intelligent Information Systems 1/2014 Zur Ausgabe

Premium Partner