Skip to main content
Erschienen in: Journal of Intelligent Information Systems 3/2010

01.12.2010

Semantic-distance based evaluation of ranking queries over relational databases

verfasst von: Liang Zhu, Qin Ma, Chunnian Liu, Guojun Mao, Wenzhu Yang

Erschienen in: Journal of Intelligent Information Systems | Ausgabe 3/2010

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Traditional database search uses pattern match in the comparison process. For a query with some search words, tuples are selected only if the words of the tuples exactly match the query words. In this paper, we propose a new method for evaluating relational ranking queries (or top-N queries) with text attributes. This method defines semantic distance functions and utilizes semantic match between words in database search. The attempt is that tuples, not only exactly matching, but also close to the query according to semantic distances, can both be fetched. The basic idea of the method is to create an index based on WordNet to expand the tuple words semantically. The candidate results for a query are retrieved by the index and a simple SQL selection statement, and then top-N answers are obtained. Extensive experiments are carried out to measure the performance of this new strategy for the evaluation of ranking queries over relational databases.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
A relationship L on the sets X 1, ..., X k is a subset of their Cartesian product, written \(L \subseteq X_{1} \times \) ... ×X k .
 
Literatur
Zurück zum Zitat Bates, M. J. (1989). Rethinking subject cataloging in the online environment. Library Resources & Technical Services, 33(4), 400–412.MathSciNet Bates, M. J. (1989). Rethinking subject cataloging in the online environment. Library Resources & Technical Services, 33(4), 400–412.MathSciNet
Zurück zum Zitat Budanitsky, A., & Hirst, G. (2001). Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Proceedings of NAACL 2001 workshop on WordNet and other lexical resources. Pittsburgh, USA. Budanitsky, A., & Hirst, G. (2001). Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Proceedings of NAACL 2001 workshop on WordNet and other lexical resources. Pittsburgh, USA.
Zurück zum Zitat Buscaldi, D., Rosso, P., & Sanchis, A. E. (2005). A wordnet-based query expansion method for geographical information retrieval. In Working notes for the CLEF workshop. Vienna, Austria. Buscaldi, D., Rosso, P., & Sanchis, A. E. (2005). A wordnet-based query expansion method for geographical information retrieval. In Working notes for the CLEF workshop. Vienna, Austria.
Zurück zum Zitat Carey, M., & Kossmann, D. (1997). On saying “enough already!” In SQL. In Proceedings ACM international conference on management of data (SIGMOD’97) (pp. 219–230). Tucson, Arizona, USA. Carey, M., & Kossmann, D. (1997). On saying “enough already!” In SQL. In Proceedings ACM international conference on management of data (SIGMOD’97) (pp. 219–230). Tucson, Arizona, USA.
Zurück zum Zitat Chen, Y. (2002). Raw relation sets, order fusion and top-N query problem. Ph.D. Dissertation, Department of Computer Science, SUNY at Binghamton. Chen, Y. (2002). Raw relation sets, order fusion and top-N query problem. Ph.D. Dissertation, Department of Computer Science, SUNY at Binghamton.
Zurück zum Zitat Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2001). Introduction to algorithms (2nd ed.). Cambridge: MIT.MATH Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2001). Introduction to algorithms (2nd ed.). Cambridge: MIT.MATH
Zurück zum Zitat Das, S., Chong, E. I., Eadon, G., & Srinivasan, J. (2004). Supporting ontology-based semantic matching in RDBMS. In Proceedings of the thirtieth international conference on very large data bases (VLDB’04) (pp. 1054–1065). Toronto, Canada. Das, S., Chong, E. I., Eadon, G., & Srinivasan, J. (2004). Supporting ontology-based semantic matching in RDBMS. In Proceedings of the thirtieth international conference on very large data bases (VLDB’04) (pp. 1054–1065). Toronto, Canada.
Zurück zum Zitat Deerwester, S., Dumais, S. T., Landauer, T. K., Furnas, G. W., & Harshman, R. A. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6), 391–407.CrossRef Deerwester, S., Dumais, S. T., Landauer, T. K., Furnas, G. W., & Harshman, R. A. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6), 391–407.CrossRef
Zurück zum Zitat Dumais, S. T., Landauer, T. K., & Littman, M. L. (1996). Automatic cross-linguistic information retrieval using latent semantic indexing. In Proceedings ACM SIGIR ’96 workshop on cross-linguistic information retrieval. Zurich, Switzerland. Dumais, S. T., Landauer, T. K., & Littman, M. L. (1996). Automatic cross-linguistic information retrieval using latent semantic indexing. In Proceedings ACM SIGIR ’96 workshop on cross-linguistic information retrieval. Zurich, Switzerland.
Zurück zum Zitat Fellbaum, C. (1998). WordNet: An electronic lexical database. Cambridge: MIT.MATH Fellbaum, C. (1998). WordNet: An electronic lexical database. Cambridge: MIT.MATH
Zurück zum Zitat Hristidis, V., Gravano, L., & Papakonstantinou, Y. (2003). Efficient IR-style keyword search over relational databases. In Proceedings of 29th international conference on very large data bases (VLDB’03) (pp. 850–861). Berlin, Germany. Hristidis, V., Gravano, L., & Papakonstantinou, Y. (2003). Efficient IR-style keyword search over relational databases. In Proceedings of 29th international conference on very large data bases (VLDB’03) (pp. 850–861). Berlin, Germany.
Zurück zum Zitat Hung, E., Deng, Y., & Subrahmanian, V. S. (2004). TOSS: An extension of TAX with ontologies and similarity queries. In Proceedings of the ACM international conference on management of data (SIGMOD’04) (pp. 719–730). Paris, France. Hung, E., Deng, Y., & Subrahmanian, V. S. (2004). TOSS: An extension of TAX with ontologies and similarity queries. In Proceedings of the ACM international conference on management of data (SIGMOD’04) (pp. 719–730). Paris, France.
Zurück zum Zitat Ilyas, I. F., Beskales, G., & Soliman, M. A. (2008). A survey of top-k query processing techniques in relational database systems. ACM Computing Surveys, 40(4), 11.CrossRef Ilyas, I. F., Beskales, G., & Soliman, M. A. (2008). A survey of top-k query processing techniques in relational database systems. ACM Computing Surveys, 40(4), 11.CrossRef
Zurück zum Zitat Kandogan, E., Krishnamurthy, R., Raghavan, S., Vaithyanathan, S., & Zhu, H. (2006). Avatar semantic search: A database approach to information retrieval. In Proceedings of the ACM international conference on management of data (SIGMOD’06) (pp. 790–792). Chicago, Illinois, USA. Kandogan, E., Krishnamurthy, R., Raghavan, S., Vaithyanathan, S., & Zhu, H. (2006). Avatar semantic search: A database approach to information retrieval. In Proceedings of the ACM international conference on management of data (SIGMOD’06) (pp. 790–792). Chicago, Illinois, USA.
Zurück zum Zitat Kruse, P. M., Naujoks, A., Roesner, D., & Kunze, M. (2005). Clever search: A wordnet based wrapper for internet search engines. In Proc. 2nd GermaNet workshop, The Computing Research Repository (CoRR:2005) abs/cs/0501086. Kruse, P. M., Naujoks, A., Roesner, D., & Kunze, M. (2005). Clever search: A wordnet based wrapper for internet search engines. In Proc. 2nd GermaNet workshop, The Computing Research Repository (CoRR:2005) abs/cs/0501086.
Zurück zum Zitat Li, C., Chang, K., Ilyas, I. F., & Song, S. (2005). RankSQL, query algebra and optimization for relational top-k queries. In Proceedings ACM international conference on management of data (SIGMOD’05) (pp. 131–142). Baltimore, Maryland, USA. Li, C., Chang, K., Ilyas, I. F., & Song, S. (2005). RankSQL, query algebra and optimization for relational top-k queries. In Proceedings ACM international conference on management of data (SIGMOD’05) (pp. 131–142). Baltimore, Maryland, USA.
Zurück zum Zitat Lim, L., Wang, H., & Wang, M. (2007). Unifying data and domain knowledge using virtual views. In Proceedings of the 33rd international conference on very large data bases (VLDB’07) (pp. 255–266). Vienna, Austria. Lim, L., Wang, H., & Wang, M. (2007). Unifying data and domain knowledge using virtual views. In Proceedings of the 33rd international conference on very large data bases (VLDB’07) (pp. 255–266). Vienna, Austria.
Zurück zum Zitat Liu, F., Yu, C., Meng, W., & Chowdhury, A. (2006). Effective keyword search in relational databases. In Proceedings of the ACM international conference on management of data (SIGMOD’06) (pp. 563–574). Chicago, IL, USA. Liu, F., Yu, C., Meng, W., & Chowdhury, A. (2006). Effective keyword search in relational databases. In Proceedings of the ACM international conference on management of data (SIGMOD’06) (pp. 563–574). Chicago, IL, USA.
Zurück zum Zitat Lu, Y., Meng, W., Shu, L., Yu, C., & Liu, K. (2005). Evaluation of result merging strategies for metasearch engines. In 6th international conference on web information systems engineering (WISE’05) (pp. 53–66). New York, USA. Lu, Y., Meng, W., Shu, L., Yu, C., & Liu, K. (2005). Evaluation of result merging strategies for metasearch engines. In 6th international conference on web information systems engineering (WISE’05) (pp. 53–66). New York, USA.
Zurück zum Zitat Moldovan, D. I., & Mihalcea, R. (2000). Using wordnet and lexical operators to improve internet searches. IEEE Internet Computing, 4(1), 34–43.CrossRef Moldovan, D. I., & Mihalcea, R. (2000). Using wordnet and lexical operators to improve internet searches. IEEE Internet Computing, 4(1), 34–43.CrossRef
Zurück zum Zitat Motro, A. (1988). VAGUE: A user interface to relational databases that permits vague queries. ACM Transactions on Office Information Systems, 6(3), 187–214. doi:10.1145/45945.48027.CrossRef Motro, A. (1988). VAGUE: A user interface to relational databases that permits vague queries. ACM Transactions on Office Information Systems, 6(3), 187–214. doi:10.​1145/​45945.​48027.CrossRef
Zurück zum Zitat Necib, C. B., & Freytag, J. C. (2003). Ontology based query processing in database management systems. In CoopIS/DOA/ODBASE2003 (pp. 839–857). Catania, Sicily, Italy. Necib, C. B., & Freytag, J. C. (2003). Ontology based query processing in database management systems. In CoopIS/DOA/ODBASE2003 (pp. 839–857). Catania, Sicily, Italy.
Zurück zum Zitat Singhal, A. (2001). Modern information retrieval: A brief overview. IEEE Data Engineering Bulletin, 24(4), 35–43. Singhal, A. (2001). Modern information retrieval: A brief overview. IEEE Data Engineering Bulletin, 24(4), 35–43.
Zurück zum Zitat Singhal, A., Buckley, C., & Mitra, M. (1996). Pivoted document length normalization. In Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR’96) (pp. 21–29). Zurich, Switzerland. Singhal, A., Buckley, C., & Mitra, M. (1996). Pivoted document length normalization. In Proceedings of the 19th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR’96) (pp. 21–29). Zurich, Switzerland.
Zurück zum Zitat Udrea, O., Deng, Y., Hung, E., & Subrahmanian, V. S. (2005). Probabilistic ontologies and relational databases. In CoopIS/DOA/ODBASE2005 (pp. 1–17). Agia Napa, Cyprus. Udrea, O., Deng, Y., Hung, E., & Subrahmanian, V. S. (2005). Probabilistic ontologies and relational databases. In CoopIS/DOA/ODBASE2005 (pp. 1–17). Agia Napa, Cyprus.
Zurück zum Zitat Yu, C., Philip, G., & Meng, W. (2003). Distributed top-N query processing with possibly uncooperative local systems. In Proceedings of 29th international conference on very large data bases (VLDB’03) (pp. 117–128). Berlin, Germany. Yu, C., Philip, G., & Meng, W. (2003). Distributed top-N query processing with possibly uncooperative local systems. In Proceedings of 29th international conference on very large data bases (VLDB’03) (pp. 117–128). Berlin, Germany.
Zurück zum Zitat Zhang, J., Peng, Z., Wang, S., & Nie, H. (2006). Si-SEEKER: Ontology-based semantic search over databases. In Knowledge science, engineering and management, first international conference (KSEM 2006) (pp. 599–611). LNAI 4092. Guilin, China. Zhang, J., Peng, Z., Wang, S., & Nie, H. (2006). Si-SEEKER: Ontology-based semantic search over databases. In Knowledge science, engineering and management, first international conference (KSEM 2006) (pp. 599–611). LNAI 4092. Guilin, China.
Metadaten
Titel
Semantic-distance based evaluation of ranking queries over relational databases
verfasst von
Liang Zhu
Qin Ma
Chunnian Liu
Guojun Mao
Wenzhu Yang
Publikationsdatum
01.12.2010
Verlag
Springer US
Erschienen in
Journal of Intelligent Information Systems / Ausgabe 3/2010
Print ISSN: 0925-9902
Elektronische ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-009-0116-5

Weitere Artikel der Ausgabe 3/2010

Journal of Intelligent Information Systems 3/2010 Zur Ausgabe

Premium Partner