Skip to main content
Erschienen in: Journal on Data Semantics 3/2014

01.09.2014 | Original Article

Context-Based Query Using Dependency Structures Based on Latent Topic Model

verfasst von: Masato Shirai, Takashi Yanagisawa, Takao Miura

Erschienen in: Journal on Data Semantics | Ausgabe 3/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

To improve and enhance information retrieval on text database, there have been many approaches proposed so far, but few investigation captures context aspects of queries (of languages) directly. Here, we propose a new approach to retrieve contextual dependencies in Japanese based on latent topic model. The key idea comes from dependency structure which captures context in the database and the queries. We examine some experimental results to see the effectiveness.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
In other words, a topic does not mean human-recognizable subject such as politics or airplane but a kind of cluster putting together by some probabilistic measure.
 
2
Early draft version of this work appeared as “Context-based Query using Dependency Structures based on Latent Topic Model” in 2nd International Conference on Model and Data Engineering (MEDI2012), Poitiers, France. We have extended the comparison with several relevant investigation, revised the discussion section and some other minor changes.
 
3
word is a syntax.
 
4
One exception is any predicate should appear as a last verb.
 
5
We mean we may generate dependencies based on this probability distribution.
 
6
Here, we assume the joint probability in a form of naive Bayesian manner.
 
7
Clinton, ZeroZero, Ashita appear where the latter two words show the names of Manga.
 
Literatur
1.
Zurück zum Zitat Berger A, Lafferty J (1999) Information retrieval as statistical translation. Proceedings of the ACM SIGIR, pp 222–229 Berger A, Lafferty J (1999) Information retrieval as statistical translation. Proceedings of the ACM SIGIR, pp 222–229
2.
Zurück zum Zitat Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022MATH Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022MATH
3.
Zurück zum Zitat Blei DM, Lafferty JD (2006) Dynamic topic models. Proceedings of the ICML, pp 113–120 Blei DM, Lafferty JD (2006) Dynamic topic models. Proceedings of the ICML, pp 113–120
4.
Zurück zum Zitat Canini KR, Shi L, Griffiths TL (2009) Online inference of topics with latent dirichlet allocation. J Mach Learn Res Proc Track 5:65–72 Canini KR, Shi L, Griffiths TL (2009) Online inference of topics with latent dirichlet allocation. J Mach Learn Res Proc Track 5:65–72
5.
Zurück zum Zitat Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman, R: Indexing by latent semantic analysis. J Am Soc Inform Sci 41(6):391–407 (1990) Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman, R: Indexing by latent semantic analysis. J Am Soc Inform Sci 41(6):391–407 (1990)
6.
Zurück zum Zitat Grossman DA, Frieder O 2004 Information retrieval algorithms and heuristics. 2nd edn. Springer, Heidelberg Grossman DA, Frieder O 2004 Information retrieval algorithms and heuristics. 2nd edn. Springer, Heidelberg
7.
Zurück zum Zitat Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the 22nd annual international SIGIR conference on research and development in information retrieval, proceedings ACM SIGIR Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the 22nd annual international SIGIR conference on research and development in information retrieval, proceedings ACM SIGIR
8.
Zurück zum Zitat Hoffman MD, Blei DM, Bach FR (2010) Online learning for latent dirichlet allocation. In: Proceedings of the 24th annual conference on neural information processing systems (NIPS), pp 856–864 Hoffman MD, Blei DM, Bach FR (2010) Online learning for latent dirichlet allocation. In: Proceedings of the 24th annual conference on neural information processing systems (NIPS), pp 856–864
9.
Zurück zum Zitat Iwata T, Yamada T, Sakurai Y, Ueda N (2010) Online multiscale dynamic topic models. In: Proceedings of the ACM SIGKDD, pp 663–672 Iwata T, Yamada T, Sakurai Y, Ueda N (2010) Online multiscale dynamic topic models. In: Proceedings of the ACM SIGKDD, pp 663–672
10.
Zurück zum Zitat Kurohashi S, Nagao M (1994) KN Parser : Japanese dependency/case structure analyzer. In: Proceedings of the workshop on sharable natural language, resources Kurohashi S, Nagao M (1994) KN Parser : Japanese dependency/case structure analyzer. In: Proceedings of the workshop on sharable natural language, resources
11.
Zurück zum Zitat Li W, McCallum A (2006) Pachinko allocation: DAG-structured mixture models of topic correlations. In: Proceedings of the ICML, pp 577–584 Li W, McCallum A (2006) Pachinko allocation: DAG-structured mixture models of topic correlations. In: Proceedings of the ICML, pp 577–584
12.
Zurück zum Zitat Liu X, Croft WB (2004) Cluster-based retrieval using language models. In: Proceedings of the ACM SIGIR, pp 186–193 Liu X, Croft WB (2004) Cluster-based retrieval using language models. In: Proceedings of the ACM SIGIR, pp 186–193
13.
Zurück zum Zitat Manning CD, Schutze H (1999) Foundations of statistical natural language processing. MIT Press, Cambridge Manning CD, Schutze H (1999) Foundations of statistical natural language processing. MIT Press, Cambridge
14.
Zurück zum Zitat Sato I, Kurihara K, Nakagawa H (2010) Deterministic single-pass algorithm for LDA. In: Proceedings 24th annual conference on neural information processing systems (NIPS), pp 2074–2082 Sato I, Kurihara K, Nakagawa H (2010) Deterministic single-pass algorithm for LDA. In: Proceedings 24th annual conference on neural information processing systems (NIPS), pp 2074–2082
15.
Zurück zum Zitat Shinzato K, Kurohashi S (2010) Exploiting term importance categories and dependency relations for natural language search. In: Proceedings of the 2nd workshop on NLPIX 2010, Beijing, pp 2–11 Shinzato K, Kurohashi S (2010) Exploiting term importance categories and dependency relations for natural language search. In: Proceedings of the 2nd workshop on NLPIX 2010, Beijing, pp 2–11
16.
Zurück zum Zitat Shirai M, Miura T (2011) On domain independence of author identification. In: Proceedings of the 12th international conference on intelligent data engineering and automated learning (IDEAL), Norwich Shirai M, Miura T (2011) On domain independence of author identification. In: Proceedings of the 12th international conference on intelligent data engineering and automated learning (IDEAL), Norwich
17.
Zurück zum Zitat Wakabayashi K, Miura T (2012) Forward–backward activation algorithm for Hierarchical Hidden Markov models. In: Proceedings of the 26th annual conference on neural information processing systems (NIPS), Nevada Wakabayashi K, Miura T (2012) Forward–backward activation algorithm for Hierarchical Hidden Markov models. In: Proceedings of the 26th annual conference on neural information processing systems (NIPS), Nevada
18.
Zurück zum Zitat Wei X, Bruce Croft W (2006) LDA-based document models for Ad-Hoc retrieval. In: Proceedings of the ACM SIGIR Wei X, Bruce Croft W (2006) LDA-based document models for Ad-Hoc retrieval. In: Proceedings of the ACM SIGIR
19.
Zurück zum Zitat Yanagisawa T, Miura T (2009) Sentence generation for stream announcement. In: Proceedings of the IEEE international pacific rim conference on communications, computers and signal processing (PACRIM) Yanagisawa T, Miura T (2009) Sentence generation for stream announcement. In: Proceedings of the IEEE international pacific rim conference on communications, computers and signal processing (PACRIM)
20.
Zurück zum Zitat Yanagisawa T, Miura T, Shioya I (2010) Simplifying sentences by frequent parsing patterns. In: Proceedings of the 11th international conference on intelligent data engineering and automated learning (IDEAL) Yanagisawa T, Miura T, Shioya I (2010) Simplifying sentences by frequent parsing patterns. In: Proceedings of the 11th international conference on intelligent data engineering and automated learning (IDEAL)
21.
Zurück zum Zitat Yi X, Allan J (2009) A comparative study of utilizing topic models for information retrieval. In: Proceedings of the 31th European conference on IR research on advances in, information retrieval, pp 29–41 Yi X, Allan J (2009) A comparative study of utilizing topic models for information retrieval. In: Proceedings of the 31th European conference on IR research on advances in, information retrieval, pp 29–41
22.
Zurück zum Zitat Zhai C, Lafferty J (2001) A study of smoothing methods for language models applied to ad-hoc information retrieval. In: Proceedings of the ACM SIGIR, pp 334–342 Zhai C, Lafferty J (2001) A study of smoothing methods for language models applied to ad-hoc information retrieval. In: Proceedings of the ACM SIGIR, pp 334–342
Metadaten
Titel
Context-Based Query Using Dependency Structures Based on Latent Topic Model
verfasst von
Masato Shirai
Takashi Yanagisawa
Takao Miura
Publikationsdatum
01.09.2014
Verlag
Springer Berlin Heidelberg
Erschienen in
Journal on Data Semantics / Ausgabe 3/2014
Print ISSN: 1861-2032
Elektronische ISSN: 1861-2040
DOI
https://doi.org/10.1007/s13740-013-0031-3

Weitere Artikel der Ausgabe 3/2014

Journal on Data Semantics 3/2014 Zur Ausgabe