
2017 | OriginalPaper | Book chapter

Keyword Based Identification of Thrust Area Using MapReduce for Knowledge Discovery

Authors: Nirmal Kaur, Manmohan Sharma

Published in: Advanced Informatics for Computing Research

Publisher: Springer Singapore


Abstract

Keyword-based identification is used in many applications, such as Web pages, query processing, and search interfaces, where data mining algorithms make it possible to work effectively and efficiently on large datasets. Keywords are the most important terms in documents or text fields, and they yield the knowledge needed to meet a discovery goal. The goal of this paper is to identify, through the proposed interface, the Thrust Area for a searched keyword in the field of computer science. The paper uses the MapReduce framework, with some modifications, to search for the keyword in a database and identify the Thrust Area. The proposed interface is mapped onto the processed query, and the relevant information is extracted from the given datasets. MapReduce can perform keyword operations on large datasets, such as sorting and frequency counting, with high efficiency. Experiments were also carried out to analyze performance on various parameters, such as the time taken by each input source to form clusters and identify Thrust Areas.
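
The keyword counting that the abstract attributes to MapReduce can be illustrated with a small in-memory sketch. It is not the authors' implementation: the keyword-to-area table `THRUST_AREAS`, the function names, and the sample documents are all hypothetical, and the map, shuffle, and reduce phases run in one process rather than on a cluster.

```python
from collections import defaultdict
from itertools import chain

# Hypothetical lookup table (keyword -> thrust area); not taken from the paper.
THRUST_AREAS = {
    "clustering": "Data Mining",
    "classification": "Data Mining",
    "routing": "Computer Networks",
    "encryption": "Information Security",
}


def map_phase(document):
    """Emit (thrust_area, 1) for each known keyword found in one document."""
    for token in document.lower().split():
        word = token.strip(".,;:()\"'")
        area = THRUST_AREAS.get(word)
        if area is not None:
            yield area, 1


def shuffle(pairs):
    """Group emitted values by key, as a framework would between the phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the counts collected for each thrust area."""
    return {area: sum(values) for area, values in groups.items()}


def identify_thrust_area(documents):
    """Return (top_area, counts); top_area is None when no keyword matches."""
    pairs = chain.from_iterable(map_phase(doc) for doc in documents)
    counts = reduce_phase(shuffle(pairs))
    if not counts:
        return None, counts
    # Iterate in alphabetical order so ties are resolved deterministically.
    top = max(sorted(counts), key=counts.get)
    return top, counts


# Made-up sample input for illustration only.
docs = [
    "Clustering and classification of web pages.",
    "Secure routing with encryption.",
    "Hierarchical clustering for large datasets.",
]
top_area, area_counts = identify_thrust_area(docs)
```

With these sample documents, `top_area` is `"Data Mining"` because three of the five matched keywords map to that area. In a distributed setting the same map and reduce functions would run on separate workers, with the framework performing the grouping step.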

Metadata
Title
Keyword Based Identification of Thrust Area Using MapReduce for Knowledge Discovery
Authors
Nirmal Kaur
Manmohan Sharma
Copyright year
2017
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-5780-9_5