Skip to main content
Top

2018 | OriginalPaper | Chapter

70. Efficient Prior-Art Retrieval of Patent Documents Using MapReduce Paradigm

Authors : K. Girthana, S. Swamynathan

Published in: Proceedings of the International Conference on Computing and Communication Systems

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A patent is a legal right given to novel, non-obvious and useful inventions. The prior-art search involves retrieving prior works related to it to avoid duplication of the invention and granting of the patent. Moreover, it analyzes a variety of documents like newspaper articles, proceedings, and journals. The amount of patent document and the volume of filings keep on increasing at an unprecedented rate every year. Processing on this enormous volume of data sequentially is time-consuming. Hence, the proposed Prior-Art Retrieval System (PARS) retrieves only the patent documents through Google patent API, and K-Means clustering was employed in a parallel mode to cluster the documents. Through Relevance Mapping prominent document clusters were identified. The documents within the relevant clusters are ranked based on the citations. The top ranked documents were displayed to the patent analyst.The results show that the processing time with map reduce has reduced significantly and accuracy of clusters was around 50%.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Gaff, Brian M., and Bruce Rubinger.: The significance of prior art. Computer. 8, pp. 9–11 (2014) Gaff, Brian M., and Bruce Rubinger.: The significance of prior art. Computer. 8, pp. 9–11 (2014)
2.
go back to reference Wanagiri, M. Z., Adriani, M.: Prior Art Retrieval Using Various Patent Document Fields Contents. CLEF (Notebook Papers/LABs/Workshops), pp. 1–6, UK (2010) Wanagiri, M. Z., Adriani, M.: Prior Art Retrieval Using Various Patent Document Fields Contents. CLEF (Notebook Papers/LABs/Workshops), pp. 1–6, UK (2010)
3.
go back to reference Xue, X., Croft, W. B.: Automatic query generation for patent search. In: 18th ACM conference on Information and knowledge management, pp. 2037–2040, Germany (2009) Xue, X., Croft, W. B.: Automatic query generation for patent search. In: 18th ACM conference on Information and knowledge management, pp. 2037–2040, Germany (2009)
4.
go back to reference Jun, S., Park, S. S., Jang, D. S. : Document clustering method using dimension reduction and support vector clustering to overcome sparseness. Expert Systems with Applications. 41, 7, 3204–3212 (2014)CrossRef Jun, S., Park, S. S., Jang, D. S. : Document clustering method using dimension reduction and support vector clustering to overcome sparseness. Expert Systems with Applications. 41, 7, 3204–3212 (2014)CrossRef
5.
go back to reference Andrews, N. O., and Fox, E. A.: Recent developments in document clustering. Technical Report TR-07-35 (2007) Andrews, N. O., and Fox, E. A.: Recent developments in document clustering. Technical Report TR-07-35 (2007)
6.
go back to reference Huang, S. H., Ke, H. R., Yang, W. P.: Structure clustering for Chinese patent documents. Expert Systems with Applications. 34, 4, 2290–2297 (2008) Huang, S. H., Ke, H. R., Yang, W. P.: Structure clustering for Chinese patent documents. Expert Systems with Applications. 34, 4, 2290–2297 (2008)
7.
go back to reference Balabantaray, R. C., Sarma, C., Jha, M.: Document Clustering using K-Means and K-Medoids. International Journal of Knowledge Based Computer Systems. 1, 1 (2015) Balabantaray, R. C., Sarma, C., Jha, M.: Document Clustering using K-Means and K-Medoids. International Journal of Knowledge Based Computer Systems. 1, 1 (2015)
8.
go back to reference Bradley, P. S., Fayyad, U. M., Reina, C.: Scaling Clustering algorithms to large databases. In: 4th International Conference on Knowledge Discovery and Data Mining (KDD-98), pp. 9–15. (1998) Bradley, P. S., Fayyad, U. M., Reina, C.: Scaling Clustering algorithms to large databases. In: 4th International Conference on Knowledge Discovery and Data Mining (KDD-98), pp. 9–15. (1998)
9.
go back to reference Kriegel, H. P., Kroger, P., Renz, M., Wurst, S.: A generic framework for efficient subspace clustering of high-dimensional data. In: Proceedings of the 5th IEEE International conference on data mining (ICDM), pp 250–257 (2005) Kriegel, H. P., Kroger, P., Renz, M., Wurst, S.: A generic framework for efficient subspace clustering of high-dimensional data. In: Proceedings of the 5th IEEE International conference on data mining (ICDM), pp 250–257 (2005)
10.
go back to reference Han, J. and Kamber M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, Elsevier (2011)MATH Han, J. and Kamber M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, Elsevier (2011)MATH
11.
go back to reference Ngazimbi, M.: Data Clustering Using MapReduce. In Masters Thesis, Boise State University (2009) Ngazimbi, M.: Data Clustering Using MapReduce. In Masters Thesis, Boise State University (2009)
12.
go back to reference Sun, T., Shu, C., Li, F., Yu, H., Ma, L., Fang, Y.: An Efficient Hierarchical Clustering Method for Large Datasets with Map-Reduce. In Proceedings of the International Conference on Parallel and Distributed Computing, Applications and Technologies, 12, 2, pp. 494–499 (2009) Sun, T., Shu, C., Li, F., Yu, H., Ma, L., Fang, Y.: An Efficient Hierarchical Clustering Method for Large Datasets with Map-Reduce. In Proceedings of the International Conference on Parallel and Distributed Computing, Applications and Technologies, 12, 2, pp. 494–499 (2009)
13.
go back to reference Wang, S., Dutta, H.: PARABLE: A PArallel RAndom-partition Based HierarchicaL ClustEring Algorithm for the MapReduce Framework. In: 6th Annual Machine Learning Symposium at the New York Academy of Science. (2011) Wang, S., Dutta, H.: PARABLE: A PArallel RAndom-partition Based HierarchicaL ClustEring Algorithm for the MapReduce Framework. In: 6th Annual Machine Learning Symposium at the New York Academy of Science. (2011)
14.
go back to reference Zhao, W., Ma, H.,He, Q.: Parallel K-means Clustering Based on MapReduce. In: IEEE International Conference on Cloud Computing, pp. 674–679 (2009) Zhao, W., Ma, H.,He, Q.: Parallel K-means Clustering Based on MapReduce. In: IEEE International Conference on Cloud Computing, pp. 674–679 (2009)
15.
go back to reference Kang, I. S., Na, S. H., Kim, J., Lee, J. H.: Cluster-based patent retrieval. Information processing & management, 43, 5, 1173–1182 (2007) Kang, I. S., Na, S. H., Kim, J., Lee, J. H.: Cluster-based patent retrieval. Information processing & management, 43, 5, 1173–1182 (2007)
16.
go back to reference Aleman-Meza, B., Arpinar, I. B., Nural, M. V., Sheth, A. P.: Ranking documents semantically using ontological relationships. In: 4th International Conference on semantic computing, pp. 299–304, US (2010) Aleman-Meza, B., Arpinar, I. B., Nural, M. V., Sheth, A. P.: Ranking documents semantically using ontological relationships. In: 4th International Conference on semantic computing, pp. 299–304, US (2010)
Metadata
Title
Efficient Prior-Art Retrieval of Patent Documents Using MapReduce Paradigm
Authors
K. Girthana
S. Swamynathan
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-6890-4_70