Skip to main content

2015 | OriginalPaper | Buchkapitel

A Signature Approach to Patent Classification

verfasst von : Dilesha Seneviratne, Shlomo Geva, Guido Zuccon, Gabriela Ferraro, Timothy Chappell, Magali Meireles

Erschienen in: Information Retrieval Technology

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We propose a document signature approach to patent classification. Automatic patent classification is a challenging task because of the fast growing number of patent applications filed every year and the complexity, size and nested hierarchical structure of patent taxonomies. In our proposal, the classification of a target patent is achieved through a k-nearest neighbour search using Hamming distance on signatures generated from patents; the classification labels of the retrieved patents are weighted and combined to produce a patent classification code for the target patent. The use of this method is motivated by the fact that intuitively document signatures are more efficient than previous approaches for this task that considered the training of classifiers on the whole vocabulary feature set. Our empirical experiments also demonstrate that the combination of document signatures and k-nearest neighbours search improves classification effectiveness, provided that enough data is used to generate signatures.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Previous work has also used the first 300 words extracted from each patent: this setting has in fact shown strong promise [7].
 
2
No publicly available implementation of Fall et al.’s methods was available and our re-implementation did not lead to effectiveness comparable to the reported one. We were therefore unable to obtain efficiency figures for the benchmark methods. Similarly, we were unable to test for significant differences.
 
Literatur
1.
Zurück zum Zitat Chappell, T., Geva, S., Zuccon, G.: Approximate nearest-neighbour search with inverted signature slice lists. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 147–158. Springer, Heidelberg (2015) Chappell, T., Geva, S., Zuccon, G.: Approximate nearest-neighbour search with inverted signature slice lists. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 147–158. Springer, Heidelberg (2015)
2.
Zurück zum Zitat Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: Proceedings of CIKM 2004, pp. 78–87 (2004) Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: Proceedings of CIKM 2004, pp. 78–87 (2004)
3.
Zurück zum Zitat Chakrabarti, S., Dom, B., Agrawal, R., Raghavan, P.: Using taxonomy, discriminants, and signatures for navigating in text databases. VLDB 97, 446–455 (1997) Chakrabarti, S., Dom, B., Agrawal, R., Raghavan, P.: Using taxonomy, discriminants, and signatures for navigating in text databases. VLDB 97, 446–455 (1997)
4.
Zurück zum Zitat Chakrabarti, S., Dom, B., Agrawal, R., Raghavan, P.: Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies. VLDB J. 7(3), 163–178 (1998)CrossRef Chakrabarti, S., Dom, B., Agrawal, R., Raghavan, P.: Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies. VLDB J. 7(3), 163–178 (1998)CrossRef
5.
Zurück zum Zitat Chen, Y.-L., Chang, Y.-C.: A three-phase method for patent classification. Inf. Process. Manage. 48(6), 1017–1030 (2012)CrossRefMathSciNet Chen, Y.-L., Chang, Y.-C.: A three-phase method for patent classification. Inf. Process. Manage. 48(6), 1017–1030 (2012)CrossRefMathSciNet
6.
Zurück zum Zitat De Vries, C.M., Geva, S.: Pairwise similarity of topsig document signatures. In: Proceedings of ADCS 2012, pp. 128–134 (2012) De Vries, C.M., Geva, S.: Pairwise similarity of topsig document signatures. In: Proceedings of ADCS 2012, pp. 128–134 (2012)
7.
Zurück zum Zitat Fall, C.J., Törcsvári, A., Benzineb, K., Karetka, G.: Automated categorization in the international patent classification. SIGIR Forum 37(1), 10–25 (2003)CrossRef Fall, C.J., Törcsvári, A., Benzineb, K., Karetka, G.: Automated categorization in the international patent classification. SIGIR Forum 37(1), 10–25 (2003)CrossRef
8.
Zurück zum Zitat Faloutsos, C.: Signature-based text retrieval methods: a survey. Data Eng. 13(1), 25–32 (1990) Faloutsos, C.: Signature-based text retrieval methods: a survey. Data Eng. 13(1), 25–32 (1990)
9.
Zurück zum Zitat Geva, S., De Vries, C.M.: TopSig: topology preserving document signatures. In: Proceedings of CIKM 2011, pp. 333–338 (2011) Geva, S., De Vries, C.M.: TopSig: topology preserving document signatures. In: Proceedings of CIKM 2011, pp. 333–338 (2011)
10.
Zurück zum Zitat Kim, J.-H., Choi, K.-S.: Patent document categorization based on semantic structural information. Inf. Process. Manage. 43(5), 1200–1215 (2007). Patent ProcessingCrossRef Kim, J.-H., Choi, K.-S.: Patent document categorization based on semantic structural information. Inf. Process. Manage. 43(5), 1200–1215 (2007). Patent ProcessingCrossRef
11.
Zurück zum Zitat Larkey, L.S.: A patent search and classification system. In: Proceedings of DL 1999, pp. 179–187 (1999) Larkey, L.S.: A patent search and classification system. In: Proceedings of DL 1999, pp. 179–187 (1999)
12.
Zurück zum Zitat Tikk, D.: A hierarchical online classifier for patent categorization, pp. 244–267 (2007) Tikk, D.: A hierarchical online classifier for patent categorization, pp. 244–267 (2007)
Metadaten
Titel
A Signature Approach to Patent Classification
verfasst von
Dilesha Seneviratne
Shlomo Geva
Guido Zuccon
Gabriela Ferraro
Timothy Chappell
Magali Meireles
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-28940-3_35

Neuer Inhalt