Published in: International Journal on Digital Libraries 3-4/2014

1 August 2014

Word occurrence based extraction of work contributors from statements of responsibility

Written by: Nuno Freire

Abstract

This paper addresses the identification of all contributors to an intellectual work when they are recorded in bibliographic data but in unstructured form. National bibliographies represent the first author of a work very reliably; however, secondary contributors are frequently represented only in the statements of responsibility that the cataloguer transcribes from the book into the bibliographic record. Identifying the work contributors mentioned in statements of responsibility is a typical use case for information extraction techniques. This paper presents an approach developed for the specific application scenario of the ARROW rights infrastructure, which is being deployed in several European countries to assist in determining the copyright status of works that may not be in the public domain. The approach was evaluated on the catalogues of nine European national libraries from countries available in the ARROW rights infrastructure, covering eight different languages. The evaluation showed that it performs reliably across languages and bibliographic datasets, achieving an overall precision of 98.7 % and a recall of 96.7 %.

Metadata
Title
Word occurrence based extraction of work contributors from statements of responsibility
Written by
Nuno Freire
Publication date
1 August 2014
Publisher
Springer Berlin Heidelberg
Published in
International Journal on Digital Libraries / Issue 3-4/2014
Print ISSN: 1432-5012
Electronic ISSN: 1432-1300
DOI
https://doi.org/10.1007/s00799-014-0113-3
