Skip to main content

2018 | OriginalPaper | Buchkapitel

Adapting TimeML to Basque: Event Annotation

verfasst von : Begoña Altuna, María Jesús Aranzabe, Arantza Díaz de Ilarraza

Erschienen in: Computational Linguistics and Intelligent Text Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper we present an event annotation effort following EusTimeML, a temporal mark-up language for Basque based on TimeML. For this, we first describe events and their main ontological and grammatical features. We base our analysis on Basque grammars and TimeML mark-up language classification of events. Annotation guidelines have been created to address the event information annotation for Basque and an annotation experiment has been conducted. A first round has served to evaluate the preliminary guidelines and decisions on event annotation have been taken according to annotations and inter-annotator agreement results. Then a guideline tuning period has followed. In the second round, we have created a manually-annotated gold standard corpus for event annotation in Basque. Event analysis and annotation experiment are part of a complete temporal information analysis and corpus creation work.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The amount of events varies among the annotations.
 
Literatur
1.
Zurück zum Zitat Minard, A.-L., Speranza, M., Agirre, E., Aldabe, I., van Erp, M., Magnini, B., Rigau, G., Urizar, R.: SemEval-2015 Task 4: TimeLine: cross-document event ordering. In: Proceedings of 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, pp. 778–786. Association for Computational Linguistics, June 2015 Minard, A.-L., Speranza, M., Agirre, E., Aldabe, I., van Erp, M., Magnini, B., Rigau, G., Urizar, R.: SemEval-2015 Task 4: TimeLine: cross-document event ordering. In: Proceedings of 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, pp. 778–786. Association for Computational Linguistics, June 2015
2.
Zurück zum Zitat Pustejovsky, J., Castaño, J.M., Ingria, R., Saurí, R., Gaizauskas, R.J., Setzer, A., Katz, G., Radev, D.R.: TimeML: robust specification of event and temporal expressions in text. N. Dir. Quest. Answ. 3, 28–34 (2003) Pustejovsky, J., Castaño, J.M., Ingria, R., Saurí, R., Gaizauskas, R.J., Setzer, A., Katz, G., Radev, D.R.: TimeML: robust specification of event and temporal expressions in text. N. Dir. Quest. Answ. 3, 28–34 (2003)
5.
Zurück zum Zitat Llorens, H., Saquete, E., Navarro, B.: TIPSem (English and Spanish): evaluating CRFs and semantic roles in TempEval-2. In: Proceedings of 5th International Workshop on Semantic Evaluation, SemEval 2010, Stroudsburg, PA, USA, pp. 284–291. Association for Computational Linguistics (2010). http://dl.acm.org/citation.cfm?id=1859664.1859727 Llorens, H., Saquete, E., Navarro, B.: TIPSem (English and Spanish): evaluating CRFs and semantic roles in TempEval-2. In: Proceedings of 5th International Workshop on Semantic Evaluation, SemEval 2010, Stroudsburg, PA, USA, pp. 284–291. Association for Computational Linguistics (2010). http://​dl.​acm.​org/​citation.​cfm?​id=​1859664.​1859727
6.
Zurück zum Zitat Radinsky, K., Horvitz, E.: Mining the web to predict future events. In: Proceedings of 6th ACM International Conference on Web Search and Data Mining, pp. 255–264. ACM (2013) Radinsky, K., Horvitz, E.: Mining the web to predict future events. In: Proceedings of 6th ACM International Conference on Web Search and Data Mining, pp. 255–264. ACM (2013)
7.
Zurück zum Zitat Bauer, S., Clark, S., Graepel, T.: Learning to identify historical figures for timeline creation from Wikipedia articles. In: Proceedings of HistoInformatics 2014 - 2nd International Workshop on Computational History, Barcelona, Spain (2014) Bauer, S., Clark, S., Graepel, T.: Learning to identify historical figures for timeline creation from Wikipedia articles. In: Proceedings of HistoInformatics 2014 - 2nd International Workshop on Computational History, Barcelona, Spain (2014)
8.
Zurück zum Zitat TimeML Working Group: TimeML Annotation Guidelines Version 1.3., Manuscript. Technical report, Brandeis University (2010) TimeML Working Group: TimeML Annotation Guidelines Version 1.3., Manuscript. Technical report, Brandeis University (2010)
10.
Zurück zum Zitat Saurı, R., Batiukova, O., Pustejovsky, J.: Annotating events in Spanish. TimeML annotation guidelines. Technical report, Version TempEval-2010, Barcelona Media-Innovation Center (2009) Saurı, R., Batiukova, O., Pustejovsky, J.: Annotating events in Spanish. TimeML annotation guidelines. Technical report, Version TempEval-2010, Barcelona Media-Innovation Center (2009)
11.
Zurück zum Zitat Altuna, P., Salaburu, P., Goenaga, P., Lasarte, M.P., Akesolo, L., Azkarate, M., Charriton, P., Eguskitza, A., Haritschelhar, J., King, A., Larrarte, J.M., Mujika, J.A., Oyharabal, B., Rotaetxe, K.: Euskal Gramatika Lehen Urratsak (EGLU) I. Euskaltzaindiko Gramatika Batzordea, Euskaltzaindia, Bilbao (1985) Altuna, P., Salaburu, P., Goenaga, P., Lasarte, M.P., Akesolo, L., Azkarate, M., Charriton, P., Eguskitza, A., Haritschelhar, J., King, A., Larrarte, J.M., Mujika, J.A., Oyharabal, B., Rotaetxe, K.: Euskal Gramatika Lehen Urratsak (EGLU) I. Euskaltzaindiko Gramatika Batzordea, Euskaltzaindia, Bilbao (1985)
12.
Zurück zum Zitat Altuna, P., Salaburu, P., Goenaga, P., Lasarte, M.P., Akesolo, L., Azkarate, M., Charriton, P., Eguskitza, A., Haritschelhar, J., King, A., Larrarte, J.M., Mujika, J.A., Oyharabal, B., Rotaetxe, K.: Euskal Gramatika Lehen Urratsak (EGLU) II. Euskaltzaindiko Gramatika Batzordea, Euskaltzaindia, Bilbao (1987) Altuna, P., Salaburu, P., Goenaga, P., Lasarte, M.P., Akesolo, L., Azkarate, M., Charriton, P., Eguskitza, A., Haritschelhar, J., King, A., Larrarte, J.M., Mujika, J.A., Oyharabal, B., Rotaetxe, K.: Euskal Gramatika Lehen Urratsak (EGLU) II. Euskaltzaindiko Gramatika Batzordea, Euskaltzaindia, Bilbao (1987)
13.
Zurück zum Zitat Hualde, J.I., de Urbina, J.O.: A Grammar of Basque, vol. 26. Walter de Gruyter, Boston (2003)CrossRef Hualde, J.I., de Urbina, J.O.: A Grammar of Basque, vol. 26. Walter de Gruyter, Boston (2003)CrossRef
14.
Zurück zum Zitat Jȩdrzejko, E.: The problematics of describing periphrastic predication between word and image. Stud. Pol. Linguist. 6(1), 27–44 (2011) Jȩdrzejko, E.: The problematics of describing periphrastic predication between word and image. Stud. Pol. Linguist. 6(1), 27–44 (2011)
15.
Zurück zum Zitat Caselli, T., Lenzi, V.B., Sprugnoli, R., Pianta, E., Prodanof, I.: Annotating events, temporal expressions and relations in Italian: the It-TimeML experience for the Ita-TimeBank. In: Proceedings of 5th Linguistic Annotation Workshop, pp. 143–151. Association for Computational Linguistics, Portland (2011) Caselli, T., Lenzi, V.B., Sprugnoli, R., Pianta, E., Prodanof, I.: Annotating events, temporal expressions and relations in Italian: the It-TimeML experience for the Ita-TimeBank. In: Proceedings of 5th Linguistic Annotation Workshop, pp. 143–151. Association for Computational Linguistics, Portland (2011)
17.
Zurück zum Zitat Lenzi, V.B., Moretti, G., Sprugnoli, R.: CAT: the CELCT annotation tool. In: Calzolari, N., (Conference Chair), Choukri, K., Declerck, T., Doğan, M.U., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S. (eds.) Proceedings of 8th International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey, pp. 333–338. European Language Resources Association (ELRA) (2012) Lenzi, V.B., Moretti, G., Sprugnoli, R.: CAT: the CELCT annotation tool. In: Calzolari, N., (Conference Chair), Choukri, K., Declerck, T., Doğan, M.U., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S. (eds.) Proceedings of 8th International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey, pp. 333–338. European Language Resources Association (ELRA) (2012)
18.
Zurück zum Zitat Dice, L.R.: Measures of the amount of ecologic association between species. Ecology 26, 297–302 (1945)CrossRef Dice, L.R.: Measures of the amount of ecologic association between species. Ecology 26, 297–302 (1945)CrossRef
19.
Zurück zum Zitat Minard, A.-L., Speranza, M., Urizar, R., Altuna, B., van Erp, M., Schoen, A., van Son, C.: MEANTIME, the NewsReader multilingual event and time corpus. In: Proceedings of LREC 2016 (2016) Minard, A.-L., Speranza, M., Urizar, R., Altuna, B., van Erp, M., Schoen, A., van Son, C.: MEANTIME, the NewsReader multilingual event and time corpus. In: Proceedings of LREC 2016 (2016)
20.
Zurück zum Zitat Agerri, R., Agirre, E., Aldabe, I., Altuna, B., Beloki, Z., Laparra, E., de Lacalle, M.L., Rigau, G., Soroa, A., Urizar, R.: NewsReader project. Procesamiento del Lenguaje Natural 53, 155–158 (2014) Agerri, R., Agirre, E., Aldabe, I., Altuna, B., Beloki, Z., Laparra, E., de Lacalle, M.L., Rigau, G., Soroa, A., Urizar, R.: NewsReader project. Procesamiento del Lenguaje Natural 53, 155–158 (2014)
Metadaten
Titel
Adapting TimeML to Basque: Event Annotation
verfasst von
Begoña Altuna
María Jesús Aranzabe
Arantza Díaz de Ilarraza
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-75487-1_43

Premium Partner