Skip to main content

2022 | OriginalPaper | Buchkapitel

ChouBERT: Pre-training French Language Model for Crowdsensing with Tweets in Phytosanitary Context

verfasst von : Shufan Jiang, Rafael Angarita, Stéphane Cormier, Julien Orensanz, Francis Rousseaux

Erschienen in: Research Challenges in Information Science

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

To fulfil the increasing need for food of the growing population and face climate change, modern technologies have been applied to improve different farming processes. One important application scenario is to detect and measure natural hazards using sensors and data analysis techniques. Crowdsensing is a sensing paradigm that empowers ordinary people to contribute with data their sensor-enhanced mobile devices gather or generate. In this paper, we propose to use Twitter as an open crowdsensing platform for acquiring farmers knowledge. We proved this concept by applying pre-trained language models to detect individual’s observation from tweets for pest monitoring.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Boubiche, D.E., et al.: Mobile crowd sensing-taxonomy, applications, challenges, and solutions. Comput. Hum. Behav. 101, 352–370 (2019)CrossRef Boubiche, D.E., et al.: Mobile crowd sensing-taxonomy, applications, challenges, and solutions. Comput. Hum. Behav. 101, 352–370 (2019)CrossRef
2.
Zurück zum Zitat Jacob, D., et al.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1. pp. 4171–4186. ACL, June 2019 Jacob, D., et al.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1. pp. 4171–4186. ACL, June 2019
3.
Zurück zum Zitat Klerkx, L., Jakku, E., Labarthe, P.: A review of social science on digital agriculture, smart farming and agriculture 4.0: new contributions and a future research agenda. NJAS Wageningen J. Life Sci. 90–91, 100315 (2019) Klerkx, L., Jakku, E., Labarthe, P.: A review of social science on digital agriculture, smart farming and agriculture 4.0: new contributions and a future research agenda. NJAS Wageningen J. Life Sci. 90–91, 100315 (2019)
4.
Zurück zum Zitat Laifa, A., Gautier, L., Cruz, C.: Impact of textual data augmentation on linguistic pattern extraction to improve the idiomaticity of extractive summaries. In: Golfarelli, M., Wrembel, R., Kotsis, G., Tjoa, A.M., Khalil, I. (eds.) DaWaK 2021. LNCS, vol. 12925, pp. 143–151. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-86534-4_13CrossRef Laifa, A., Gautier, L., Cruz, C.: Impact of textual data augmentation on linguistic pattern extraction to improve the idiomaticity of extractive summaries. In: Golfarelli, M., Wrembel, R., Kotsis, G., Tjoa, A.M., Khalil, I. (eds.) DaWaK 2021. LNCS, vol. 12925, pp. 143–151. Springer, Cham (2021). https://​doi.​org/​10.​1007/​978-3-030-86534-4_​13CrossRef
5.
Zurück zum Zitat Le, H., et al.: FlauBERT: unsupervised language model pre-training for French. In: Proceedings of The 12th Language Resources and Evaluation Conference, LREC, pp. 2479–2490. European Language Resources Association, Marseille (2020) Le, H., et al.: FlauBERT: unsupervised language model pre-training for French. In: Proceedings of The 12th Language Resources and Evaluation Conference, LREC, pp. 2479–2490. European Language Resources Association, Marseille (2020)
6.
Zurück zum Zitat Martin, L., et al.: CamemBERT: a tasty French language model. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7203–7219. Association for Computational Linguistics, Online (2020) Martin, L., et al.: CamemBERT: a tasty French language model. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7203–7219. Association for Computational Linguistics, Online (2020)
7.
Zurück zum Zitat Mendes, J., et al.: Smartphone applications targeting precision agriculture practices-a systematic review. Agronomy 10(6), 855 (2020)CrossRef Mendes, J., et al.: Smartphone applications targeting precision agriculture practices-a systematic review. Agronomy 10(6), 855 (2020)CrossRef
8.
Zurück zum Zitat Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH
10.
Zurück zum Zitat Shankar, P., Bitter, C., Liwicki, M.: Digital crop health monitoring by analyzing social media streams. In: 2020 IEEE/ITU International Conference on Artificial Intelligence for Good (AI4G), Geneva, Switzerland, pp. 87–94. IEEE (2020) Shankar, P., Bitter, C., Liwicki, M.: Digital crop health monitoring by analyzing social media streams. In: 2020 IEEE/ITU International Conference on Artificial Intelligence for Good (AI4G), Geneva, Switzerland, pp. 87–94. IEEE (2020)
11.
Zurück zum Zitat Stella, D., et al.: JuriBERT: a masked-language model adaptation for French legal text. In: Proceedings of the Natural Legal Language Processing Workshop, pp. 95–101. Association for Computational Linguistics, November 2021 Stella, D., et al.: JuriBERT: a masked-language model adaptation for French legal text. In: Proceedings of the Natural Legal Language Processing Workshop, pp. 95–101. Association for Computational Linguistics, November 2021
12.
Zurück zum Zitat Thollet, B.: MEDIAS SOCIAUX EN AGRICULTURE: Contribution à l’analyse des usages et de leur potentiel d’apprentissage pour la transition agroécologique. Master’s thesis, AgroSup Dijon, Dijon, France, August 2020 Thollet, B.: MEDIAS SOCIAUX EN AGRICULTURE: Contribution à l’analyse des usages et de leur potentiel d’apprentissage pour la transition agroécologique. Master’s thesis, AgroSup Dijon, Dijon, France, August 2020
15.
Zurück zum Zitat Turenne, N., et al.: Open data platform for knowledge access in plant health domain: VESPA mining. CoRR abs/1504.06077 (2015) Turenne, N., et al.: Open data platform for knowledge access in plant health domain: VESPA mining. CoRR abs/1504.06077 (2015)
16.
Zurück zum Zitat Welvaert, M., et al.: Limits of use of social media for monitoring biosecurity events. PLOS ONE 12(2), e0172457 (2017) Welvaert, M., et al.: Limits of use of social media for monitoring biosecurity events. PLOS ONE 12(2), e0172457 (2017)
17.
Zurück zum Zitat Wolf, T., et al.: Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 38–45. Association for Computational Linguistics, Online, October 2020 Wolf, T., et al.: Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 38–45. Association for Computational Linguistics, Online, October 2020
Metadaten
Titel
ChouBERT: Pre-training French Language Model for Crowdsensing with Tweets in Phytosanitary Context
verfasst von
Shufan Jiang
Rafael Angarita
Stéphane Cormier
Julien Orensanz
Francis Rousseaux
Copyright-Jahr
2022
DOI
https://doi.org/10.1007/978-3-031-05760-1_40