Skip to main content
Top

RASAM – A Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi

  • 2021
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The chapter presents RASAM, a specialized dataset for the recognition and analysis of Arabic Maghrebi scripts, which have been underrepresented in digital humanities. The dataset, comprising 300 annotated images from three manuscripts, focuses on the unique characteristics of Maghrebi scripts, such as their rounded shapes and specific writing tools. The dataset aims to foster the development of robust handwritten text recognition (HTR) systems for these scripts, addressing the challenges posed by their cursive and right-to-left (RTL) nature. The chapter also discusses the collaborative hackathon that created the dataset, the annotation process, and the evaluation of HTR models, demonstrating the effectiveness of fine-tuning and transfer learning for under-resourced languages.
This work was carried out with the financial support of the French Ministry of Higher Education, Research and Innovation. It is in line with the scientific focus on digital humanities defined by the Research Consortium Middle-East and Muslim Worlds (GIS MOMM). We would also like to thank all the transcribers and people who took part in the hackathon and ensured its successful completion.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
RASAM – A Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi
Authors
Chahan Vidal-Gorène
Noëmie Lucas
Clément Salah
Aliénor Decours-Perez
Boris Dupin
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-86198-8_19
This content is only visible if you are logged in and have the appropriate permissions.
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH, Ferrari electronic AG/© Ferrari electronic AG