Skip to main content
Top

2019 | OriginalPaper | Chapter

A Self-organizing Feature Map for Arabic Word Extraction

Authors : Hassina Bouressace, János Csirik

Published in: Text, Speech, and Dialogue

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Arabic word spotting is a key step for Arabic NLP and the text recognition task. Many recent studies have addressed segmentation problems in the Arabic language. However, many issues still have to be overcome. In this paper, we propose a new approach for segmenting an image Arabic text into its constituent words. Our approach consists of two main steps. In the first step, a set of features is extracted from connected components using the Run-length smoothing algorithm (RLSA). In the second step, spatially close connected components that are likely to belong to the same word component are grouped together. This is done via a learning technique called the self-organizing feature map (Kohonen map). We evaluated our approach on 300 images with different sizes and fonts for handwritten text using AHDB. Our results suggest that our approach can efficiently segments lines. Moreover, as our approach is based on a straightforward machine learning model, it should be possible to adapt it to other languages as well.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Business + Economics & Engineering + Technology"

Online-Abonnement

Springer Professional "Business + Economics & Engineering + Technology" gives you access to:

  • more than 102.000 books
  • more than 537 journals

from the following subject areas:

  • Automotive
  • Construction + Real Estate
  • Business IT + Informatics
  • Electrical Engineering + Electronics
  • Energy + Sustainability
  • Finance + Banking
  • Management + Leadership
  • Marketing + Sales
  • Mechanical Engineering + Materials
  • Insurance + Risk


Secure your knowledge advantage now!

Springer Professional "Engineering + Technology"

Online-Abonnement

Springer Professional "Engineering + Technology" gives you access to:

  • more than 67.000 books
  • more than 390 journals

from the following specialised fileds:

  • Automotive
  • Business IT + Informatics
  • Construction + Real Estate
  • Electrical Engineering + Electronics
  • Energy + Sustainability
  • Mechanical Engineering + Materials





 

Secure your knowledge advantage now!

Springer Professional "Business + Economics"

Online-Abonnement

Springer Professional "Business + Economics" gives you access to:

  • more than 67.000 books
  • more than 340 journals

from the following specialised fileds:

  • Construction + Real Estate
  • Business IT + Informatics
  • Finance + Banking
  • Management + Leadership
  • Marketing + Sales
  • Insurance + Risk



Secure your knowledge advantage now!

Literature
This content is only visible if you are logged in and have the appropriate permissions.
Metadata
Title
A Self-organizing Feature Map for Arabic Word Extraction
Authors
Hassina Bouressace
János Csirik
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-27947-9_11

Premium Partner