Next Article in Journal
Partition and Inclusion Hierarchies of Images: A Comprehensive Survey
Next Article in Special Issue
Efficient Query Specific DTW Distance for Document Retrieval with Unlimited Vocabulary
Previous Article in Journal
An Investigation of Smooth TV-Like Regularization in the Context of the Optical Flow Problem
Previous Article in Special Issue
A New Binarization Algorithm for Historical Documents
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames

by
Oussama Zayene
1,2,*,
Sameh Masmoudi Touj
1,
Jean Hennebert
3,
Rolf Ingold
2 and
Najoua Essoukri Ben Amara
1
1
LATIS Lab, National Engineering School of Sousse (Eniso), University of Sousse, Sousse 4054, Tunisia
2
DIVA Group, Department of Informatics, University of Fribourg (Unifr), Fribourg 1700, Switzerland
3
ICoSys Institute, HES-SO, University of Applied Sciences, Fribourg 1705, Switzerland
*
Author to whom correspondence should be addressed.
J. Imaging 2018, 4(2), 32; https://doi.org/10.3390/jimaging4020032
Submission received: 26 November 2017 / Revised: 23 January 2018 / Accepted: 26 January 2018 / Published: 31 January 2018
(This article belongs to the Special Issue Document Image Processing)

Abstract

Recognizing texts in video is more complex than in other environments such as scanned documents. Video texts appear in various colors, unknown fonts and sizes, often affected by compression artifacts and low quality. In contrast to Latin texts, there are no publicly available datasets which cover all aspects of the Arabic Video OCR domain. This paper describes a new well-defined and annotated Arabic-Text-in-Video dataset called AcTiV 2.0. The dataset is dedicated especially to building and evaluating Arabic video text detection and recognition systems. AcTiV 2.0 contains 189 video clips serving as a raw material for creating 4063 key frames for the detection task and 10,415 cropped text images for the recognition task. AcTiV 2.0 is also distributed with its annotation and evaluation tools that are made open-source for standardization and validation purposes. This paper also reports on the evaluation of several systems tested under the proposed detection and recognition protocols.
Keywords: video text detection; video text recognition; AcTiV dataset; Arabic Video OCR video text detection; video text recognition; AcTiV dataset; Arabic Video OCR

Share and Cite

MDPI and ACS Style

Zayene, O.; Masmoudi Touj, S.; Hennebert, J.; Ingold, R.; Essoukri Ben Amara, N. Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames. J. Imaging 2018, 4, 32. https://doi.org/10.3390/jimaging4020032

AMA Style

Zayene O, Masmoudi Touj S, Hennebert J, Ingold R, Essoukri Ben Amara N. Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames. Journal of Imaging. 2018; 4(2):32. https://doi.org/10.3390/jimaging4020032

Chicago/Turabian Style

Zayene, Oussama, Sameh Masmoudi Touj, Jean Hennebert, Rolf Ingold, and Najoua Essoukri Ben Amara. 2018. "Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames" Journal of Imaging 4, no. 2: 32. https://doi.org/10.3390/jimaging4020032

APA Style

Zayene, O., Masmoudi Touj, S., Hennebert, J., Ingold, R., & Essoukri Ben Amara, N. (2018). Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames. Journal of Imaging, 4(2), 32. https://doi.org/10.3390/jimaging4020032

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop