SEEK: Salford Environment for Expertise and Knowledge

Published Conference Proceedings - Paper
November 2012

A Robust Hybrid Approach for Text Line Segmentation in Historical Documents

Clausner, C & Antonacopoulos, A & Pletschacher, S 2012, A Robust Hybrid Approach for Text Line Segmentation in Historical Documents, in: 'Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012)', IEEE-CS, Los Alamitos, CA, USA. Conference details: 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, November 2012.

Abstract

Large-scale digitisation of historical documents demands robust methods that cope with the presence of frequent distortions and noisy artefacts. This paper presents a hybrid text line segmentation method that uses a novel data structure and a rule base to combine the strengths of top-down and bottom-up approaches while minimising their weaknesses. The effectiveness of the proposed approach has been methodically evaluated in the context of large-scale digitisation using a standardised framework. Results on a diverse dataset show improved performance over top-down and bottom-up approaches as well as over a leading commercially available system.

 

Publication Details

Conference Proceedings
Antonacopoulos, A & Pletschacher, S & Clausner, C eds. 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), IEEE-CS, Los Alamitos, CA, USA.

Conference Details
21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, November 2012.