AN INTEGRATED SYSTEM FOR HANDWRITTEN DOCUMENT IMAGE PROCESSING
Abstract
In this paper we attempt to face common problems of handwritten documents such as nonparallel text lines in a page, hill and dale writing, slanted and connected characters. Towards this end an integrated system for document image preprocessing is presented. This system consists of the following modules: skew angle estimation and correction, line and word segmentation, slope and slant correction. The skew angle correction, slope correction and slant removing algorithms are based on a novel method that is a combination of the projection profile technique and the Wigner–Ville distribution. Furthermore, the skew angle correction algorithm can cope with pages whose text line skew angles vary, and handle them by areas. Our system can be used as a preprocessing stage to any handwriting character recognition or segmentation system as well as to any writer identification system. It was tested in a wide variety of handwritten document images of unconstrained English and Modern Greek text from about 100 writers. Additionally, combinations of the above algorithms have been used in the framework of the ACCeSS system (European project LE-1 1802, aiming at the automatic processing of application forms of insurance companies) as well as in the processing of GRUHD and IAM-B databases for automating the procedure of extracting data.
References
A. Amiri , OSCAR: a visual programming toolkit for offline handwritten forms recognition, Proc. IWFHR4, 4th Int. Workshop on Frontiers in Handwriting Recognition (1994) pp. 441–448. Google ScholarA. Bagdanov and S. Kanai , Evaluation of document image skew techniques,Proc. SPIE (1996) pp. 343–353. Google ScholarB. Boashash , B. Lovell and L. White , Time frequency analysis and pattern recognition using singular value decomposition of the Wigner–Ville distribution, Advanced Algorithms and Architecture for Signal Processing,Proc. SPIE 828 (1987) pp. 104–114. Google ScholarP. Boles and B. Boashash , The cross Wigner–Ville distribution — a two dimensional analysis method for the processing of vibrosis seismic signals, Proc. IEEE ICASP'87 (1988) pp. 904–907. Google Scholar- IEEE Trans. PAMI 11(1), 68 (1989). Crossref, Google Scholar
M. Y. Chen , Off-line handwritten word recognition using HMM, 5th Adv. Technol. Conf. (1992) pp. 563–587. Google ScholarW. Chin , A. Harvey and A. Jennings , Skew detection in handwritten scripts, Proc. IEEE Speech and Image Technologies for Computing and Telecommunications (1997) pp. 319–322. Google Scholar- Phillips J. Res. 35(1–3), 217 (1980). Google Scholar
G. Cristobal , J. Bescos and J. Santamaria , Application of Wigner distribution for image representation and analysis, Proc IEEE 8th Int. Conf. Patt. Recogn. (1986) pp. 998–1000. Google Scholar- Patt. Recogn. 30(9), 1505 (1997), DOI: 10.1016/S0031-3203(96)00157-4. Crossref, Web of Science, Google Scholar
- Patt. Recogn. Lett. 18, 675 (1997), DOI: 10.1016/S0167-8655(97)00032-9. Crossref, Web of Science, Google Scholar
- Patt. Recogn. Lett. 20(11–13), 1305 (1999). Crossref, Web of Science, Google Scholar
E. Kavallieratou , The GRUHD database of modern Greek unconstrained handwriting, LREC20003 (1999) pp. 1755–1759. Google Scholar- J. Elec. Electron. Eng. 152 (1998). Google Scholar
- Patt. Recogn. 27(10), 1325 (1994), DOI: 10.1016/0031-3203(94)90068-X. Crossref, Web of Science, Google Scholar
J. Liu , C. Lee and R. Shu , An efficient method for the skew normalization of a document image, Proc. 12th Int. Conf. Pattern Recognition (1992) pp. 122–125. Google ScholarU. Marti and H. Bunke , A full English sentence database for off-line handwriting recognition, Proc. 5th Int. Conf. Document Analysis and Recognition, ICDAR'99 (1999) pp. 705–708. Google Scholar- IEEE Trans. PAMI 18(5), 548 (1996). Crossref, Google Scholar
- IEEE Trans. Patt. Anal. Mach. Intell. 15(11), 1162 (1993), DOI: 10.1109/34.244677. Crossref, Web of Science, Google Scholar
T. Pavlidis and J. Zhou , Page segmentation by white streams, Proc. 1st Int. Conf. Document Analysis and Recognition (ICDAR) (International Association of Pattern Recognition, 1991) pp. 945–953. Google ScholarG. S. Peake and T. N. Tan , A general algorithm for document skew angle estimation, IEEE Int. Conf. Image Process.2 (1997) pp. 230–233. Google ScholarH. Penz , Fast real-time recognition and quality inspection of printed characters via point-correlation,Proc. SPIE 4303 (2001) pp. 127–137. Google Scholar- IEEE Trans. Patt. Anal. Mach. Intell. 20(3), 309 (1998), DOI: 10.1109/34.667887. Crossref, Web of Science, Google Scholar
M. Shridar and F. Kimura , Handwritten address interpretation using word recognition with and without lexicon, Proc. IEEE Int. Conf. Systems, Man and Cybernetics3 (1995) pp. 2341–2346. Google Scholar- CVIU 70(3), 321 (1998). Web of Science, Google Scholar
- Cable et Trasmission A 2, 61 (1948). Google Scholar
- Patt. Recogn. 30(3), 503 (1997), DOI: 10.1016/S0031-3203(96)00081-7. Crossref, Web of Science, Google Scholar
K. B. Yu and S. Cheng , Signal synthesis from Wigner distribution, Proc. IEEE ICASSP '85 (1985) pp. 1037–1040. Google Scholar- Patt. Recogn. 29(10), 1599 (1996), DOI: 10.1016/0031-3203(96)00020-9. Crossref, Web of Science, Google Scholar
- IEEE Trans. Patt. Anal. Mach. Intell. 18(11), 1127 (1996). Web of Science, Google Scholar