Persian Printed Document Analysis and Page Segmentation
عنوان مقاله: Persian Printed Document Analysis and Page Segmentation
شناسه ملی مقاله: JR_JCR-1-1_006
منتشر شده در شماره 1 دوره 1 فصل Winter and Spring در سال 1386
شناسه ملی مقاله: JR_JCR-1-1_006
منتشر شده در شماره 1 دوره 1 فصل Winter and Spring در سال 1386
مشخصات نویسندگان مقاله:
Ali Broumandnia - Department of Computer & IT, Islamic Azad University-South Tehran Branch, Tehran, Iran
Jamshid Shanbehzadeh - Department of Computer, Tarbiat Moalem University, Iran
خلاصه مقاله:
Ali Broumandnia - Department of Computer & IT, Islamic Azad University-South Tehran Branch, Tehran, Iran
Jamshid Shanbehzadeh - Department of Computer, Tarbiat Moalem University, Iran
This paper presents, a hybrid method, low-resolution and high-resolution, for Persian page segmentation. In the low-resolution page segmentation, a pyramidal image structure is constructed for multiscale analysis and segments document image to a set of regions. By high-resolution page segmentation, by connected components analysis, each region is segmented to homogeneous regions and identifying them as texts, images, and tables/drawings. The proposed method was experiment with the Persian documents. The result of these tests have shown that the proposed method provide more accurate and speed results.
کلمات کلیدی: Page segmentation, pyramidal image structure, connected components, horizontal and vertical merging
صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/682903/