CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Persian Printed Document Analysis and Page Segmentation

عنوان مقاله: Persian Printed Document Analysis and Page Segmentation
شناسه ملی مقاله: JR_JCR-1-1_006
منتشر شده در شماره 1 دوره 1 فصل Winter and Spring در سال 1386
مشخصات نویسندگان مقاله:

Ali Broumandnia - Department of Computer & IT, Islamic Azad University-South Tehran Branch, Tehran, Iran
Jamshid Shanbehzadeh - Department of Computer, Tarbiat Moalem University, Iran

خلاصه مقاله:
This paper presents, a hybrid method, low-resolution and high-resolution, for Persian page segmentation. In the low-resolution page segmentation, a pyramidal image structure is constructed for multiscale analysis and segments document image to a set of regions. By high-resolution page segmentation, by connected components analysis, each region is segmented to homogeneous regions and identifying them as texts, images, and tables/drawings. The proposed method was experiment with the Persian documents. The result of these tests have shown that the proposed method provide more accurate and speed results.

کلمات کلیدی:
Page segmentation, pyramidal image structure, connected components, horizontal and vertical merging

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/682903/