Impact of linguistic and ontology information on improvement of Persian text clustering
سال انتشار: 1395
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 602
فایل این مقاله در 19 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
CBCONF01_0020
تاریخ نمایه سازی: 16 شهریور 1395
چکیده مقاله:
In Persian language, words have various writing forms and it is impossible to cover all the grammatical points of words by applying a series of certain rules. For this reason, automatic key words extraction from Persian texts is complex and difficult. In this paper, we try to provide more meaningful keywords by using linguistic information and thesaurus. By using thesaurus that has a structured system, we can complete and improve the equivalent, hierarchical and dependent words. So we can increase adaptation between users' search and textual key words and also comprehensiveness of the search.At the first stage, unimportant and general words are removed. Then the roots of words are extracted and in the following, a numerical weight is assigned to each word to define the importance of words by using weighting methods that shows the amount of importance of word in relation to text subject and in comparison with other used words in the text. The above operations specially using thesaurus would cause a more accurate text classification and also somehow hierarchical category of texts in information retrieval context is specified. Test results on some text in different subjects show accurately and ability of proposed method in extracting key words in adaptation with user's demand and lead to more accurate text clustering.
کلیدواژه ها:
نویسندگان
Farzad Tarhani
Assistant Professor, Department of Management, Malek Ashtar University, Tehran
Maryam Hourali
Assistant Professor, Department of Computer Engineering, Malek Ashtar University, Tehran,
Ali Nozari
Graduate Student, Department of Computer Engineering, Artificial Intelligence orientation, Malek Ashtar University, Tehran
Reza Javidan
Assistant Professor, Department of Management, Shiraz Industrial University, Shiraz
مراجع و منابع این مقاله:
لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :