Novel Features Ranking Method for High Dimensional Biological Data

سال انتشار: 1396
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 374

نسخه کامل این مقاله ارائه نشده است و در دسترس نمی باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

IBIS07_034

تاریخ نمایه سازی: 29 فروردین 1397

چکیده مقاله:

Undoubtedly, feature selection phase plays a signifcant role in the accurate result of any machine learning process in general and data classification in particular. Biological data often contain irrelevant and noisy features. So selecting a small subset out of thousands of features in biological dataset is a considerable step to identify biological variables and meaningful patterns based on DNA or protein sequences. Existing feature selection methods are computationally expensive and time consuming. Since nominal datasets are the major parts of data in biological world, in this paper, we propose a new feature ranking method on high dimensional nominal datasets. In order to increase the accuracy of classification, we first measure significance of each (feature) gene by (feature) gene ranking method via introducing a new metric which calculate the importance of all features based on the correlation between the feature and class label and then allocate a score to every features. Therefore our method can improve classification accuracy without interruption of redundant features. Finally, experiments on real influenza sequence data and the UCI datasets show that our method compares very well with other well known methods (including: Information gain, Chi Squared, Information gain ratio and Relief) in classification and ranking while being computationally efficient and scalable.

نویسندگان

Fatemeh Kargarfard

Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz, Iran

Esmaeil Ebrahimie

The University of Adelaide, School of Medicine, Adelaide, South Australia, Australia.

Ashkan Sami

Department of Computer Science and Engineering, School of Electrical Engineering and Computer, Shiraz University, Shiraz, Iran