An Enhanced SMOTE Algorithm Using Entropy and Clustering for Imbalanced Accident Data

سال انتشار: 1393
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 543

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

CITCONF02_513

تاریخ نمایه سازی: 19 اردیبهشت 1395

چکیده مقاله:

Over the course of the century, many real-world applications of imbalanced data are emerged. One of its implication which is first considered in this context, is imbalanced accident data. In this paper, the data of transportation and accidents in Tehran-Bazargan highway between 2010 and 2015 is considered. In the pre-processing step, SMOTE is considered as one of the most important over-sampling technique that effectively balance the imbalanced data. However, it brings noise and other problems and a great need is felt for improving this method. To solve these problems, several techniques have been proposed in this study such as combination of dynamic selected, weighted attribute and distance weighted techniques along with mixture of classification and clustering techniques. Performance of the proposed algorithm is measured by f-measure and ROC curve and the results are compared by Weka’s SMOTE with different algorithms.

کلیدواژه ها:

نویسندگان

Sima Sharifirad

Master student of computer science, AmirKabir University

Azra Nazari

Graduate student of master of computer science, AmirKabir University

Mahdi Ghatee

Assistant professor of computer science, AmirKabir University

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • J. Laurikkala, (2001), "Improving Identification of Difficult Small Classes by ...
  • Chawla, N.V., K. Bowyer, Hall.l and Kegelmeyer , W.(2002), SMOTE: ...
  • Lopez, V. Fernandez, A and Garcfa, S.I, (2013) _ insight ...
  • European Transport Safety Council, (2013). Back on track to reach ...
  • World Health Report: Making a difference. Geneva, (1999), World Health ...
  • .Raj aNews.com, (1 393). ...
  • Wu, J., S.C. Brubaker, M.D. Mullin and J.M. Rehg, (2008)." ...
  • He, H.B. and E.A. Garcia, 2009. Learning from imibalanced data. ...
  • Ying, ..(201 2), "Imbalanced classification based On Active Learning SMOTE, ...
  • Lewis, D. and W. Gale, (1998). "Training text classifiere by ...
  • Ling, C. and Li, C. (1998). "Data Mining for Direct ...
  • Japkowicz, N. (Ed.). (200). Proceedings of the AAAI 200) Workshop ...
  • Nitesh V.C et al, (2002), ;" SMOTE: Synthetic Minority Oversampling ...
  • H. Han, W.Y. Wang, and B.H. Mao, (2005) _ orderline- ...
  • M. Kubat and S. Matwin, (1997) "Addressing the Curse of ...
  • G.E.A.P.A. Batista, R.C. Prati, and M.C. Monard, (204), "A Study ...
  • X. Xiao, and H. Ding, (2012), "Enhancemet of K-nearest Neighbor ...
  • L. Jiang, Z. Cai, D. Wang, and S. Jiang, (2007)Survey ...
  • J. Wu, Z. Cai and Z. Gao, (20 10), "Dynamic ...
  • O.Kwon, W.Rhee and Y.Yoon, (201 5), "Application of classification algorithms ...
  • نمایش کامل مراجع