30 Ordibehesht 1403
Mohsen Javaherian

Academic rank: Assistant Professor
Address:
Education: PhD / Physics - Astrophysics
Phone:
Faculty: Research Institute for Astronomy and Astrophysics of Maragha

Research details

Title
A hybrid algorithm for feature subset selection in high-dimensional datasets using FICA and IWSSr algorithm
Research type: Published article
Keywords
Feature subset selection, FICA, IWSSr algorithm, High dimensional classification problems
Year
2015
Journal: APPLIED SOFT COMPUTING
DOI: 10.1016/j.asoc.2015.03.049
Researchers: Mostafa Moradkhani, Ali Amiri, Mohsen Javaherian, Hossein Safari

Abstract

Feature subset selection is a substantial problem in data classification. Its purpose is to find an efficient subset of the original features that increases both efficiency and accuracy while reducing the cost of classification. High-dimensional datasets, which have a very large number of predictive attributes but only a small number of instances, require dedicated techniques to select an optimal feature subset. In this paper, a hybrid method is proposed for efficient subset selection in high-dimensional datasets. The proposed algorithm combines filter and wrapper approaches in two phases. In the filter phase, the symmetrical uncertainty (SU) criterion is used to weight features according to how well they discriminate the classes. In the wrapper phase, both FICA (fuzzy imperialist competitive algorithm) and IWSSr (Incremental Wrapper Subset Selection with replacement) are executed in the weighted feature space to find relevant attributes. The new scheme is successfully applied to 10 standard high-dimensional datasets, especially from the biosciences and medicine, where the number of features is large compared to the number of samples, inducing a severe curse-of-dimensionality problem. Comparison with other algorithms confirms that the proposed method achieves the highest accuracy rate and is also able to find an efficient, compact subset.
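
The filter phase described in the abstract weights features by symmetrical uncertainty. As a rough illustration only, the sketch below computes SU with its standard definition, SU(X, Y) = 2 I(X; Y) / (H(X) + H(Y)), for discrete features. The function names (`entropy`, `symmetrical_uncertainty`, `rank_features`) are hypothetical and are not taken from the paper; continuous attributes are assumed to have been discretized beforehand, and the FICA and IWSSr wrapper steps are not shown.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete, hashable values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def symmetrical_uncertainty(feature, labels):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)); 0 means independent, 1 means fully dependent."""
    h_x = entropy(feature)
    h_y = entropy(labels)
    if h_x + h_y == 0:
        # Both variables are constant, so there is nothing to measure.
        return 0.0
    # Joint entropy H(X, Y), computed over (feature value, label) pairs.
    h_xy = entropy(list(zip(feature, labels)))
    mutual_info = h_x + h_y - h_xy
    return 2.0 * mutual_info / (h_x + h_y)


def rank_features(columns, labels):
    """Return (feature name, SU) pairs sorted by decreasing SU.

    `columns` is assumed to map a feature name to its list of discrete values.
    """
    scores = {name: symmetrical_uncertainty(col, labels) for name, col in columns.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

A ranking like the one returned by `rank_features` could serve as the weighted feature ordering that a wrapper stage consumes, although how the paper actually passes SU weights to FICA and IWSSr is not specified in the abstract.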