TY - JOUR T1 - Review of the Effect of Feature Selection for Microarray Data on the Classification Accuracy for Cancer Data Sets AU - Elkhani, Naeimeh AU - Muniyandi, Ravie Chandren JO - International Journal of Soft Computing VL - 11 IS - 5 SP - 334 EP - 342 PY - 2016 DA - 2001/08/19 SN - 1816-9503 DO - ijscomp.2016.334.342 UR - https://makhillpublications.co/view-article.php?doi=ijscomp.2016.334.342 KW - Microarray cancer data sets KW -feature selection methods KW -classification accuracy KW -wrappers KW -experimental AB - DNA microarrays can be used to monitor the expression level of thousands of genes simultaneously and gene microarray data can be used in cancer diagnosis and classification. Many machine learning techniques have been developed for computational analyses of microarray data. A common difficulty for all techniques is the large number of genes compared to the small sample size which has a negative impact on their speed and accuracy. To overcome these limitations, feature selection techniques are applied to distinguish between significant and redundant or irrelevant genes. Feature selection methods are used for two main goals. The first is to identify the relationship between specific diseases and genes. The second is to examine a compact set of discriminative genes to develop a pattern classifier with good generalizability and limited complexity. Here, we review different feature selection methods for cancer microarray data sets and analyze their accuracy. We describe methods commonly used for selecting significant features including filters, wrappers and embedded methods, categorized according to their experimental methodology. We then compare the classification accuracy of the methods for various cancer data sets and their time complexity to make some suggestions regarding the use of suitable methods for cancer data sets. ER -