TY  - JOUR
T1  - Review of the Effect of Feature Selection for Microarray Data on the
Classification Accuracy for Cancer Data Sets
AU - Elkhani, Naeimeh AU - Muniyandi, Ravie Chandren 
JO  - International Journal of Soft Computing
VL  - 11
IS  - 5
SP  - 334
EP  - 342
PY  - 2016
DA  - 2001/08/19
SN  - 1816-9503
DO  - ijscomp.2016.334.342
UR  - https://makhillpublications.co/view-article.php?doi=ijscomp.2016.334.342
KW  - Microarray cancer data sets
KW  -feature selection methods
KW  -classification accuracy
KW  -wrappers
KW  -experimental
AB  - DNA microarrays can be used to monitor the expression level of thousands of genes simultaneously and gene microarray data can be used in cancer diagnosis and classification. Many machine learning techniques have been developed for computational analyses of microarray data. A common difficulty for all techniques is the large number of genes compared to the small sample size which has a negative impact on their speed and accuracy. To overcome these limitations, feature selection techniques are applied to distinguish between significant and redundant or irrelevant genes. Feature selection methods are used for two main goals. The first is to identify the relationship between specific diseases and genes. The second is to examine a compact set of discriminative genes to develop a pattern classifier with good generalizability and limited complexity. Here, we review different feature selection methods for cancer microarray data sets and analyze their accuracy. We describe methods commonly used for selecting significant features including filters, wrappers and embedded methods, categorized according to their experimental methodology. We then compare the classification accuracy of the methods for various cancer data sets and their time complexity to make some suggestions regarding the use of suitable methods for cancer data sets.
ER  -