Huiqing Liu (huiqing@lit.a-star.edu.sg)
Jinyan Li (jinyan@lit.a-star.edu.sg)
Limsoon Wong (limsoon@lit.a-star.edu.sg)
Laboratories for Information Technology, 21 Heng Mui Keng Terr, 119613 Singapore
Feature selection plays an important role in classification. We present a comparative study on six feature selection heuristics by applying them to two sets of data. The first set of data are gene expression profiles from Acute Lymphoblastic Leukemia (ALL) patients. The second set of data are proteomic patterns from ovarian cancer patients. Based on features chosen by these methods, error rates of several classification algorithms were obtained for analysis. Our results demonstrate the importance of feature selection in accurately classifying new samples.