Show simple item record Shen, Qiang Shang, Changjing 2008-01-24T11:37:26Z 2008-01-24T11:37:26Z 2006
dc.identifier.citation Shen , Q & Shang , C 2006 , ' Aiding classification of gene expression data with feature selection: a comparative study ' Journal of Computational Intelligence Research (IJCIR) , pp. 68-76 . en
dc.identifier.issn 0973-1873
dc.identifier.other PURE: 74345
dc.identifier.other dspace: 2160/472
dc.identifier.uri en
dc.description C. Shang and Q. Shen. Aiding classification of gene expression data with feature selection: a comparative study. Computational Intelligence Research, 1(1):68-76. en
dc.description.abstract This paper presents an application of supervised machine learning approaches to the classification of the yeast S. cerevisiae gene expression data. Established feature selection techniques based on information gain ranking and principal component analysis are, for the first time, applied to this data set to support learning and classification. Different classifiers are implemented to investigate the impact of combining feature selection and classification methods. Learning classifiers implemented include K-Nearest Neighbours (KNN), Naive Bayes and Decision Trees. Results of comparative studies are provided, demonstrating that effective feature selection is essential to the development of classifiers intended for use in highdimension domains. In particular, amongst a large corpus of systematic experiments carried out, best classification performance is achieved using a subset of features chosen via information gain ranking for KNN and Naive Bayes classifiers. Naive Bayes may also perform accurately with a relatively small set of linearly transformed principal features in classifying this difficult data set. This research also shows that feature selection helps increase computational efficiency while improving classification accuracy. en
dc.format.extent 9 en
dc.language.iso eng
dc.relation.ispartof Journal of Computational Intelligence Research (IJCIR) en
dc.title Aiding classification of gene expression data with feature selection: a comparative study en
dc.type Text en
dc.type.publicationtype Article (Journal) en
dc.contributor.institution Department of Computer Science en
dc.contributor.institution Advanced Reasoning Group en
dc.description.status Peer reviewed en

Files in this item

Aside from theses and in the absence of a specific licence document on an item page, all works in Cadair are accessible under the CC BY-NC-ND Licence. AU theses and dissertations held on Cadair are made available for the purposes of private study and non-commercial research and brief extracts may be reproduced under fair dealing for the purpose of criticism or review. If you have any queries in relation to the re-use of material on Cadair, contact

This item appears in the following Collection(s)

Show simple item record

Search Cadair

Advanced Search