作者
Sung-Bae Cho, Hong-Hee Won
发表日期
2003/1/1
图书
Proceedings of the First Asia-Pacific Bioinformatics Conference on Bioinformatics 2003-Volume 19
页码范围
189-198
简介
The development of microarray technology has supplied a large volume of data to many fields. In particular, it has been applied to prediction and diagnosis of cancer, so that it expectedly helps us to exactly predict and diagnose cancer. To precisely classify cancer we have to select genes related to cancer because extracted genes from microarray have many noises. In this paper, we attempt to explore many features and classifiers using three benchmark datasets to systematically evaluate the performances of the feature selection methods and machine learning classifiers. Three benchmark datasets are Leukemia cancer dataset, Colon cancer dataset and Lymphoma cancer data set. Pearson’s and Spearman’s correlation coefficients, Euclidean distance, cosine coefficient, information gain, mutual information and signal to noise ratio have been used for feature selection. Multi-layer perceptron, k-nearest neighbour, support vector machine and structure adaptive self–organizing map have been used for classification. Also, we have combined the classifiers to improve the performance of classification. Experimental results show that the ensemble with several basis classifiers produces the best recognition rate on the benchmark dataset.
引用总数
20042005200620072008200920102011201220132014201520162017201820192020202120222023202481623192322281117171121271319252319262811
学术搜索中的文章
SB Cho, HH Won - Proceedings of the First Asia-Pacific Bioinformatics …, 2003