Download | - View accepted manuscript: A Multi-strategy approach to informative gene identification from gene expression data (PDF, 775 KiB)
|
---|
DOI | Resolve DOI: https://doi.org/10.1142/S0219720010004495 |
---|
Author | Liu, Ziying1; Phan, Sieu1; Famili, Fazel1; Pan, Youlian1; Lenferink, Anne E. G.2; Cantin, Christiane2; Collins, Catherine2; O'Connor-McCourt, Maureen D.2 |
---|
Affiliation | - National Research Council of Canada. NRC Institute for Information Technology
- National Research Council of Canada. NRC Biotechnology Research Institute
|
---|
Format | Text, Article |
---|
Subject | Gene expression data analysis; Multi-strategy learning; Data mining and knowledge discovery |
---|
Abstract | An unsupervised multi-strategy approach has been developed to identify informative genes from high-throughput genomic data. Several statistical methods have been used in the field to identify differentially expressed genes. Since different methods generate different lists of genes, it is very challenging to determine the most reliable gene list and the appropriate method. This paper presents a multi-strategy method in which a combination of several data analysis techniques is applied to a given dataset, and a confidence measure is established to select genes from the gene lists generated by these techniques to form the core of our final selection. The remaining genes, which form the peripheral region, are subject to exclusion from or inclusion in the final selection. This paper demonstrates the methodology through its application to an in-house cancer genomics dataset and a public dataset. The results indicate that our method provides a more reliable list of genes, which is validated using biological knowledge, biological experiments and literature search. We further evaluated our multi-strategy method by consolidating two pairs of independent datasets; each pair covers the same disease but was generated by different labs using different platforms. The results showed that our method produced far better results. |
---|
Publication date | 2010-02-01 |
---|
In | |
---|
Language | English |
---|
Peer reviewed | Yes |
---|
NPARC number | 16435932 |
---|
Record identifier | c2977845-7f81-4623-8ac2-c407ee06c4cd |
---|
Record created | 2010-11-25 |
---|
Record modified | 2020-04-17 |
---|