Full metadata record
DC FieldValueLanguage
dc.contributor.authorHuang, Hui-Lingen_US
dc.contributor.authorWu, Yu-Chungen_US
dc.contributor.authorSu, Li-Jenen_US
dc.contributor.authorHuang, Yun-Juen_US
dc.contributor.authorCharoenkwan, Phasiten_US
dc.contributor.authorChen, Wen-Liangen_US
dc.contributor.authorLee, Hua-Chinen_US
dc.contributor.authorChu, William Cheng-Chungen_US
dc.contributor.authorHo, Shinn-Yingen_US
dc.date.accessioned2019-04-03T06:41:24Z-
dc.date.available2019-04-03T06:41:24Z-
dc.date.issued2015-02-21en_US
dc.identifier.issn1471-2105en_US
dc.identifier.urihttp://dx.doi.org/10.1186/s12859-015-0463-xen_US
dc.identifier.urihttp://hdl.handle.net/11536/124569-
dc.description.abstractBackground: Few studies have investigated prognostic biomarkers of distant metastases of lung cancer. One of the central difficulties in identifying biomarkers from microarray data is the availability of only a small number of samples, which results overtraining. Recently obtained evidence reveals that epithelial-mesenchymal transition (EMT) of tumor cells causes metastasis, which is detrimental to patients' survival. Results: This work proposes a novel optimization approach to discovering EMT-related prognostic biomarkers to predict the distant metastasis of lung cancer using both microarray and survival data. This weighted objective function maximizes both the accuracy of prediction of distant metastasis and the area between the disease-free survival curves of the non-distant and distant metastases. Seventy-eight patients with lung cancer and a follow-up time of 120 months are used to identify a set of gene markers and an independent cohort of 26 patients is used to evaluate the identified biomarkers. The medical records of the 78 patients show a significant difference between the disease-free survival times of the 37 non-distant-and the 41 distant-metastasis patients. The experimental results thus obtained are as follows. 1) The use of disease-free survival curves can compensate for the shortcoming of insufficient samples and greatly increase the test accuracy by 11.10%; and 2) the support vector machine with a set of 17 transcripts, such as CCL16 and CDKN2AIP, can yield a leave-one-out cross-validation accuracy of 93.59%, a test accuracy of 76.92%, a large disease-free survival area of 74.81%, and a mean survival prediction error of 3.99 months. The identified putative biomarkers are examined using related studies and signaling pathways to reveal the potential effectiveness of the biomarkers in prospective confirmatory studies. Conclusions: The proposed new optimization approach to identifying prognostic biomarkers by combining multiple sources of data (microarray and survival) can facilitate the accurate selection of biomarkers that are most relevant to the disease while solving the problem of insufficient samples.en_US
dc.language.isoen_USen_US
dc.subjectDistant metastasisen_US
dc.subjectGenetic algorithmen_US
dc.subjectLung canceren_US
dc.subjectMicroarrayen_US
dc.subjectPrognostic biomarkeren_US
dc.subjectSurvival curveen_US
dc.titleDiscovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival dataen_US
dc.typeArticleen_US
dc.identifier.doi10.1186/s12859-015-0463-xen_US
dc.identifier.journalBMC BIOINFORMATICSen_US
dc.citation.volume16en_US
dc.citation.spage0en_US
dc.citation.epage0en_US
dc.contributor.department生物科技學系zh_TW
dc.contributor.department生物資訊及系統生物研究所zh_TW
dc.contributor.department分子醫學與生物工程研究所zh_TW
dc.contributor.departmentDepartment of Biological Science and Technologyen_US
dc.contributor.departmentInstitude of Bioinformatics and Systems Biologyen_US
dc.contributor.departmentInstitute of Molecular Medicine and Bioengineeringen_US
dc.identifier.wosnumberWOS:000350370000001en_US
dc.citation.woscount12en_US
Appears in Collections:Articles


Files in This Item:

  1. a93fbb40cef7430039829a877559cc96.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.