高级检索

    面向信息系统设计的癌症阶段生物标志物识别

    Identification of the Cancer Stage Biomarkers for Information System Design

    • 摘要: 癌症是1种复杂且动态变化的高异质性疾病. 它的发生发展伴随着大量的基因突变与功能失调. 识别癌症阶段相关的生物标志物,对于了解癌症的致病机理与发展机制至关重要. 然而,现有的癌症生物标志物识别方法通常将各个基因看作是孤立的节点,且仅关注癌症的二分类效果,忽略了不同基因之间的交互关系变化以及不同病理阶段的显著差别. 为解决上述问题,首先为癌症各阶段构建回归残差网络(regression residual network,RRN),分析每个阶段RRN的节点和边,并结合生物通路进行多源数据挖掘,刻画了癌症随阶段演化的整个过程. 通过对癌症的演化分析,分别获得癌症二分类和阶段多分类的生物网络标志物,并在GSE10072和GSE42171数据集上进行了验证. 实验结果表明,仅使用2个基因ALDOA和NME1组成的生物标志物,可以在肺腺癌二分类问题上获得跟现有研究结果具有同样竞争力的预测精度,而使用由17条边构成的阶段生物标志物,则可以在肺腺癌阶段多分类问题上获得比现有方法高出14.86%的预测精度.

       

      Abstract: Cancer is an exceptionally complex and highly heterogeneous disease with dynamic changes. Its occurrence and development are accompanied by a large number of gene mutations and functional disorders. Identifying biomarkers related to cancer stages is crucial for understanding the pathogenic and developmental mechanisms of cancer. However, the existing research on cancer biomarker recognition often treat individual genes as isolated nodes and usually only focused on the binary classification of cancer, ignoring the significant differences among different stages of cancer. To overcome the above issues, this study first constructs a RRN (regression residual network) for each cancer stage, and then analyzes the nodes and edges of RRN in each stage. After that, the multi-source data mining were conducted in biological pathways, and the entire process of cancer evolution was characterized along with stages. By doing this, both biomarkers for cancer binary classification and multi-stage classification were obtained, and they were validated on the GSE10072 and GSE42171, respectively. The experimental results showed that the obtained biomarkers ALDOA and NME1 achieved competitive accuracy like existing methods by use only two genes for lung adenocarcinoma, and the biomarkers consist of 17 edges achieved the improved accuracy by 14.86% by comparing with existing methods in terms of multi-stage classification.

       

    /

    返回文章
    返回