高级检索
许楚璇, 姜飞, 沈伟涛, 孙岩, 沈孝兵. 基于代谢组学和机器学习探究胃癌血浆诊断标志物[J]. 中国公共卫生, 2023, 39(2): 164-169. DOI: 10.11847/zgggws1139011
引用本文: 许楚璇, 姜飞, 沈伟涛, 孙岩, 沈孝兵. 基于代谢组学和机器学习探究胃癌血浆诊断标志物[J]. 中国公共卫生, 2023, 39(2): 164-169. DOI: 10.11847/zgggws1139011
XU Chu-xuan, JIANG Fei, SHEN Wei-tao, . Plasma markers for gastric cancer diagnosis: a metabolomics- and machine learning-based exploratory study[J]. Chinese Journal of Public Health, 2023, 39(2): 164-169. DOI: 10.11847/zgggws1139011
Citation: XU Chu-xuan, JIANG Fei, SHEN Wei-tao, . Plasma markers for gastric cancer diagnosis: a metabolomics- and machine learning-based exploratory study[J]. Chinese Journal of Public Health, 2023, 39(2): 164-169. DOI: 10.11847/zgggws1139011

基于代谢组学和机器学习探究胃癌血浆诊断标志物

Plasma markers for gastric cancer diagnosis: a metabolomics- and machine learning-based exploratory study

  • 摘要:
      目的   基于代谢组学和机器学习算法探究血浆代谢物对胃癌的诊断价值。
      方法  分别收集20例胃癌患者(胃癌组)和20名健康自愿者(健康对照组)的血浆样本,用甲醇提取血浆样本中代谢物,对提取的代谢物进行液相色谱串联质谱分析;通过与 mzCloud、mzVault 和 Masslist 数据库比对,对分析后的代谢物进行注释;以变量权重值(variable importance of projection,VIP) > 1、P < 0.05,log2|Fold Change| > 1的标准筛选2组血浆差异代谢物;采用超几何检验将筛选出的差异代谢物富集至京都基因与基因组百科全书(kyoto encyclopedia of genes and genomes,KEGG)数据库进行富集分析,确定2组间的差异代谢通路;通过Boruta算法建立基于特征代谢物的诊断模型和绘制受试者工作特征曲线(receiver operating characteristic,ROC),确定胃癌血浆特征诊断性代谢物,并计算其在2组中的相对含量。
      结果  从2组血浆提取物中共筛选出230个差异代谢物;富集到5条代谢差异通路,分别为苯丙氨酸代谢,苯丙氨酸、酪氨酸和色氨酸的生物合成,精氨酸的生物合成,组氨酸代谢,泛酸和辅酶A的生物合成;共确定9个重要的特征诊断性代谢物,分别为天门冬氨酸、苯乙胺、鸟氨酸、马尿酸、瓜氨酸、泛酰巯基乙胺、1 – 甲基组胺、酪氨酸及组氨酸。胃癌组血浆中9种特征诊断性代谢物相对含量均明显低于健康对照组(P < 0.05)。
      结论  富集到的5条代谢差异通路可能参与了胃癌的发展过程,确定的9种特征诊断性代谢物可作为胃癌诊断的生物标志物。代谢组学与机器学习算法相结合有助于确定胃癌诊断标志物。

     

    Abstract:
      Objective  To investigate the significance of plasma metabolites for gastric cancer diagnosis based on metabolomics and machine learning algorithms.
      Methods  Plasma samples were collected from 20 gastric cancer patients and 20 gender- and age-matched healthy volunteers (controls). After extracted with methanol, the metabolites in the plasma samples were analyzed with chromatography-mass spectrometry and annotated with mzCloud, mzVault and Masslist databases. The differential metabolites between the cases and controls were screened with the value of variable importance of projection (VIP: > 1, P < 0.05, log2|fold change| > 1); the identified differential metabolites were subjected to the Kyoto Encyclopedia of Genes and Genomes (KEGG) database enrichment using hypergeometric tests to detect differential metabolic pathways between the two groups. Specific metabolite-based diagnostic model and receiver operating characteristic (ROC) curve were established using Boruta algorithm to identify significant differential metabolites for gastric cancer diagnosis. Relative contents of the differential metabolites were also calculated.
      Results  A total of 230 differential metabolites were screened out from the plasma extracts of the two groups and 5 differential metabolic pathways were identified, including phenylalanine metabolism, biosynthesis of phenylalanine/tyrosine/tryptophan, arginine biosynthesis, histidine metabolism, and pantothenic acid/coenzyme A biosynthesis. The identified 9 significant differential metabolites for gastric cancer diagnosis were L-aspartic acid, 2-phenylethylamine, ornithine, hippuric acid, citrulline, pantetheine, 1-methylhistamine, L-tyrosine, and L-histidine and the relative levels of all the 9 differential metabolites were significantly lower in the plasma from the cases than those from the controls (P < 0.05 for all).
      Conclusion  The enriched five metabolic differential pathways may be involved in the development of gastric cancer, and the nine differential metabolites can be used as metabolic biomarkers for gastric cancer diagnosis. The combination of metabolomics and machine learning algorithms could help identify markers for gastric cancer diagnosis.

     

/

返回文章
返回