|Table of Contents|

Research on breast cancer diagnosis model based on clinlabomics and machine learning

Journal Of Modern Oncology[ISSN:1672-4992/CN:61-1415/R]

Issue:
2024 07
Page:
1264-1272
Research Field:
Publishing date:

Info

Title:
Research on breast cancer diagnosis model based on clinlabomics and machine learning
Author(s):
LU Feng12ZHANG Kaijiong3WU Lichun3JIANG Xuchuan2JI Chengjie2LIU Jinbo1
1.Medical Laboratory Department of the Affiliated Hospital of Southwest Medical University,Sichuan Luzhou 646000,China;2.Department of Experimental Medicine,Jianyang City People's Hospital,Sichuan Jianyang 641400,China;3.Sichuan Cancer Hospital Institute/Sichuan Cancer Prevention Center/Medical Laboratory Department of the Affiliated Tumor Hospital of University of Electronic Science and Technology,Sichuan Chengdu 610000,China.
Keywords:
breast cancermachine learningclinlabomicsdiagnostic modelartificial intelligence
PACS:
R737.9
DOI:
10.3969/j.issn.1672-4992.2024.07.014
Abstract:
Objective:To construct a breast cancer diagnostic model using machine learning algorithms based on big data from routine clinical laboratories,in order to explore the application value of clinlabomics in the diagnosis of breast cancer.Methods:A retrospective study was conducted,collecting clinical and laboratory test data from 6 089 breast cancer patients and 6 830 patients with benign breast diseases.Various machine learning algorithms including extreme gradient boosting(XGBoost),neural network(NN),support vector machine(SVM),random forest(RF),K-nearest neighbors(KNN),Logistic regression(LR),linear discriminant analysis(LDA),naive bayes,gradient boosting machine(GBM) algorithm,and C5.0 decision tree were utilized for establishing the breast cancer diagnostic model.Ten-fold cross validation was used to train the model,and the accuracy,AUC,average accuracy,specificity,sensitivity,positive predictive value,negative predictive value,and Kappa value were used to evaluate the performance of each model.Results:Nine indicators were selected from 28 conventional clinical test indicators,including GLU,DBIL,RDW-CV,MONO,TG,ALB,RBC,LYMPH and UREA,and then age was added for model construction.The model was evaluated by using 10 machine learning algorithms,and the results show that the gradient elevator algorithm,has the best diagnostic performance compared with other models.The accuracy,AUC,average accuracy,specificity,sensitivity,positive predictive value,negative predictive value,and Kappa value of gradient elevator algorithm in diagnosing breast cancer were 0.80,0.80,0.80,0.77,0.82,0.78,0.81 and 0.59,respectively.Conclusion:Based on the routine clinical laboratory data,the machine learning algorithm is used to construct the breast cancer diagnosis model,which can provide decision support for the auxiliary diagnosis of breast cancer.

References:

[1]BRAY F,LAVERSANNE M,WEIDERPASS E,et al.The ever-increasing importance of cancer as a leading cause of premature death worldwide[J].Cancer,2021,127(16):3029-3030.
[2]SUNG H,FERLAY J,SIEGEL RL,et al.Global cancer statistics 2020:GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries[J].CA Cancer J Clin,2021,71(3):209-249.
[3]赫捷,陈万青,李霓,等.中国女性乳腺癌筛查与早诊早治指南(2021,北京)[J].中华肿瘤杂志,2021,43(04):357-382. HE J,CHEN WQ,LI N,et al.Guidelines for screening,early diagnosis and treatment of breast cancer in Chinese women(2021,Beijing)[J].Chin J Cancer,2021,43(04):357-382.
[4]MIN N,WEI Y,ZHENG Y,et al.Advancement of prognostic models in breast cancer:a narrative review[J].Gland Surg,2021,10(9):2815-2831.
[5]YOUSEFI B,AKBARI H,MALDAGUE X.Detecting vasodilation as potential diagnostic biomarker in breast cancer using deep learning-driven thermomics[J].Biosensors(Basel),2020,10(11):164.
[6]LIU S,SUN Y,HOU Y,et al.A novel lncRNA ROPM-mediated lipid metabolism governs breast cancer stem cell properties[J].J Hematol Oncol,2021,14(1):178.
[7]ZHANG Y,HUANG YX,WANG DL,et al.LncRNA DSCAM-AS1 interacts with YBX1 to promote cancer progression by forming a positive feedback loop that activates FOXA1 transcription network[J].Theranostics,2020,10(23):10823-10837.
[8]TARIGHATI E,KEIVAN H,MAHANI H.A review of prognostic and predictive biomarkers in breast cancer[J].Clin Exp Med,2023,23(1):1-16.
[9]DAI J,WANG H,XU Y,et al.Clinical application of AI-based PET images in oncological patients[J].Semin Cancer Biol,2023,91:124-142.
[10]SALTO-TELLEZ M,MAXWELL P,HAMILTON P.Artificial intelligence-the third revolution in pathology[J].Histopathology,2019,74(3):372-376.
[11]TANG Y,TIAN W,XIE J,et al.Prognosis and dissection of Immunosuppressive microenvironment in breast cancer based on fatty acid metabolism-related signature[J].Front Immunol,2022,13:843515.
[12]TAN L,MAI D,ZHANG B,et al.PIWI-interacting RNA-36712 restrains breast cancer progression and chemoresistance by interaction with SEPW1 pseudogene SEPW1P RNA[J].Mol Cancer,2019,18(1):9.
[13]LIU Y,BAI F,TANG Z,et al.Integrative transcriptomic,proteomic,and machine learning approach to identifying feature genes of atrial fibrillation using atrial samples from patients with valvular heart disease[J].BMC Cardiovasc Disord,2021,21(1):52.
[14]WONG J,HORWITZ MM,ZHOU L,et al.Using machine learning to identify health outcomes from electronic health record data[J].Curr Epidemiol Rep,2018,5(4):331-342.
[15]BEAM AL,KOHANE IS.Big data and machine learning in health care[J].JAMA,2018,319(13):1317-1318.
[16]BANERJEE M,REYNOLDS E,ANDERSSON HB,et al.Tree-based analysis[J].Circ Cardiovasc Qual Outcomes,2019,12(5):e004879.
[17]CHANG W,LIU Y,XIAO Y,et al.A machine-learning-based prediction method for hypertension outcomes based on medical data[J].Diagnostics(Basel),2019,9(4):178.
[18]KIM Y,KIM J,KIM S,et al.Machine learning-based risk prediction model for canine myxomatous mitral valve disease using electronic health record data[J].Front Vet Sci,2023,10:1189157.
[19]BENJAMENS S,DHUNNOO P,MESKO B.The state of artificial intelligence-based FDA-approved medical devices and algorithms:an online database[J].NPJ Digit Med,2020,3:118.
[20]CUI M,ZHANG DY.Artificial intelligence and computational pathology[J].Lab Invest,2021,101(4):412-422.
[21]CONTI A,DUGGENTO A,INDOVINA I,et al.Radiomics in breast cancer classification and prediction[J].Semin Cancer Biol,2021,72:238-250.
[22]DUGGENTO A,CONTI A,MAURIELLO A,et al.Deep computational pathology in breast cancer[J].Semin Cancer Biol,2021,72:226-237.
[23]ZHAO W,GENG D,LI S,et al.LncRNA HOTAIR influences cell growth,migration,invasion,and apoptosis via the miR-20a-5p/HMGA2 axis in breast cancer[J].Cancer Med,2018,7(3):842-855.
[24]LI B,ZHANG N,WANG YG,et al.Genomic prediction of breeding values using a subset of SNPs identified by three machine learning methods[J].Front Genet,2018,9:237.
[25]MENZE BH,KELM BM,MASUCH R,et al.A comparison of random forest and its gini importance with standard chemometric methods for the feature selection and classification of spectral data[J].BMC Bioinformatics,2009,10:213.
[26]刘伟,张阳,黄姣祺,等.检验组学在精准医疗中的应用价值[J].国际检验医学杂志,2018,39(09):1025-1028. LIU W,ZHANG Y,HUANG JQ,et al.Application value of clinlabomics in precision medicine [J].International Journal of Laboratory Medicine,2018,39(09):1025-1028.
[27]WEN X,LENG P,WANG J,et al.Clinlabomics:leveraging clinical laboratory data by data mining strategies[J].BMC Bioinformatics,2022,23(1):387.
[28]LI H,LIN J,XIAO Y,et al.Colorectal cancer detected by machine learning models using conventional laboratory test data[J].Technol Cancer Res Treat,2021,20:15330338211058352.
[29]PODNAR S,KUKAR M,GUNCAR G,et al.Diagnosing brain tumours by routine blood tests using machine learning[J].Sci Rep,2019,9(1):14481.
[30]WU J,ZAN X,GAO L,et al.A machine learning method for identifying lung cancer based on routine blood indices:qualitative feasibility study[J].JMIR Med Inform,2019,7(3):e13476.
[31]王鹏,丁晓凌,谢鸣杰,等.基于机器学习算法预测腹膜后脂肪肉瘤患者总体生存期的临床研究 [J].现代肿瘤医学,2023,31(07):1291-1296. WANG P,DING XL,XIE MJ,et al.Prediction of overall survival in patients with retroperitoneal liposarcoma based on machine learning algorithm [J].Modern Oncology,2023,31(07):1291-1296.

Memo

Memo:
四川省科技厅重点研发项目第二版(编号:2022YFS0335);四川省成都市医学科研课题(编号:2022417);四川省简阳市人民医院科研课题(编号:JY202234)
Last Update: 2024-02-29