Research Article

Development of an artificial intelligence-based precision medicine decision support system for radiogenomics data sets

Volume: 47 Number: 2 June 21, 2025
EN TR

Development of an artificial intelligence-based precision medicine decision support system for radiogenomics data sets

Abstract

Aim: This study aims to apply deep learning algorithms for superpixel segmentation, herbaceous thresholding, and disease reference position estimation from DICOM images and clinical data of Non-Small Cell Lung Cancer (NSCLC) patients. Quantitative imaging data was integrated with clinical information. Various machine learning algorithms were employed to identify biomarkers and evaluate classification performance based on clinical data, imaging data, and their combination, assessing the model improvement rates. Materials and Methods: The clinical dataset included 43 patients with and 168 without an Epidermal Growth Factor Receptor (EGFR) mutation, and 38 with and 173 without a Kirsten Rat Sarcoma Viral Oncogene Homolog (KRAS) mutation, totaling 211 NSCLC cases. A total of 2,231 images were analyzed. Using the VGG16 deep learning model, 25,088 features were extracted from each image. XGBoost, CatBoost, Random Forest, and Support Vector Machine (SVM) classification algorithms were used to predict mutation status. Findings: Clinical data revealed significant differences in mutation status among NSCLC patients. The Random Forest algorithm was employed for feature selection, identifying the 50 most important variables for model training. XGBoost and CatBoost achieved the highest classification performance, with results for accuracy, balanced accuracy, precision, sensitivity, F1-score, and ROC-AUC as follows: 0.965 ± 0.015, 0.954 ± 0.021, 0.953 ± 0.024, 0.994 ± 0.007, 0.973 ± 0.011, and 0.990 ± 0.005, respectively. Result: The study’s findings demonstrate that XGBoost and CatBoost models were highly effective in predicting KRAS mutation status from imaging data. CatBoost also performed best in determining EGFR mutation status, outperforming other machine learning methods.

Keywords

References

  1. 1. Lambin, P. et al. Radiomics: extracting more information from medical images using advanced feature analysis. European journal of cancer 48, 441-446 (2012).
  2. 2. Van Griethuysen, J. J. et al. Computational radiomics system to decode the radiographic phenotype. Cancer research 77, e104-e107 (2017).
  3. 3. Chaddad, A., Daniel, P., Sabri, S., Desrosiers, C. & Abdulkarim, B. Integration of radiomic and multi-omic analyses predicts survival of newly diagnosed IDH1 wild-type glioblastoma. Cancers 11, 1148 (2019).
  4. 4. Song, L. et al. Clinical, conventional CT and radiomic feature-based machine learning models for predicting ALK rearrangement status in lung adenocarcinoma patients. Frontiers in Oncology 10, 369 (2020).
  5. 5. Xu, Y. et al. Deep learning predicts lung cancer treatment response from serial medical imaging. Clinical Cancer Research 25, 3266-3275 (2019).
  6. 6. Yamamoto, S. et al. ALK molecular phenotype in non–small cell lung cancer: CT radiogenomic characterization. Radiology 272, 568-576 (2014).
  7. 7. Shi, L. et al. Radiomics for response and outcome assessment for non-small cell lung cancer. Technology in cancer research & treatment 17, 1533033818782788 (2018).
  8. 8. Armanious, K. et al. MedGAN: Medical image translation using GANs. Computerized medical imaging and graphics 79, 101684 (2020).

Details

Primary Language

English

Subjects

Health Informatics and Information Systems

Journal Section

Research Article

Publication Date

June 21, 2025

Submission Date

June 3, 2025

Acceptance Date

June 11, 2025

Published in Issue

Year 2025 Volume: 47 Number: 2

AMA
1.Pınar A, Arslan AK, Güldoğan E. Development of an artificial intelligence-based precision medicine decision support system for radiogenomics data sets. CMJ. 2025;47(2):35-44. doi:10.7197/cmj.1713462