Mayo Clinic Arizona
United States
Multimodal AI Fusion Model for Early Detection for Pancreatic Cancer
Pancreatic ductal adenocarcinoma (PDAC), accounting for 90% of all pancreatic cancers, is among the deadliest, due largely to late-stage diagnosis and the aggressive nature of the disease. The critical challenge lies in early detection, which is currently not viable for the general population due to low annual incidence and a significant risk of false positives even with highly specific tests. While current risk assessment tools rely on static factors such as age, obesity, and diabetes, recent studies suggest the potential for imaging biomarkers derived from pre-cancerous computed tomography (CT) scans to predict PDAC. Our project aims to develop a comprehensive and scalable risk prediction model that fuses imaging and non-imaging data to enable early detection of PDAC in asymptomatic individuals. The model, termed "PRECISE" (PancREas Cancer multImodal riSk prEdiction), will employ novel algorithmic adversarial debiasing techniques to ensure fairness, meaning it should perform accurately across different demographic and socioeconomic subgroups. In Aim 1, we will develop deep learning models that segment imaging biomarkers from abdominal CTs, applying adversarial debiasing to ensure fair representation across diverse patient factors and image acquisition techniques. Validation will be done using data from Mayo Clinic, Cornell University, and UCSF. Aim 2 involves the creation of the PRECISE fusion model. It will combine imaging biomarkers from CTs with clinical data from electronic medical records (EMRs) to predict the risk of PDAC. We will employ a graph neural network model to capture the semantic relations between multimodal data. The model's prognostic performance will be compared with baseline models. In Aim 3, we plan to deploy and evaluate the PRECISE model prospectively across disparate geographical sites. The model's performance will be assessed by comparing its predictions with patient outcomes collected at regular intervals. This proposal's overall goal is to create a fair and effective PDAC risk prediction tool, PRECISE, that leverages both imaging and non-imaging data to calculate unbiased risk estimates. If successful, our scalable automated risk stratification could potentially transform PDAC early detection, enabling opportunistic screening for patients undergoing routine abdominopelvic CT scans for non-pancreatic cancer indications. This could significantly improve PDAC survival rates by enabling earlier intervention and treatment.
Publications
- Peng Y, Malin BA, Rousseau JF, Wang Y, Xu Z, Xu X, Weng C, Bian J. From GPT to DeepSeek: Significant gaps remain in realizing AI in healthcare. Journal of biomedical informatics. 2025 Mar;163:104791. Epub 2025 Feb 10. PMID: 39938624
- Tam TYC, Sivarajkumar S, Kapoor S, Stolyar AV, Polanska K, McCarthy KR, Osterhoudt H, Wu X, Visweswaran S, Fu S, Mathur P, Cacciamani GE, Sun C, Peng Y, Wang Y. A framework for human evaluation of large language models in healthcare derived from literature review. NPJ digital medicine. 2024 Sep 28;7(1):258. PMID: 39333376
- Kim JW, Khan AU, Banerjee I. Systematic Review of Hybrid Vision Transformer Architectures for Radiological Image Analysis. Journal of imaging informatics in medicine. 2025 Jan 27. Epub 2025 Jan 27. PMID: 39871042
- Zhou Y, Ong H, Kennedy P, Wu CC, Kazam J, Hentel K, Flanders A, Shih G, Peng Y. Evaluating GPT-V4 (GPT-4 with Vision) on Detection of Radiologic Findings on Chest Radiographs. Radiology. 2024 May;311(2):e233270. PMID: 38713028
- Flanders AE, Wang X, Wu CC, Kitamura FC, Shih G, Mongan J, Peng Y. The Evolution of Radiology Image Annotation in the Era of Large Language Models. Radiology. Artificial intelligence. 2025 Jul;7(4):e240631. PMID: 40304582
- Le D, Correa-Medero R, Tariq A, Patel B, Yano M, Banerjee I. Opportunistic Screening for Pancreatic Cancer using Computed Tomography Imaging and Radiology Reports. ArXiv. 2025 Mar 31. PMID: 40236838
- Wei Y, Wang X, Ong H, Zhou Y, Flanders A, Shih G, Peng Y. Enhancing Disease Detection in Radiology Reports Through Fine-tuning Lightweight LLM on Weak Labels. AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science. 2025 Jun 10;2025:614-623. eCollection 2025. PMID: 40502255
- Sun C, Teichman K, Zhou Y, Critelli B, Nauheim D, Keir G, Wang X, Zhong J, Flanders AE, Shih G, Peng Y. Generative Large Language Models Trained for Detecting Errors in Radiology Reports. Radiology. 2025 May;315(2):e242575. PMID: 40392090
- Lin M, Wang S, Ding Y, Zhao L, Wang F, Peng Y. An empirical study of using radiology reports and images to improve intensive care unit mortality prediction. JAMIA open. 2025 Feb 20;8(1):ooae137. doi: 10.1093/jamiaopen/ooae137. eCollection 2025 Feb. PMID: 39980476