Large language models improve transferability of electronic health record-based predictions across countries and coding systems.

 0 Người đánh giá. Xếp hạng trung bình 0

Tác giả: Matteo Ferro, Andrea Ganna, Matthias Kirchler, Christoph Lippert, Veronica Lorenzini

Ngôn ngữ: eng

Ký hiệu phân loại:

Thông tin xuất bản: United States : medRxiv : the preprint server for health sciences , 2025

Mô tả vật lý:

Bộ sưu tập: NCBI

ID: 680597

Variation in medical practices and reporting standards across healthcare systems limits the transferability of prediction models based on structured electronic health record (EHR) data. We introduce GRASP, a novel transformer-based architecture that enhances the generalizability of EHR-based prediction by embedding medical codes into a unified semantic space using a large language model. We applied GRASP to predict the onset of 21 diseases and all-cause mortality in over one million individuals from UK Biobank (UK), FinnGen (Finland) and Mount Sinai (USA), all harmonized to OMOP common data model. Trained on the UK Biobank and evaluated in FinnGen and Mount Sinai, GRASP achieved an average ΔC-index that was 83% and 35% higher than language-unaware models, respectively. GRASP also showed significantly higher correlations with polygenic risk scores for 62% of diseases. Notably, GRASP mantained robust performance even when datasets were not harmonized to the same data model, accurately predicting disease risk from ICD-10-CM codes without direct mappings to OMOP. GRASP enables accurate and transferable disease predictions across heterogeneous healthcare systems with minimal resource requirements.
Tạo bộ sưu tập với mã QR

THƯ VIỆN - TRƯỜNG ĐẠI HỌC CÔNG NGHỆ TP.HCM

ĐT: (028) 36225755 | Email: tt.thuvien@hutech.edu.vn

Copyright @2024 THƯ VIỆN HUTECH