ARCH: Large-scale knowledge graph via aggregated narrative codified health records analysis.

 0 Người đánh giá. Xếp hạng trung bình 0

Tác giả: Edmon Begoli, Clara-Lea Bonzel, Tianrun Cai, Tianxi Cai, Kelly Cho, Lauren Costa, Ziming Gan, J Michael Gaziano, Kimberly F Greco, Yuk-Lam Ho, Chuan Hong, Katherine P Liao, Junwei Lu, George Ostrouchovm, Vidul A Panickan, Everett Rush, Shuting Shen, Jun Wen, Zongqi Xia, Xin Xiong, Zhiwei Xu, Doudou Zhou

Ngôn ngữ: eng

Ký hiệu phân loại: 809.008 History and description with respect to kinds of persons

Thông tin xuất bản: United States : Journal of biomedical informatics , 2025

Mô tả vật lý:

Bộ sưu tập: NCBI

ID: 716643

OBJECTIVE: Electronic health record (EHR) systems contain a wealth of clinical data stored as both codified data and free-text narrative notes (NLP). The complexity of EHR presents challenges in feature representation, information extraction, and uncertainty quantification. To address these challenges, we proposed an efficient Aggregated naRrative Codified Health (ARCH) records analysis to generate a large-scale knowledge graph (KG) for a comprehensive set of EHR codified and narrative features. METHODS: Using data from 12.5 million Veterans Affairs patients, ARCH first derives embedding vectors and generates similarities along with associated p-values to measure the strength of relatedness between clinical features with statistical certainty quantification. Next, ARCH performs a sparse embedding regression to remove indirect linkage between features to build a sparse KG. Finally, ARCH was validated on various clinical tasks, including detecting known relationships between entity pairs, predicting drug side effects, disease phenotyping, as well as sub-typing Alzheimer's disease patients. RESULTS: ARCH produces high-quality clinical embeddings and KG for over 60,000 codified and narrative EHR concepts. The KG and embeddings are visualized in the R-shiny powered web-API. CONCLUSION: The proposed ARCH algorithm generates large-scale high-quality semantic representations and knowledge graph for both codified and NLP EHR features, useful for a wide range of predictive modeling tasks.
Tạo bộ sưu tập với mã QR

THƯ VIỆN - TRƯỜNG ĐẠI HỌC CÔNG NGHỆ TP.HCM

ĐT: (028) 36225755 | Email: tt.thuvien@hutech.edu.vn

Copyright @2024 THƯ VIỆN HUTECH