Sparse Bernoulli mixture modeling with negative-unlabeled data: an approach to identify and characterize long COVID.

 0 Người đánh giá. Xếp hạng trung bình 0

Tác giả: Tingyi Cao, Andrea S Foulkes, Harrison T Reeder

Ngôn ngữ: eng

Ký hiệu phân loại:

Thông tin xuất bản: England : Biometrics , 2025

Mô tả vật lý:

Bộ sưu tập: NCBI

ID: 699369

SARS-CoV-2-infected individuals have reported a diverse collection of persistent and often debilitating symptoms commonly referred to as long COVID or post-acute sequelae of SARS-CoV-2 (PASC). Identifying PASC and its subphenotypes is challenging because available data are "negative-unlabeled" as uninfected individuals must be PASC negative, but those with prior infection have unknown PASC status. Moreover, feature selection among many potentially informative characteristics can facilitate reaching a concise and easily interpretable PASC definition. Therefore, to characterize PASC and the spectrum of PASC subphenotypes while identifying a minimal set of features, we propose a Bernoulli mixture model with novel parameterization to accommodate negative-unlabeled data and Bayesian priors to induce sparsity. We present an efficient expectation-maximization algorithm for estimation, and a grid search procedure to select the number of clusters and level of sparsity. We evaluate the proposed method with a simulation study and an analysis of data on self-reported symptoms from the ongoing Researching COVID to Enhance Recovery-Adult Cohort study.
Tạo bộ sưu tập với mã QR

THƯ VIỆN - TRƯỜNG ĐẠI HỌC CÔNG NGHỆ TP.HCM

ĐT: (028) 36225755 | Email: tt.thuvien@hutech.edu.vn

Copyright @2024 THƯ VIỆN HUTECH