Multidimensional scaling improves distance-based clustering for microbiome data.


Authors: Guanhua Chen, Qiang Sun, Zheng-Zheng Tang, Xinyue Wang

Language: eng

Classification number:

Publication information: England: Bioinformatics (Oxford, England), 2025

Physical description:

Collection: NCBI

ID: 90850

MOTIVATION: Clustering patients into subgroups based on their microbial compositions can greatly enhance our understanding of the role of microbes in human health and disease etiology. Distance-based clustering methods, such as partitioning around medoids (PAM), are popular due to their computational efficiency and absence of distributional assumptions. However, the performance of these methods can be suboptimal when true cluster memberships are driven by differences in the abundance of only a few microbes, a situation known as the sparse signal scenario.

RESULTS: We demonstrate that classical multidimensional scaling (MDS), a widely used dimensionality reduction technique, effectively denoises microbiome data and enhances the clustering performance of distance-based methods. We propose a two-step procedure that first applies MDS to project high-dimensional microbiome data into a low-dimensional space, followed by distance-based clustering using the low-dimensional data. Our extensive simulations demonstrate that our procedure offers superior performance compared to directly conducting distance-based clustering under the sparse signal scenario. The advantage of our procedure is further showcased in several real data applications.

AVAILABILITY AND IMPLEMENTATION: The R package MDSMClust is available at https://github.com/wxy929/MDS-project.
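
The two-step procedure described in the abstract (classical MDS first, then distance-based clustering on the low-dimensional coordinates) can be illustrated with a minimal Python sketch. This is not the authors' MDSMClust implementation and does not reproduce its interface. The function names, the choice of Bray-Curtis dissimilarity, the default of two MDS dimensions, and the simple alternating k-medoids routine (used here in place of the full PAM swap algorithm) are all assumptions made for illustration.

```python
import numpy as np

def bray_curtis(X):
    """Pairwise Bray-Curtis dissimilarity between rows of a nonnegative matrix.
    Assumption: every sample (row) has a nonzero total abundance."""
    num = np.abs(X[:, None, :] - X[None, :, :]).sum(axis=2)
    den = (X[:, None, :] + X[None, :, :]).sum(axis=2)
    return num / den

def classical_mds(D, n_components=2):
    """Classical (Torgerson) MDS: double-center the squared distances,
    then keep the leading eigenvectors scaled by sqrt(eigenvalue)."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n        # centering matrix
    B = -0.5 * J @ (D ** 2) @ J                # Gram matrix estimate
    eigvals, eigvecs = np.linalg.eigh(B)       # ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:n_components]
    lam = np.clip(eigvals[order], 0.0, None)   # drop negative eigenvalues
    return eigvecs[:, order] * np.sqrt(lam)

def k_medoids(D, k, n_iter=100, seed=0):
    """Simple alternating k-medoids on a distance matrix.
    Note: a simplified stand-in for PAM, not the PAM swap algorithm."""
    rng = np.random.default_rng(seed)
    medoids = rng.choice(D.shape[0], size=k, replace=False)
    for _ in range(n_iter):
        labels = np.argmin(D[:, medoids], axis=1)   # assign to nearest medoid
        new_medoids = medoids.copy()
        for j in range(k):
            members = np.flatnonzero(labels == j)
            if members.size:
                # new medoid: member with smallest total distance to its cluster
                within = D[np.ix_(members, members)].sum(axis=1)
                new_medoids[j] = members[np.argmin(within)]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    labels = np.argmin(D[:, medoids], axis=1)
    return labels, medoids

def mds_then_cluster(X, k, n_components=2):
    """Step 1: MDS on the sample-by-sample dissimilarity matrix.
    Step 2: k-medoids on Euclidean distances between the MDS coordinates."""
    D = bray_curtis(X)
    Y = classical_mds(D, n_components)
    D_low = np.sqrt(((Y[:, None, :] - Y[None, :, :]) ** 2).sum(axis=2))
    return k_medoids(D_low, k)
```

Calling `mds_then_cluster(X, k=2)` on a samples-by-taxa abundance matrix `X` returns a cluster label per sample and the indices of the medoid samples. The number of retained MDS dimensions acts as the denoising knob in this sketch; how to choose it is not stated in the abstract and would need to be taken from the paper or the package.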