ScatTR: Estimating the Size of Long Tandem Repeat Expansions from Short-Reads.

 0 Người đánh giá. Xếp hạng trung bình 0

Tác giả: Rashid Al-Abri, Gamze Gürsoy

Ngôn ngữ: eng

Ký hiệu phân loại: 202.112 Attributes of God, of the gods

Thông tin xuất bản: United States : bioRxiv : the preprint server for biology , 2025

Mô tả vật lý:

Bộ sưu tập: NCBI

ID: 692464

Tandem repeats (TRs) are sequences of DNA where two or more base pairs are repeated back-to-back at specific locations in the genome. The expansions of TRs are implicated in over 50 conditions, including Friedreich's ataxia, autism, and cancer. However, accurately measuring the copy number of TRs is challenging, especially when their expansions are larger than the fragment sizes used in standard short-read genome sequencing. Here we introduce ScatTR, a novel computational method that leverages a maximum likelihood framework to estimate the copy number of large TR expansions from short-read sequencing data. ScatTR calculates the likelihood of different alignments between sequencing reads and reference sequences that represent various TR lengths and employs a Monte Carlo technique to find the best match. In simulated data, ScatTR outperforms state-of-the-art methods, particularly for TRs with longer motifs and those with lengths that greatly exceed typical sequencing fragment sizes. When applied to data from the 1000 Genomes Project, ScatTR detected potential large TR expansions that other methods missed, highlighting its ability to better identify genome-wide characterization of TR variation. ScatTR can be accessed via: https://github.com/g2lab/scattr.
Tạo bộ sưu tập với mã QR

THƯ VIỆN - TRƯỜNG ĐẠI HỌC CÔNG NGHỆ TP.HCM

ĐT: (028) 36225755 | Email: tt.thuvien@hutech.edu.vn

Copyright @2024 THƯ VIỆN HUTECH