Scaling Structure Aware Virtual Screening to Billions of Molecules with SPRINT.


Authors: Abhinav K Adduri, Monica T Dayao, Caleb N Ellington, David R Koes, Andrew T McNutt, Hosein Mohimani, Eric P Xing

Language: English

Classification number: 373.1 Organization and activities in secondary education

Publication information: United States: arXiv, 2025


Collection: NCBI

ID: 643100

Virtual screening of small molecules against protein targets can accelerate drug discovery and development by predicting drug-target interactions (DTIs). However, structure-based methods like molecular docking are too slow to allow for broad proteome-scale screens, limiting their application in screening for off-target effects or new molecular mechanisms. Recently, vector-based methods using protein language models (PLMs) have emerged as a complementary approach that bypasses explicit 3D structure modeling. Here, we develop SPRINT, a vector-based approach for screening entire chemical libraries against whole proteomes for DTIs and novel mechanisms of action. SPRINT improves on prior work by using a self-attention-based architecture and structure-aware PLMs to learn a co-embedding space for drugs and targets, enabling efficient binder prediction, search, and retrieval. SPRINT achieves state-of-the-art enrichment factors in virtual screening on LIT-PCBA, DTI classification benchmarks, and binding affinity prediction benchmarks, while providing interpretability in the form of residue-level attention maps. In addition to being both accurate and interpretable, SPRINT is ultra-fast: querying the whole human proteome against the ENAMINE Real Database (6.7B drugs) for the 100 most likely binders per protein takes 16 minutes. SPRINT promises to enable virtual screening at an unprecedented scale, opening up new opportunities for
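
The retrieval step described in the abstract (ranking molecules against a protein in a shared embedding space and keeping the most likely binders) can be illustrated with a minimal sketch. This is not SPRINT's implementation: the use of cosine similarity as the score, the function names `cosine_similarity` and `top_k_binders`, the molecule ids, and the three-dimensional toy vectors are all assumptions made for illustration. In practice the embeddings would come from trained drug and target encoders, and a library of billions of molecules would need a vectorized or indexed search rather than a Python loop.

```python
import heapq
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_u * norm_v)


def top_k_binders(protein_vec, molecule_vecs, k):
    """Return the ids of the k molecules scoring highest against protein_vec.

    molecule_vecs maps a (hypothetical) molecule id to its embedding.
    Results are ordered from highest to lowest score.
    """
    scored = (
        (cosine_similarity(protein_vec, vec), mol_id)
        for mol_id, vec in molecule_vecs.items()
    )
    return [mol_id for _, mol_id in heapq.nlargest(k, scored)]


# Toy, hand-made embeddings (assumption: real ones come from trained encoders).
protein = [1.0, 0.0, 0.0]
library = {
    "mol_a": [0.9, 0.1, 0.0],
    "mol_b": [0.0, 1.0, 0.0],
    "mol_c": [0.5, 0.5, 0.0],
}

ranked = top_k_binders(protein, library, k=2)
print(ranked)
```

Because each molecule and each protein is embedded once and then compared with a cheap vector operation, the cost per protein-molecule pair is far lower than running a docking simulation, which is the property that makes proteome-scale screens plausible for vector-based methods.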
