Effective Gene Expression Prediction and Optimization from Protein Sequences.

 0 Người đánh giá. Xếp hạng trung bình 0

Tác giả: Han Gao, Feifei Guan, Huoqing Huang, Yanjun Li, Bo Liu, Tuoyu Liu, Huiying Luo, Jian Tian, Tao Tu, Pengtao Wang, Ningfeng Wu, Guoshun Xu, Bin Yao, Yiyang Zhang

Ngôn ngữ: eng

Ký hiệu phân loại: 599.073 Collections of living mammals

Thông tin xuất bản: Germany : Advanced science (Weinheim, Baden-Wurttemberg, Germany) , 2025

Mô tả vật lý:

Bộ sưu tập: NCBI

ID: 641468

High soluble protein expression in heterologous hosts is crucial for various research and applications. Despite considerable research on the impact of codon usage on expression levels, the relationship between protein sequence and expression is often overlooked. In this study, a novel connection between protein expression and sequence is uncovered, leading to the development of SRAB (Strength of Relative Amino Acid Bias) based on AEI (Amino Acid Expression Index). The AEI served as an objective measure of this correlation, with higher AEI values enhancing soluble expression. Subsequently, the pre-trained protein model MP-TRANS (MindSpore Protein Transformer) is developed and fine-tuned using transfer learning techniques to create 88 prediction models (MPB-EXP) for predicting heterologous expression levels across 88 species. This approach achieved an average accuracy of 0.78, surpassing conventional machine learning methods. Additionally, a mutant generation model, MPB-MUT, is devised and utilized to enhance expression levels in specific hosts. Experimental validation demonstrated that the top 3 mutants of xylanase (previously not expressed in Escherichia coli) successfully achieved high-level soluble expression in E. coli. These findings highlight the efficacy of the developed model in predicting and optimizing gene expression based on protein sequences.
Tạo bộ sưu tập với mã QR

THƯ VIỆN - TRƯỜNG ĐẠI HỌC CÔNG NGHỆ TP.HCM

ĐT: (028) 36225755 | Email: tt.thuvien@hutech.edu.vn

Copyright @2024 THƯ VIỆN HUTECH