Section 01
Introduction: GXQ-Create — A Multimodal Virus Host Prediction Tool Integrating Genomic Features and Protein Language Models
GXQ-Create is an innovative multimodal virus host prediction tool that combines k-mer genomic features with the ESM-2 protein language model, using a late-fusion SVM architecture, and achieves a cross-validation accuracy of 96.4% in eukaryotic host prediction. This tool is of great value for preventing cross-species virus transmission and assessing the risk of emerging infectious diseases.