Drug-Target Interaction (DTI) prediction is one of the core components of drug discovery. While traditional high-throughput screening methods are effective, they are costly and time-consuming. Computational methods face the following challenges:
Data Heterogeneity: Drug molecules are usually represented as SMILES strings or molecular graphs, while target proteins are presented as amino acid sequences or 3D structures. These two distinct data modalities require special fusion strategies.
Scarcity of Labeled Data: Experimentally validated DTI data is relatively limited. How to train models with strong generalization capabilities on limited data is a key issue.
Interpretability Requirements: Drug discovery requires understanding the biological mechanisms behind model predictions, which black-box models struggle to meet.
Uncertainty Quantification: In practical applications, knowing the confidence of model predictions is crucial for decision-making, especially in the field of drug development involving human health.