Section 01
Introduction: Protein Large Language Models Facilitate Cross-Species Single-Cell Transcriptome Integration
Core Overview
This project was developed by KKzhongyi and released on GitHub (original title: pLLM-cross-species-integration; link: https://github.com/KKzhongyi/pLLM-cross-species-integration; release date: 2026-05-27). It uses the protein large language model ESM2 to map homologous genes across species, and it provides a complete workflow with five different strategies. The goal is to resolve the gene naming differences that complicate cross-species integration of single-cell transcriptome data.

Keywords: protein language model, ESM2, single-cell, transcriptomics, cross-species, gene homologue, bioinformatics
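
To make the idea of embedding-based homology mapping from the overview concrete, here is a minimal sketch. It is not taken from the repository and does not represent any of its five strategies. It assumes each gene's protein has already been reduced to a fixed-length embedding (for example, a pooled ESM2 output) and pairs genes across two species by reciprocal best cosine similarity. The function names, the reciprocal-best-hit rule, and the three-dimensional toy vectors are all illustrative assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def best_match(query, candidates):
    """Name of the candidate whose embedding is most similar to `query`."""
    return max(candidates, key=lambda name: cosine(query, candidates[name]))


def reciprocal_best_hits(species_a, species_b):
    """Keep a gene pair only if each gene is the other's best match."""
    pairs = {}
    for gene_a, emb_a in species_a.items():
        gene_b = best_match(emb_a, species_b)
        if best_match(species_b[gene_b], species_a) == gene_a:
            pairs[gene_a] = gene_b
    return pairs


# Toy embeddings (hypothetical values). Real protein embeddings would have
# hundreds or thousands of dimensions.
human = {"GAPDH": [0.9, 0.1, 0.0], "ACTB": [0.1, 0.9, 0.1], "CD3E": [0.0, 0.2, 0.9]}
mouse = {"Gapdh": [0.8, 0.2, 0.1], "Actb": [0.2, 0.8, 0.0], "Cd3e": [0.1, 0.1, 0.95]}

mapping = reciprocal_best_hits(human, mouse)
print(mapping)
```

The resulting mapping could then be used to rename one species' gene columns to the other's before integration. Matching on sequence-derived embeddings rather than on gene symbols is what lets the pairing work even when the two species use different naming conventions.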