Section 01
[Introduction] Protein Large Language Models (pLLMs) Assist Cross-Species Single-Cell Transcriptome Integration: A New Paradigm for Gene Homology Mapping
Title: Protein Large Language Models (pLLMs) Assist Cross-Species Single-Cell Transcriptome Integration: A New Paradigm for Gene Homology Mapping
Abstract: This article introduces a cross-species single-cell transcriptome integration method based on Protein Large Language Models (pLLMs), which achieves gene homology mapping through protein sequence embedding, providing a new tool for comparative genomics and evolutionary biology research.
Keywords: Protein language model, cross-species integration, single-cell transcriptome, gene homology mapping, ESM-2, computational biology, comparative genomics
Original author/maintainer: KKzhongyi
Source platform: GitHub
Original title: pLLM-cross-species-integration
Original link: https://github.com/KKzhongyi/pLLM-cross-species-integration
Source release time/update time: 2026-05-27T14:14:22Z
Core idea: This project proposes a cross-species single-cell transcriptome integration method built on protein large language models (e.g., ESM-2). Genes from different species are mapped to one another by comparing embeddings of their protein sequences. This addresses two limitations of traditional methods, namely their reliance on homology databases and their neglect of functional conservation, and provides a new tool for comparative genomics and evolutionary biology.
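To make the idea of embedding-based homology mapping concrete, here is a minimal sketch. It rests on several assumptions that do not come from the project itself: each gene is taken to already have one fixed-length protein embedding (for example, a pooled ESM-2 representation computed elsewhere), cosine similarity is used as the comparison metric, and candidate homolog pairs are chosen with a mutual-best-hit rule. The function names, the `min_sim` threshold, and the toy gene names and vectors are all hypothetical; the repository may use a different matching strategy.

```python
import numpy as np


def cosine_similarity_matrix(emb_a, emb_b):
    """Return the (n_a, n_b) matrix of cosine similarities between rows."""
    a_unit = emb_a / np.linalg.norm(emb_a, axis=1, keepdims=True)
    b_unit = emb_b / np.linalg.norm(emb_b, axis=1, keepdims=True)
    return a_unit @ b_unit.T


def mutual_best_hits(genes_a, emb_a, genes_b, emb_b, min_sim=0.8):
    """Pair genes across two species when each is the other's nearest
    neighbor in embedding space and the similarity reaches min_sim.

    min_sim is an illustrative threshold, not a recommended value.
    """
    sim = cosine_similarity_matrix(emb_a, emb_b)
    best_b_for_a = sim.argmax(axis=1)  # nearest species-B gene per species-A gene
    best_a_for_b = sim.argmax(axis=0)  # nearest species-A gene per species-B gene
    pairs = []
    for i, j in enumerate(best_b_for_a):
        if best_a_for_b[j] == i and sim[i, j] >= min_sim:
            pairs.append((genes_a[i], genes_b[j], float(sim[i, j])))
    return pairs


# Toy data: 3-dimensional stand-ins for real protein embeddings (hypothetical).
species_a_genes = ["A_gene1", "A_gene2", "A_gene3"]
species_b_genes = ["B_gene1", "B_gene2"]
species_a_emb = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])
species_b_emb = np.array([
    [0.9, 0.1, 0.0],
    [0.0, 0.2, 0.9],
])

pairs = mutual_best_hits(species_a_genes, species_a_emb,
                         species_b_genes, species_b_emb)
for gene_a, gene_b, score in pairs:
    print(f"{gene_a} <-> {gene_b} (cosine similarity {score:.3f})")
```

In this toy case `A_gene2` stays unpaired because no species-B gene selects it as its nearest neighbor. The resulting pairs could then define a shared gene feature space in which expression matrices from the two species are aligned before integration.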