Section 01
[Overview] Collaborative Large-Model Inference on LEO Satellites: Overcoming On-Board Resource Constraints
This paper proposes a communication-efficient collaborative inference scheme for low Earth orbit (LEO) satellite networks. It combines three techniques: model partitioning, pipeline parallelism, and adaptive activation compression. The reported results are a 42% reduction in inference latency and a 71% reduction in communication overhead, with accuracy loss kept below 1%. By spreading inference across cooperating satellites, the scheme eases the memory, power, and communication limits of a single satellite and opens a new path for on-board intelligent computing.
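
To make the three techniques concrete, the following is a minimal Python sketch, not the paper's implementation. It assumes contiguous, near-equal layer partitioning across satellites and uniform min-max quantization as the activation compressor, with the bit width picked from the inter-satellite link rate. The function names (`partition_layers`, `choose_bits`, `quantize`, `dequantize`) and the bandwidth thresholds are hypothetical and chosen only for illustration; the paper's actual partitioning and compression policies are not specified in this overview.

```python
from typing import List, Tuple


def partition_layers(num_layers: int, num_satellites: int) -> List[range]:
    """Split layer indices into contiguous, near-equal stages, one per satellite."""
    base, extra = divmod(num_layers, num_satellites)
    stages, start = [], 0
    for s in range(num_satellites):
        size = base + (1 if s < extra else 0)  # first `extra` stages get one more layer
        stages.append(range(start, start + size))
        start += size
    return stages


def choose_bits(link_mbps: float) -> int:
    """Hypothetical policy: use fewer bits per activation on slower links."""
    if link_mbps < 50:
        return 4
    if link_mbps < 200:
        return 8
    return 16


def quantize(values: List[float], bits: int) -> Tuple[List[int], float, float]:
    """Uniform min-max quantization of activations to `bits` bits."""
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, lo, scale


def dequantize(codes: List[int], lo: float, scale: float) -> List[float]:
    """Reconstruct approximate activations on the receiving satellite."""
    return [lo + c * scale for c in codes]


if __name__ == "__main__":
    stages = partition_layers(num_layers=10, num_satellites=3)
    activations = [-1.0, 0.0, 0.5, 1.0]       # output of one stage (toy data)
    bits = choose_bits(link_mbps=100.0)       # adapt to the current link
    codes, lo, scale = quantize(activations, bits)
    restored = dequantize(codes, lo, scale)   # input to the next stage
    print([list(r) for r in stages], bits, restored)
```

In a pipelined setting, each satellite would run its stage on one micro-batch while the previous stage processes the next one, so the compressed activations are the only data crossing the inter-satellite link. The quantization error per value is bounded by half the step size `scale`, which is the knob that trades communication volume against accuracy.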