Click-Through Rate (CTR) prediction is a core task in online recommendation systems, used to estimate the probability of users clicking on ads or products. With the diversification of e-commerce platform businesses—covering multiple vertical domains such as online shopping, ride-hailing, food delivery, and professional services—recommendation systems need to have cross-domain prediction capabilities, namely Multi-Domain CTR Prediction (MDCTR).
However, traditional multi-domain CTR prediction faces two core challenges:
Challenge 1: Lack of Semantic Representation for Domains
Traditional MDCTR models usually encode domains as discrete identifiers (e.g., domain_id=1,2,3), completely ignoring the rich semantic relationships between domains. For example, "books" and "electronics" have significant differences in product attributes and user behavior patterns, but traditional models cannot capture these semantic differences and commonalities. This makes it difficult for models to generalize to new domains not seen during training.
Challenge 2: Seesaw Phenomenon
In multi-domain joint training, models are often biased by certain dominant domains with large data volumes, leading to a significant drop in performance in other domains. This seesaw effect of trade-offs severely restricts the practical application effect of multi-domain models.