OneRanker: Tencent's Unified Model for Advertising Recommendation Shows 1.34% GMV Lift
March 2026 — In industrial-scale advertising systems, a fundamental tension exists between recommending items a user might like and items that drive business value. Traditional cascaded architectures—where a generation stage retrieves candidates and a ranking stage orders them—often suffer from irreversible information loss between stages. Single-stage unified models, meanwhile, struggle with optimization conflicts between competing objectives.
Researchers from Tencent propose a solution in their paper "OneRanker: Unified Generation and Ranking with One Model in Industrial Advertising Recommendation," which has been fully deployed on Tencent's WeiXin (WeChat) channels advertising system. The model demonstrates a 1.34% increase in Gross Merchandise Volume (GMV), a significant lift for a platform of that scale.
The Core Innovation: Architectural-Level Deep Integration
OneRanker is not merely another multi-task model. It is designed as an architectural-level deep integration of the generation and ranking stages within a single transformer-based framework. The goal is to maintain the efficiency of a unified model while solving three practical deployment challenges:
- Misalignment between interest and value: Maximizing user engagement does not always maximize revenue or conversion.
- Target-agnostic generation: Pure generative processes often lack explicit signals for downstream business metrics.
- Disconnection between stages: Separated pipelines cause information loss; fused pipelines cause optimization tension.
Key Technical Mechanisms
The paper details three novel components that enable this integration:
1. Value-Aware Multi-Task Decoupling Architecture
Instead of forcing all tasks to share the exact same representation space, OneRanker uses task token sequences and a causal mask to separate the "interest coverage" and "value optimization" spaces within the model's shared layers. This allows the model to learn shared foundational representations while maintaining dedicated pathways for different objectives, effectively alleviating target conflicts.
2. Coarse-to-Fine Collaborative Target Awareness
To bridge the gap between generative candidate creation and final ranking, the system employs a two-phase awareness mechanism:
- Implicit Awareness (Generation): Uses "Fake Item Tokens" during the generation stage to incorporate preliminary value signals without disrupting the generative flow.
- Explicit Alignment (Ranking): A dedicated ranking decoder then performs explicit value alignment at the individual candidate level, ensuring the final ordered list optimizes for business KPIs.
3. Input-Output Dual-Side Consistency Guarantees
To ensure the generation and ranking stages are collaboratively optimized end-to-end, the researchers introduce consistency constraints:
- Key/Value Pass-Through Mechanisms: Allow information to flow seamlessly between the generation and ranking components within the model's attention layers.
- Distribution Consistency (DC) Constraint Loss: A novel loss function that aligns the probability distributions of the generation and ranking outputs, preventing them from diverging and ensuring the ranked list reflects the generative intent.
Industrial Deployment and Results
The model was deployed in the advertising recommendation system for Tencent's WeiXin Channels, a major social media and content platform. The reported results show a +1.34% improvement in GMV for the "Normal" traffic bucket (likely the main experimental group). This metric is a direct measure of the total sales value generated through the ads, making it a critical business indicator.

The success underscores the paper's contribution: providing "a new paradigm with industrial feasibility for generative advertising recommendations." It moves beyond academic proof-of-concept to demonstrate a scalable architecture that reconciles user experience with business performance.
The Broader Context: A Shift in Recommendation Systems
This work is part of a broader trend highlighted in the paper: "The end-to-end generative paradigm is revolutionizing advertising recommendation systems." The industry is actively moving away from brittle, multi-stage cascaded systems toward more fluid, unified models. Recent related research on arXiv, such as work on "modeling evolving user interests," complements this direction by focusing on better understanding the user side of the equation.

OneRanker represents a significant step in this evolution by tackling the core engineering challenge of multi-objective optimization within a single, deployable model.


