Technique · inference
Continuous Batching
A scheduling technique that adds/removes requests at the iteration level rather than the batch level, dramatically increasing throughput for LLM serving.
Origin: Seoul National University, 2022-07Read origin paper →Also known as: Dynamic Batching, Iteration-level scheduling
0
Products deploying
—
Avg research → prod
—
First commercial deploy
Deployment timeline
No verified deployments yet in our tracked product set.