Technique · alignment
Identity Preference Optimization (IPO)
A preference-optimization variant that avoids DPO's over-fitting by adding an explicit regularizer.
0
Products deploying
—
Avg research → prod
—
First commercial deploy
Deployment timeline
No verified deployments yet in our tracked product set.