Robust DPO with Stochastic Negatives Improves Multimodal Sequential Recommendations

📰 Dev.to · gentic news

New research introduces RoDPO, a method that improves recommendation ranking by using stochastic sampling from a dynamic candidate pool for negative s

Published 2 Apr 2026