Yaowen Ye, Cassidy Laidlaw, Jacob Steinhardt: Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision. ICLR 2025