"CRPO: Confidence-Reward Driven Preference Optimization for Machine ..."

Guofeng Cui et al. (2025)

Details and statistics

DOI: 10.18653/V1/2025.FINDINGS-ACL.31

access: open

type: Conference or Workshop Paper

metadata version: 2026-06-10