Yufan Li, Jialiang Mao, Iavor Bojinov: Balancing Risk and Reward: A Batched-Bandit Strategy for Automated Phased Release. NeurIPS 2023