Joshua Davidson, Christopher Archibald, Michael Bowling: Baseline: practical control variates for agent evaluation in zero-sum domains. AAMAS 2013: 1005-1012