Vivek Deulkar, Jayakrishnan Nair: Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning. SmartGridComm 2021: 83-88