Adwaitvedant S. Mathkar, Vivek S. Borkar: Distributed Reinforcement Learning via Gossip. CoRR abs/1310.7610 (2013)