Guoyang Chen, Xipeng Shen: Free launch: optimizing GPU dynamic kernel launches through thread reuse. MICRO 2015: 407-419