Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank J. Reddi, Satyen Kale, Sanjiv Kumar: Efficient Stagewise Pretraining via Progressive Subnetworks. CoRR abs/2402.05913 (2024)