Zhuohua Li, Ruyun Wang, Fuqing Zhu, Jizhong Han, Songlin Hu: Pyramidal Cross-Modal Transformer with Sustained Visual Guidance for Multi-Label Image Classification. ICMR 2024: 740-748