Jiecheng Lu, Shihao Yang: Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting. ICML 2025