


default search action
64th ACL 2026: San Diego, California, USA - Long Papers
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens:

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2026, San Diego, California, United States, July 2-7, 2026. Association for Computational Linguistics 2026, ISBN 979-8-89176-390-6 - Frontfatter.

- Pan Lu, Bowen Chen, Sheng Liu, Rahul Thapa, Joseph Boen, James Zou:

OctoTools: A Multi-Agent Framework with Extensible Tools for Complex Reasoning. 1-86 - Jimin Jung, MyoungJin Kim, Jaehyung Seo, Heuiseok Lim:

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand. 87-116 - Chengwu Liu, Yichun Yin, Ye Yuan, Jiaxuan Xie, Botao Li, Siqi Li, Jianhao Shen, Yan Xu, Lifeng Shang, Ming Zhang:

Discover and Prove: An Open-source Agentic Framework for Hard Mode Automated Theorem Proving in Lean 4. 117-133 - Chung-ju Huang, Huiqiang Zhao, Yuanpeng He, Lijian Li, Wenpin Jiao, Zhi Jin, Peixuan Chen, Leye Wang:

Your Inference Request Will Become a Black Box: Confidential Inference for Cloud-based Large Language Models. 134-154 - Louie Hong Yao, Vishesh Anand, Yuan Zhuang, Tianyu Jiang:

Rhetorical Questions in LLM Representations: A Linear Probing Study. 155-172 - Yifu Chen, Shengpeng Ji, Zhengqing Liu, Qian Chen, Wen Wang, Ziqing Wang, Yangzhuo Li, Tianle Liang, Zhou Zhao:

Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models. 173-208 - Daria Kryvosheieva, Andrea Gregor de Varda, Evelina Fedorenko, Greta Tuckute:

Different types of syntactic agreement recruit the same units within large language models. 209-227 - Mengshi Chen, Yuxiang Sun, Tengchao Li, Jianwei Wang, Kai Wang, Xuemin Lin, Ying Zhang, Wenjie Zhang:

Empowering Tabular Data Preparation with Language Models: Why and How? 228-246 - Zhiyuan Fan, Guanqiao Chen, Yanyi Huang, Mingkuan Zhao, Dadi Guo, Yi R. Fung:

Learning Diverse Responses with Prefix-Conditioned Supervised Fine-Tuning. 247-276 - Myunghoon Kang, Dahyun Jung, Suhyune Son, Seonmin Koo, Changwoo Chun, Daniel Rim, Haeyoung Kwon, Yuna Hur, Heuiseok Lim:

EASE: Entity-Aware Sub-table Generation for Real-world Multi-table QA. 277-302 - Yizhen Yuan, Rui Kong, Dongze Li, Yuanchun Li, Yunxin Liu:

Benchmarking LLM's Capability in Reasoning over Conflicting Web References. 303-322 - Qianli Wang, Van Bach Nguyen, Yihong Liu, Fedor Splitt, Nils Feldhus, Christin Seifert, Hinrich Schütze, Sebastian Möller, Vera Schmitt:

Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation. 323-346 - Seungyoon Lee, Minhyuk Kim, Seongtae Hong, Youngjoon Jang, Dongsuk Oh, Heuiseok Lim:

CLEAR: Cross-Lingual Enhancement in Retrieval via Reverse-training. 347-362 - Chenming Tang, Yutong Yang, Kexue Wang, Yunfang Wu:

Aligning Language Models with Real-time Knowledge Editing. 363-378 - Jiho Choi, Seojeong Park, Seongjong Song, Hyunjung Shim:

PosterForest: Hierarchical Multi-Agent Collaboration for Scientific Poster Generation. 379-401 - Lukas Helff, Ahmad Omar, Felix Friedrich, Antonia Wüst, Hikaru Shindo, Rupert Mitchell, Tim Woydt, Patrick Schramowski, Wolfgang Stammer, Kristian Kersting:

SLR: Automated Synthesis for Scalable Logical Reasoning. 402-426 - Niclas Doll, Jasper Schulze Buschhoff, Shalaka Satheesh, Hammam Abdelwahab, Héctor Allende-Cid, Katrin Klug:

Can Continual Pretraining Bridge the Performance Gap between General-purpose and Specialized Language Models in the Medical Domain? 427-444 - Jie He, Nan Hu, Wanqiu Long, Jiaoyan Chen, Jeff Z. Pan:

MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Long-tail Knowledge. 445-479 - Adam Storek, Mukur Gupta, Samira Hajizadeh, Prashast Srivastava, Suman Jana:

Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Understanding. 480-498 - Pierluigi Cassotti, Naomi Baes, Stefano De Pascale, Jáder Martins Camboim de Sá, Francesco Periti, Nick Haslam, Dirk Geeraerts, Nina Tahmasebi:

SenseRel: A Sense-Level Benchmark for Denotational and Connotational Meaning Relations. 499-515 - Yifei He, Pranit Chawla, Yaser Souri, Subhojit Som, Xia Song:

WebSTAR: Scalable Data Synthesis for Computer Use Agents with Step-Level Filtering. 516-533 - Behrooz Azarkhalili, Linyi Li, Maxwell W. Libbrecht:

PR-XAI: PageRank-Based Feature Attribution for Transformers. 534-554 - Shengli Zhou, Xiangchen Wang, Guanhua Chen, Feng Zheng:

CAPruner: Conceptual-Adjacent Scene Graph Pruner for Enhancing 3D Spatial Reasoning of Large Language Models. 555-567 - Angelo Ortiz Tandazo, Manel Khentout, Youssef Benchekroun, Thomas Hueber, Emmanuel Dupoux:

MauBERT: Universal Phonetic Inductive Biases for Few-Shot Acoustic Units Discovery. 568-585 - Zi-Ao Ma, Xian-Ling Mao, Tian Lan, Chen Xu, Zhijing Wu:

Your Reasoning Model Knows What Counts: Self-Guided Chain-of-Thought Pruning for Efficient Reasoning. 586-605 - Ching-Yun Ko, Payel Das, Sihui Dai, Georgios Kollias, Subhajit Chaudhury, Aurélie C. Lozano, Pin-Yu Chen:

ImReasoner: Improving Memory-based Language Models for Reasoning-in-a-Haystack Tasks. 606-622 - Zidi Xiong, Yuping Lin, Wenya Xie, Pengfei He, Zirui Liu, Jiliang Tang, Himabindu Lakkaraju, Zhen Xiang:

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior. 623-645 - Tianyang Zhou, Ziyi Zhang, Haowen Lin, Somesh Jha, Mihai Christodorescu, Kirill Levchenko, Varun Chandrasekaran:

SACTOR: LLM-Driven Correct and Idiomatic C to Rust Translation with Static Analysis and FFI-Based Verification. 646-673 - Fengbo Ma, Zixin Rao, Xiaoting Li, Zhetao Chen, Hongyue Sun, Yiping Zhao, Xianyan Chen, Zhen Xiang:

IntrAgent: An LLM Agent for Content-Grounded Information Retrieval through Literature Review. 674-715 - Can Jin, Rui Wu, Tong Che, Qixin Zhang, Hongwu Peng, Jiahui Zhao, Zhenting Wang, Wenqi Wei, Ligong Han, Zhao Zhang, Yuan Cao, Ruixiang Tang, Dimitris N. Metaxas:

Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety. 716-749 - Maoxiao Ye, Xinfeng Ye, Sathiamoorthy Manoharan:

Hybrid Autoregressive-Diffusion Model for Real-Time Sign Language Production. 750-763 - Zhiqing Cui, Binwu Wang, Qingxiang Liu, Yeqiang Wang, Zhengyang Zhou, Yuxuan Liang, Yang Wang:

Augur: Modeling Covariate Causal Associations in Time Series via Large Language Models. 764-787 - Zheyuan Zhang, Kaiwen Shi, Zhengqing Yuan, Zehong Wang, Tianyi Ma, Keerthiram Murugesan, Vincent Galassi, Chuxu Zhang, Yanfang Ye:

AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering. 788-809 - Haonan Chen, Hong Liu, Yuping Luo, Liang Wang, Nan Yang, Furu Wei, Zhicheng Dou:

MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings. 810-823 - Yaxun Dai, Wenxuan Xie, Xialie Zhuang, Tianyu Yang, Ziyi Liu, Haiqin Yang, Yiying Yang, Yuhang Zhao, Pingfu Chao, Wenhao Jiang:

ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL. 824-847 - Tingfeng Hui, Pengyu Zhu, Bowen Ping, Ling Tang, Guanting Dong, Yaqi Zhang, Sen Su:

DecIF: Improving Instruction-Following through Decomposition. 848-867 - Lingfeng Zhang, Xiaoshuai Hao, Yingbo Tang, Haoxiang Fu, Xinyu Zheng, Pengwei Wang, Zhongyuan Wang, Wenbo Ding, Shanghang Zhang:

NavA³: Understanding Any Instruction, Navigating Anywhere, Finding Anything. 868-878 - Gen Li, Peiyu Liu:

FastV-RAG: Towards Fast and Fine-Grained Video QA with Retrieval-Augmented Generation. 879-889 - Ruiyi Yan, Yugo Murawaki:

Efficient Provably Secure Linguistic Steganography via Range Coding. 890-907 - Kai Zou, Ziqi Huang, Yuhao Dong, Shulin Tian, Dian Zheng, Hongbo Liu, Jingwen He, Bin Liu, Yu Qiao, Ziwei Liu:

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark. 908-924 - Sheng Zhang, Junyi Li, Yingyi Zhang, Pengyue Jia, Yichao Wang, Xiaowei Qian, Wenlin Zhang, Maolin Wang, Yong Liu, Xiangyu Zhao:

MemSearch-o1: Empowering Large Language Models with Reasoning-Aligned Memory Growth in Agentic Search. 925-943 - Bolun Sun, Charles Chang, Yuen Yuen Ang, Ruotong Mu, Yuchen Xu, Zhengxin Zhang, Pingxu Hao:

CAPC-CG: A Large-Scale, Expert-Directed LLM-Annotated Corpus of Adaptive Policy Communication in China. 944-966 - Shuhang Chen, Hangjie Yuan, Yunqiu Xu, Pengwei Liu, Tao Feng, Jun Cen, Zeying Huang, Yi Yang:

MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems. 967-992 - Ruiyi Yan, Shiao Meng, Yugo Murawaki:

Anchored Sliding Window: Toward Robust and Imperceptible Linguistic Steganography. 993-1012 - Guangzeng Han, Xiaolei Huang:

What Makes Good Instruction-Tuning Data? An In-Context Learning Perspective. 1013-1027 - Xinyu Shi, Kairong Luo, Zhen Zheng, Wenguang Chen:

RoBSA: RoPE-based Blockwise Sparse Multi-head Latent Attention. 1028-1044 - Nuo Chen, Andre Huikai Lin, Jiaying Wu, Junyi Hou, Zining Zhang, Qian Wang, Xidong Wang, Bingsheng He:

XtraGPT: Context-Aware and Controllable Academic Paper Revision via Human-AI Collaboration. 1045-1074 - Beidan Liu, Zhengqiu Zhu, Chen Gao, Tianle Pu, Yong Zhao, Wei Qi, Quanjun Yin:

Learn to Relax with Large Language Models: Solving Constraint Optimization Problems via Bidirectional Coevolution. 1075-1094 - Guoxi Zhang, Jiawei Chen, Tianzhuo Yang, Jiaming Ji, Yaodong Yang, Juntao Dai:

A Game-Theoretica Negotiation Framework for Cross-Cultural Consensus. 1095-1134 - Zifeng Cheng, Lingyun Qian, Zhiwei Jiang, Cong Wang, Yafeng Yin, Fei Shen, Ao Zhou, Qing Gu:

Focusing Condition: Inference-Time Self-Contrastive Steering Elicits Better Conditional Text Embeddings in LLMs. 1135-1147 - Ziheng Wang, Zihao Yue, Wenxuan Wang, Qin Jin:

Exploring Attention Attractors in Large Language Models. 1148-1160 - Yulin Ou, Yu Wang, Yang Xu, Hendrik Buschmeier:

Identifying the Periodicity of Information in Natural Language. 1161-1175 - Jing Ye, Lu Xiang, Yaping Zhang, Chengqing Zong:

EmoHarbor: Evaluating Personalized Emotional Support by Simulating the User's Internal World. 1176-1202 - Jianlyu Chen, Junwei Lan, Chaofan Li, Defu Lian, Zheng Liu:

ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval. 1203-1221 - Binxian Su, Haoye Lou, Shucheng Zhu, Weikang Wang, Ying Liu, Dong Yu, Pengyuan Liu:

SPAGBias: Uncovering and Tracing Structured Spatial Gender Bias in Large Language Models. 1222-1251 - Yixiao He, Menghao Zhang, Haifeng Sun, Jing Wang, Kangheng Lin, Jinghan Wang, Chenye Xu, Pengfei Ren, Qi Qi, Jingyu Wang:

VALU: A Benchmark for Video Anomaly Temporal Localization and Understanding at Multiple Semantic Levels. 1252-1296 - Xiaoyang Yi, Jing Chen, Yuru Bao, Jian Zhang:

CoreGaze: Core Subgraph-Driven Visual Gaze Diffusion for Training-Free Referring Multimodal Large Language Models. 1297-1315 - Shakhrul Iman Siam, Tiantian Feng, Jiankun Zhang, Shrikanth Narayanan, Mi Zhang:

RespiraMFM: A Multimodal Foundation Model with Contrastive Audio-Language Alignment for Respiratory Disease Identification. 1316-1330 - Conghui Niu, Ningxin Wu, Ziran Zhao, Dong Yu, Chen Kang, Pengyuan Liu:

Beyond Detection: Evaluating Fallacy Awareness of LLMs in Interactive Scenarios. 1331-1352 - Deniz Bayazit, Aaron Mueller, Antoine Bosselut:

Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining. 1353-1377 - Zihan Zhang, Yu Bao, Xiao Ding, Tianyi Jiang, Kai Xiong:

Is EEG-to-Text Feasible in Real-World Scenarios? An In-Depth Analysis Using a Neuropsychology-Inspired Benchmark. 1378-1393 - Yuxuan Jiang, Zehua Chen, Zeqian Ju, Yusheng Dai, Weibei Dou, Jun Zhu:

ControlAudio: Tackling Text-Guided, Timing-Indicated and Intelligible Audio Generation via Progressive Diffusion Modeling. 1394-1413 - Sapir Harary, Eran Hirsch, Aviv Slobodkin, David Wan, Mohit Bansal, Ido Dagan:

PrefixNLI: Detecting Factual Inconsistencies as Soon as They Arise. 1414-1433 - Ziyang Zhou, Ziqi Liu, Yan Wang, Yiming Lin, Yangbin Chen:

RAM-SD: Retrieval-Augmented Multi-agent framework for Sarcasm Detection. 1434-1448 - Michelle Chao Chen, Moritz Miller, Bernhard Schölkopf, Siyuan Guo:

On the Emergence and Test-Time Use of Structural Information in Large Language Models. 1449-1465 - Sher Badshah, Ali Emami, Hassan Sajjad:

SAGE: A Search-AuGmented Evaluation of Large Language Models on Free-Form QA. 1466-1491 - Austin Xu, Yilun Zhou, Xuan-Phi Nguyen, Caiming Xiong, Shafiq Joty:

J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization. 1492-1511 - Xianren Zhang, Shreyas Prasad, Di Wang, Qiuhai Zeng, Suhang Wang, Wenbo Yan, Mat Hans:

A Functionality-Grounded Benchmark for Evaluating Web Agents in E-commerce Domains. 1512-1528 - Jiongxiao Wang, Qiaojing Yan, Yawei Wang, Yijun Tian, Soumya Smruti Mishra, Zhichao Xu, Megha Gandhi, Panpan Xu, Lin Lee Cheong:

Reinforcement Learning for Self-Improving Agent with Skill Library. 1529-1550 - Rui Wang, Junda Wu, Yu Xia, Tong Yu, Ruiyi Zhang, Ryan A. Rossi, Subrata Mitra, Lina Yao, Julian J. McAuley:

CachePrune: Teaching LLMs What Not to Follow via KV-Cache Editing. 1551-1570 - Parsa Hejabi, Elnaz Rahmati, Alireza Salkhordeh Ziabari, Morteza Dehghani:

Flip-Flop Consistency: Unsupervised Training for Robustness to Prompt Perturbations in LLMs. 1571-1587 - Zedian Shao, Hongbin Liu, Yuepeng Hu, Neil Zhenqiang Gong:

Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Injection. 1588-1604 - Ningyuan Yang, Kaizhu Huang:

Logic Matters in Lightweight Hallucination Classification for RAG System. 1605-1617 - Shuyao Xu, Cheng Peng, Jiangxuan Long, Weidi Xu, Wei Chu, Yuan Qi:

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning. 1618-1639 - Mayank Singh, Vikas Yadav, Shiva Krishna Reddy Malay, Shravan Nayak, Sai Rajeswar, Sathwik Tejaswi Madhusudhan, Eduardo Blanco:

Grammar Search for Multi-Agent Systems. 1640-1655 - David H. Yang, Yuxuan Zhu, Mohammad Mohammadi Amiri, Keerthiram Murugesan, Tejaswini Pedapati, Subhajit Chaudhury, Pin-Yu Chen:

ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval. 1656-1669 - Suhaib Abdurahman, Etsuko Ishii, Katerina Margatina, Divya Bhargavi, Monica Sunkara, Yi Zhang:

Explicit Trait Inference for Multi-Agent Coordination. 1670-1704 - Yuxin Xiao, Shujian Zhang, Marzyeh Ghassemi, Wenxuan Zhou:

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe. 1705-1718 - Amin Banayeeanzade, Ala N. Tak, Fatemeh Bahrani, Anahita Bolourani, Leonardo Blas, Emilio Ferrara, Jonathan Gratch, Sai Praneeth Karimireddy:

Psychological Steering in LLMs: An Evaluation of Effectiveness and Trustworthiness. 1719-1771 - Siqi Ouyang, Shuoyang Ding, Oleksii Hrinchuk, Vitaly Lavrukhin, Brian Yan, Boris Ginsburg, Lei Li:

Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech. 1772-1787 - Songtao Jiang, Yuan Wang, Ruizhe Chen, Yan Zhang, Ruilin Luo, Bohan Lei, Yeying Jin, Sibo Song, Zhibo Yang, Jimeng Sun, Jian Wu, Zuozhu Liu:

Act as you think: Reinforcing Consistent Reasoning in Medical Visual Question Answering. 1788-1805 - Tomer Ashuach, Dana Arad, Aaron Mueller, Martin Tutek, Yonatan Belinkov:

CRISP: Persistent Concept Unlearning via Sparse Autoencoders. 1806-1825 - Penghui Yang, Cunxiao Du, Fengzhuo Zhang, Haonan Wang, Tianyu Pang, Chao Du, Bo An:

LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification. 1826-1844 - Dota Tianai Dong, Yifan Luo, Po-Ya Angela Wang, Asli Özyürek, Paula Rubio-Fernández:

Using Perspectival Words Is Harder Than Vocabulary Words for Humans - and Even More So for Multimodal Language Models. 1845-1870 - Noy Sternlicht, Tom Hope:

CHIMERA: A Knowledge Base of Scientific Idea Recombinations for Research Analysis and Ideation. 1871-1905 - Houxing Ren, Mingjie Zhan, Zimu Lu, Ke Wang, Yunqiao Yang, Haotian Hou, Hongsheng Li:

Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning. 1906-1933 - Jianwen Luo, Yiming Huang, Jinxiang Meng, Fangyu Lei, Shizhu He, Xiao Liu, Shanshan Jiang, Bin Dong, Jun Zhao, Kang Liu:

GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks. 1934-1961 - Huazheng Wang, Yongcheng Jing, Haifeng Sun, Yingjie Wang, Jingyu Wang, Jianxin Liao, Dacheng Tao:

Erasing Without Remembering: Implicit Knowledge Forgetting in Large Language Models. 1962-1994 - Xingyu Zhu, Junfeng Fang, Shuo Wang, Beier Zhu, Zhicai Wang, Yonghui Yang, Xiangnan He:

Mitigating Hallucinations in Large Vision-Language Models without Performance Degradation. 1995-2009 - Yuxiang Huang, Mingye Li, Xu Han, Chaojun Xiao, Weilin Zhao, Ao Sun, Ziqi Yuan, Hao Zhou, Fandong Meng, Zhiyuan Liu:

APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention. 2010-2025 - Sultan Alrowili, Younes Samih, Abed Alhakim Freihat, Mathan Kumar Eswaran:

AraVQA: Building a New Arabic Factoid Visual Question Answering Dataset from Wikipedia. 2026-2042 - Andrew Halterman, Katherine A. Keith:

What is a protest anyway? Codebook conceptualization is still a first-order concern in LLM-era classification. 2043-2059 - Chaewan Chun, Delvin Ce Zhang, Dongwon Lee:

When Misinformation Speaks and Converses: Rethinking Fact-Checking in Audio Platforms. 2060-2075 - Jerry Huang, Siddarth Madala, Cheng Niu, Julia Hockenmaier, Tong Zhang:

Contextual Relevance and Adaptive Sampling for LLM-Based Document Reranking. 2076-2089 - Jinseok Chung, Minkyoung Song, Hyunji Jung, Namhoon Lee:

Quantifying Aleatoric Uncertainty of In-Context Learning for Robust Measure of LLM Prediction Confidence. 2090-2108 - Yanxiao Zhao, Yaqian Li, Zihao Bo, Rinyoichi Takezoe, Haojia Hui, Mo Guang, Lei Ren, Xiaolin Qin, Kaiwen Long:

SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs. 2109-2131 - Xi Xiao, Chenrui Ma, Yunbei Zhang, Chen Liu, Zhuxuanzi Wang, Yanshu Li, Lin Zhao, Guosheng Hu, Tianyang Wang, Hao Xu:

Not All Directions Matter: Towards Structured and Task-Aware Low-Rank Model Adaptation. 2132-2154 - Peixuan Zhang, Zijian Jia, Ziqi Cai, Shuchen Weng, Si Li, Boxin Shi:

ReContraster: Making Your Posters Stand Out with Regional Contrast. 2155-2171 - Xiangchen Song, Aashiq Muhamed, Yujia Zheng, Lingjing Kong, Zeyu Tang, Mona T. Diab, Virginia Smith, Kun Zhang:

Mechanistic Interpretability Should Prioritize Feature Consistency in Sparse Autoencoders. 2172-2210 - Jiawei Liu, Qisi Chen, Jianshu Zhang, Quan Liu, Defu Lian:

EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning. 2211-2226 - Xinjie Chen, Minpeng Liao, Guoxin Chen, Chengxi Li, Biao Fu, Kai Fan, Xinggao Liu:

From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization. 2227-2242 - Moshe Kimhi, Nimrod Shabtay, Raja Giryes, Chaim Baskin, Eli Schwartz:

CARES: Context-Aware Resolution Selector for VLMs. 2243-2256 - Simon Lupart, Mohammad Aliannejadi, Evangelos Kanoulas:

ChatR1: Reinforcement Learning for Conversational Reasoning and Retrieval Augmented Question Answering. 2257-2274 - Zhichen Liu, Yongyuan Li, Yang Xu:

Think in Sentences: Explicit Sentence Boundaries Enhance Language Model's Capabilities. 2275-2288 - Mehul Agarwal, Aditya Aggarwal, Arnav Goel, Medha Hira, Anubha Gupta:

MORPHOGEN: A Multilingual Benchmark for Evaluating Gender-Aware Morphological Generation. 2289-2313 - Yuxuan Gu, Wuyang Zhou, Giorgos Iacovides, Danilo P. Mandic:

TeRA: Vector-based Random Tensor Network for High-Rank Adaptation of Large Language Models. 2314-2329 - Xi Chen, Chuan Qin, Jinpeng Li, Shasha Hu, Chao Wang, Hengshu Zhu, Hui Xiong:

GenDis: Generative-Discriminative Dual-View Co-Training for Generalized Category Discovery. 2330-2351 - Jingwei Shi, Xinxiang Yin, Jing Huang, Shengyu Tao, Jinman Zhao:

CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions. 2352-2382 - Kevin Stowe, Svetlana Afanaseva, Rodolfo Raimundo, Yitao Sun, Kailash Patil:

Identifying Bias in Machine-generated Text Detection. 2383-2395 - Yage Zhang, Yukun Jiang, Michael Backes, Yang Zhang:

DE-CLIP: Few-Shot Anomaly Detection via Difference-Guided Embedding Editing. 2396-2407 - Tianyi Hu, Andrea Morales-Garzón, Jingyi Zheng, Maria Maistro, Daniel Hershcovich:

Culinary Crossroads: A RAG Framework for Enhancing Diversity in Cross-Cultural Recipe Adaptation. 2408-2423 - Yuhang Zhou, Mingrui Zhang, Ke Li, Mingyi Wang, Qiao Liu, Qifei Wang, Jiayi Liu, Fei Liu, Serena Li, Weiwei Li, Mingze Gao, Abhishek Kumar, Xiangjun Fan, Zhuokai Zhao, Lizhu Zhang:

Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding. 2424-2439 - Biao Wu, Yutong Xie, Zeyu Zhang, Vu Minh Hieu Phan, Qi Chen, Ling Chen, Qi Wu:

MMCLIP: Cross-Modal Attention Masked Modelling for Medical Language-Image Pre-Training. 2440-2455 - Jinming Wu, Zihao Deng, Wei Li, Yiding Liu, Bo You, Bo Li, Zejun Ma, Ziwei Liu:

MMSearch-R1: Incentivizing LMMs to Search. 2456-2487 - Yanrui Du, Fenglei Fan, Sendong Zhao, Jiawei Cao, Ming Ma, Danyang Zhao, Shuren Qi, Ting Liu, Bing Qin:

Toward Secure Tuning: Mitigating Security Risks from Instruction Fine-Tuning. 2488-2506 - Wenxuan Xie, Yaxun Dai, Wenhao Jiang:

SDE-SQL: Enhancing Text-to-SQL Generation in Large Language Models via Self-Driven Exploration with SQL Probes. 2507-2525 - Eylon Caplan, Tania Chakraborty, Dan Goldwasser:

Splits! Flexible Sociocultural Linguistic Investigation at Scale. 2526-2550 - Lujain Ibrahim, Myra Cheng:

Thinking beyond the anthropomorphic paradigm benefits LLM research. 2551-2563 - Qisheng Hu, Quanyu Long, Wenya Wang:

Coordinating Search-Informed Reasoning and Reasoning-Guided Search in Claim Verification. 2564-2585 - Xiao Pu, Zepeng Cheng, Lin Yuan, Yu Wu, Xiuli Bi:

Breaking the Generator Barrier: Disentangled Representation for Generalizable AI-Text Detection. 2586-2598 - Chen Xu, Yu Ji, Zhenyu Lv, Yang Yi, Yizhe Yang, Luyao Ji, Chaoyi Chen, Xianyang Wang, Tian Lan, Zhihua Wang, Juan Wang, Xunde Dong, Fuze Tian, Qunxi Dong, Bin Hu:

PUPPET: Neural-Symbolic Standardized Patients for Mental Health. 2599-2634 - Jingyuan Wang, Yankai Chen, Zhonghang Li, Chao Huang:

LightReasoner: Can Small Language Models Teach Large Language Models Reasoning? 2635-2663 - Zhanyu Liu, Shiyao Wang, Xingmei Wang, Rongzhou Zhang, Jiaxin Deng, Honghui Bao, Jinghao Zhang, Wuchao Li, Penggei Zheng, Xiangyu Wu, Yifei Hu, Qigen Hu, Xinchen Luo, Lejian Ren, Zixing Zhang, Qianqian Wang, Kuo Cai, Yunfan Wu, Hongtao Cheng, Zexuan Cheng, Lu Ren, Huanjie Wang, Yi Su, Ruiming Tang, Kun Gai, Guorui Zhou:

OneRec-Think: In-Text Reasoning for Generative Recommendation. 2664-2681 - Xiaojian Li, Rongwu Xu, Tianyun Zhang, Yue Wang, Shuo Chen, Qiner Lyu, Briana Zhang, Peiran Yang, Kyle Xue Chen, Haoyuan Shi, Yu Wang, Wei Xu:

AwarenessBench: Assessing Cognitive Capabilities of Language Models. 2682-2741 - Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang:

Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning. 2742-2759 - Zongqi Wang, Tianle Gu, Chen Gong, Xin Tian, Siqi Bao, Yujiu Yang:

SCAN: Structured Capability Assessment and Navigation for LLMs. 2760-2799 - Hamin Koo, Jaehyung Kim:

EMCEE: Improving Multilingual Capability of LLMs via Bridging Knowledge and Reasoning with Extracted Synthetic Multilingual Context. 2800-2822 - Sirui Xia, Aili Chen, Xintao Wang, Tinghui Zhu, Yikai Zhang, Jiangjie Chen, Yanghua Xiao:

Can LLMs Learn to Map the World from Local Descriptions? 2823-2845 - Yanhao Li, Lu Ma, Jiaran Zhang, Lexiang Tang, Wentao Zhang, Guibo Luo:

LEASH: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model. 2846-2856 - Chunhua Liu, Kabir Manandhar Shrestha, Sukai Huang:

ALIGN: Word Association Learning for Cultural Alignment in Large Language Models. 2857-2879 - Enzhi Wang, Jiaming Zhou, Yuhang Jia, Aobo Kong, Qicheng Li, Yong Qin:

RealTalk-CN: A Realistic Chinese Speech Task-Oriented Dialogue Benchmark with Cross-Modal Analysis. 2880-2897 - Tu-Phuong Mai, Minh-Ha H. Le, Duc-Luong Tran, Phuong-Anh Chu, Duy-Cat Can, Hoang-Quynh Le:

HOPE: Hybrid Optimized Parallel Encoding with Supervised and Unsupervised Semantic Fusion for Depression Symptom Detection. 2898-2911 - Kangwen Zhao, Jianfeng Cai, Jinhua Zhu, Ruopei Sun, Dongyun Xue, Wengang Zhou, Li Li, Houqiang Li:

Bias Fitting to Mitigate Length Bias of Reward Model in RLHF. 2912-2927 - Xinyu Tang, Yuliang Zhan, Zhixun Li, Xin Zhao, Zhenduo Zhang, Zujie Wen, Zhiqiang Zhang, Jun Zhou:

Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards. 2928-2954 - Jinyang Wu, Mingkuan Feng, Shuai Zhang, Feihu Che, Zhengqi Wen, Chonghua Liao, Ling Yang, Haoran Luo, Zheng Lian, Jianhua Tao:

Beyond Examples: Towards Automated Thought-level In-Context Reasoning for Large Language Models. 2955-2995 - Mingkuan Feng, Jinyang Wu, Siyuan Liu, Shuai Zhang, Hongjian Fang, Ruihan Jin, Feihu Che, Pengpeng Shao, Zhengqi Wen, Jianhua Tao:

Two-Stage Regularization-Based Structured Pruning for LLMs. 2996-3012 - Donald Shenaj, Ondrej Bohdal, Taha Ceritli, Mete Ozay, Pietro Zanuttigh, Umberto Michieli:

K-Merge: Online Continual Merging of Adapters for On-device Large Language Models. 3013-3029 - Wenxi Chen, Ruiqi Yan, Yushen Chen, Zhikang Niu, Ziyang Ma, Xiquan Li, Yuzhe Liang, Wenhan Lin, Shunshun Yin, Ming Tao, Xinsheng Wang, Xie Chen:

SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization. 3030-3048 - Zhongyuan Peng, Yifan Yao, Kaijing Ma, Shuyue Guo, Yizhe Li, Yichi Zhang, Chenchen Zhang, Yifan Zhang, Zhouliang Yu, Luming Li, Minghao Liu, Yihang Xia, Jiawei Shen, Yuchen Wu, Yixin Cao, Zhaoxiang Zhang, Wenhao Huang, Jiaheng Liu, Ge Zhang:

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization. 3049-3088 - Zhihao Gong, Zeyu Sun, Dong Huang, Qingyuan Liang, Jie M. Zhang, Dan Hao:

TRACE: Evaluating Execution Efficiency of LLM-Based Code Translation. 3089-3117 - Diana Abagyan, Alejandro Salamanca, Andrés Felipe Cruz-Salinas, Kris Cao, Hangyu Lin, Acyr Locatelli, Marzieh Fadaee, Ahmet Üstün, Sara Hooker:

One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers. 3118-3136 - Kevin Du, Clara Kümpel, Michelle Wastl, Alex Warstadt:

It's Not What You Say, It's How You Say It: Evaluating LLM Responses to Expressions of Belief. 3137-3151 - Junyi Li, Yongqiang Chen, Ningning Ding:

CiPO: Counterfactual Unlearning for Large Reasoning Models through Iterative Preference Optimization. 3152-3170 - Jason S. Lucas, Matt Murtagh-White, Ali Al-Lawati, Uchendu Uchendu, Adaku Uchendu, Dongwon Lee:

DIA-HARM: Dialectal Disparities in Harmful Content Detection Across 50 English Dialects. 3171-3214 - Dahyun Jung, Jaewook Lee, Heuiseok Lim:

Towards Scalable Lifelong Knowledge Editing with Selective Knowledge Suppression. 3215-3231 - Zhengxiang Cheng, Dongping Chen, Mingyang Fu, Tianyi Zhou:

Optimizing Length Compression in Large Reasoning Models. 3232-3250 - Yupeng Hou, Jiacheng Li, Xiangjun Fu, Zhankui He, An Yan, Xiusi Chen, Julian J. McAuley:

Bridging Language and Items for Retrieval and Recommendation: Benchmarking LLMs as Semantic Encoders. 3251-3265 - Zhenhua Liu, Lijun Li, Ruizhe Chen, Yuxian Jiang, Tong Zhu, Zhaochen Su, Wenliang Chen, Jing Shao:

Evolutionary Guided Decoding: Iterative Value Refinement for LLMs. 3266-3283 - Adarsh Singh, Kushal Raj Bhandari, Jianxi Gao, Soham Dan, Vivek Gupta:

CRAFT: Training-Free Cascaded Retrieval for Tabular QA. 3284-3298 - Jinu Lee, Kyoung-Woon On, Sophia Simeng Han, Arman Cohan, Julia Hockenmaier:

Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics. 3299-3322 - Nikita Tatarinov, Vidhyakshaya Kannan, Haricharana Srinivasa, Arnav Raj, Harpreet Singh Anand, Varun Singh, Aditya Luthra, Ravij Lade, Agam Shah, Sudheer Chava:

KG-MuLQA: A Framework for KG-based Multi-Level QA Extraction and Long-Context LLM Evaluation. 3323-3359 - Guangya Wan, Mingyang Ling, Xiaoqi Ren, Rujun Han, Sheng Li, Zizhao Zhang:

COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context. 3360-3380 - Abdelrahman Abdallah, Mohammed Ali, Bhawna Piryani, Adam Jatowt:

BracketRank: Large Language Model Document Ranking via Reasoning-based Competitive Elimination. 3381-3397 - Zirui Yan, Dennis Wei, Dmitriy A. Katz, Prasanna Sattigeri, Ali Tajer:

Multi-component Causal Tracing in Large Language Models. 3398-3418 - Ruixuan Deng, Xiaoyang Hu, Miles Gilberti, Shane Storks, Aman Taxali, Mike Angstadt, Chandra Sekhar Sripada, Joyce Chai:

Sparse Feature Coactivation Reveals Causal Semantic Modules in Large Language Models. 3419-3451 - Ido Andrew Atad, Itamar Zimerman, Shahar Katz, Lior Wolf:

TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors. 3452-3468 - Mingtian Tan, Mike A. Merrill, Zachary Gottesman, Tim Althoff, David Evans, Thomas Hartvigsen:

Inferring Events from Time Series using Language Models. 3469-3490 - Minjae Lee, Minhyuk Seo, Tingyu Qu, Tinne Tuytelaars, Jonghyun Choi:

OASIS: Online Sample Selection for Continual Instruction Tuning. 3491-3515 - Michael Lan, Narmeen Fatimah Oozeer, Chaithanya Bandi, Philip Quirke, Austin Meek, Fazl Barez, Amir Abdullah:

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing. 3516-3540 - Boyang Xue, Bin Wu, Shuofei Qiao, Sheng Wang, Rui Wang, Yiming Du, Hongru Wang, Jeff Z. Pan, Emine Yilmaz, Kam-Fai Wong, Aldo Lipani:

Mitigating Context Interference for Reliable and Efficient Search Agents. 3541-3558 - Shaohua Duan, Pengcheng Huang, Xinze Li, Zhenghao Liu, Xiaoyuan Yi, Yukun Yan, Shuo Wang, Yu Gu, Ge Yu, Maosong Sun:

Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization. 3559-3575 - Ghadir Alselwi, Hao Xue, Shoaib Jameel, Basem Suleiman, Flora D. Salim, Imran Razzak:

Long Context Modeling with Ranked Memory-Augmented Retrieval. 3576-3590 - Congmin Zheng, Jiachen Zhu, Zhuoying Ou, Yuxiang Chen, Kangning Zhang, Rong Shan, Zeyu Zheng, Mengyue Yang, Jianghao Lin, Yong Yu, Weinan Zhang:

A Comprehensive Survey of Process Reward Models: Data Generation, Model Construction, and Usage. 3591-3607 - Xue Jiang, Yihong Dong, Mengyang Liu, Hongyi Deng, Tian Wang, Yongding Tao, Zhi Jin, Wenpin Jiao, Ge Li:

CODERL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment. 3608-3622 - Yihong Dong, Zhaoyu Ma, Xue Jiang, Zhiyuan Fan, Jiaru Qian, Yongmin Li, Jianha Xiao, Zhi Jin, Ge Li:

Saber: Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model in Code Generation. 3623-3642 - Hao Li, Jiayang Gu, Jingkuan Song, An Zhang, Lianli Gao:

Debiased Orthogonal Boundary-Driven Efficient Noise Mitigation. 3643-3662 - Yi Jiang, Sendong Zhao, Jianbo Li, Haochun Wang, Lizhe Zhang, Yan Liu, Bing Qin:

Collaborative Chain-of-Agents for Parametric-Retrieved Knowledge Synergy. 3663-3680 - Pengyun Zhu, Qiheng Sun, Long Wen, Yanbo Wang, Yang Cao, Junxu Liu, Deyi Xiong, Jinfei Liu, Zhibo Wang, Kui Ren:

APPSI-139: A Parallel Corpus of English Application Privacy Policy Summarization and Interpretation. 3681-3706 - Da Li, Yuxiao Luo, Keping Bi, Jiafeng Guo, Wei Yuan, Biao Yang, Yan Wang, Fan Yang, Tingting Gao, Guorui Zhou:

Compressing then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding. 3707-3718 - Minh Chu Xuan, Tien-Phat Nguyen, Linh Ngo Van, Dinh Viet Sang, Nguyen Thi Ngoc Diep, Trung Le:

LLM-XTM: Enhancing Cross-Lingual Topic Models with Large Language Models. 3719-3737 - Min-Jae Kim, Jun-Yeong Moon, Mujeen Sung, Gyeong-Moon Park:

Open Your Model's Eyes: Video and Context-Aware Multimodal Backchannel Prediction. 3738-3755 - Joonhyung Park, Jaeyun Song, Sihwan Park, Eunho Yang:

Bringing Real-World Relations into Video Generation with Graph-Structured Knowledge. 3756-3771 - Yoonhyung Lee, Hyunsin Park, Jinhwan Park, Jinkyu Lee:

FC-TTS: Style and Timbre Control in Zero-Shot Text-to-Speech with Disentangled Speech Representations. 3772-3791 - Shuo Yang, Zheyu Zhang, Bardh Prenkaj, Gjergji Kasneci:

SAGE: Sparse Adaptive Guidance for Dependency-Aware Tabular Data Generation. 3792-3807 - Payel Santra, Lavisha Sharma, Madhusudan Ghosh, Partha Basuchowdhuri:

Mask-to-Correct⁺: Leveraging Retriever Diversity for Masking-guided Faithful Fact Correction. 3808-3825 - Jusen Du, Jiaxi Hu, Zhang Tao, Weigao Sun, Yu Cheng:

Native Hybrid Attention for Efficient Sequence Modeling. 3826-3842 - Woongyeong Yeo, Kangsan Kim, Soyeong Jeong, Jinheon Baek, Sung Ju Hwang:

UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities. 3843-3871 - Yi Su, Dian Yu, Linfeng Song, Juntao Li, Haitao Mi, Zhaopeng Tu, Min Zhang, Dong Yu:

Crossing the Reward Bridge: Expanding Reinforcement Learning with Verifiable Rewards Across Diverse Domains. 3872-3892 - Song-Li Wu, Zhaocheng Du, Weinan Gan, Jingyi Wang, Xianquan Wang:

From ID to LLM: Rethinking Representation Learning for Recommendation. 3893-3905 - Tianle Liu, Zhiliang Tian, Zhen Huang, Tianlun Liu, Jingyuan Huang, Zhaoning Zhang, Chengcheng Shao, Dongsheng Li:

DMHM: Density-aware Manifold Learning and Hybrid Mahalanobis Energy for LLMs-generated Text Detection. 3906-3929 - Li Zheng, Yanyi Luo, Hao Fei, Yuzhe Ding, Yujie Huang, Fei Li, Chong Teng, Donghong Ji:

Dynamic Emotion and Personality Profiling for Multimodal Deception Detection. 3930-3940 - Jingsheng Zheng, Jintian Zhang, Yujie Luo, Yuren Mao, Yunjun Gao, Lun Du, Huajun Chen, Ningyu Zhang:

Can We Predict Before Executing Machine Learning Agents? 3941-3974 - Md Mokarram Chowdhury, Daniel Agyei Asante, Ernie Chang, Yang Li:

IMPACT: Importance-Aware Activation Space Reconstruction. 3975-3992 - Yupeng Chang, Yuan Wu, Yi Chang:

SOS-LoRA: Static Orthogonal-Subspace Low-Rank Adaptation with Fixed Multi-Scale Scaling. 3993-4005 - Jingyu Lu, Yuhan Wang, Fan Zhuo, Xize Cheng, Changhao Pan, Xueyi Pu, Yifu Chen, Chenyuhao Wen, Tianle Liang, Zhou Zhao:

SDiaReward: Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness. 4006-4028 - Satyam Kumar, Kaustubh Shivshankar Shejole, Pushpak Bhattacharyya:

Looking at Radiology Report Generation through a Causal Lens: A Survey. 4029-4057 - Tianlun Liu, Zhiliang Tian, Zhen Huang, Xingzhi Zhou, Wanlong Yu, Tianle Liu, Feng Liu, Dongsheng Li:

CTTA-T: Continual Test-Time Adaptation for Text Understanding via Teacher-Student with a Domain-aware and Generalized Teacher. 4058-4078 - Hongyuan Lu, Zixuan Li, Zefan Zhang, Bowen Cao, Wai Lam:

Adam's Law: Textual Frequency Law on Large Language Models. 4079-4105 - Dongqi Liu, Hang Ding, Qiming Feng, Xurong Xie, Zhucun Xue, Chengjie Wang, Jian Li, Jiangning Zhang, Yabiao Wang:

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation. 4106-4136 - Zhihao Zhan, Yuhao Chen, Jiaying Zhou, Qinhan Lyu, Hao Liu, Keze Wang, Liang Lin, Guangrun Wang:

Stable Language Guidance for Vision-Language-Action Models. 4137-4159 - Yongqi Li, Hao Lang, Tieyun Qian, Yongbin Li:

Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions. 4160-4180 - Zhengyi Zhao, Shubo Zhang, Zezhong Wang, Yuxi Zhang, Huimin Wang, Yutian Zhao, Yefeng Zheng, Binyang Li, Kam-Fai Wong, Xian Wu:

Guaranteeing Knowledge Integration with Joint Decoding for Retrieval-Augmented Generation. 4181-4205 - Li Zheng, Xin Zhang, Shuyi He, Fei Li, Chong Teng, Jiang-Ming Yang, Donghong Ji, Zhuang Li:

Are Emotion and Rhetoric Neurons in LLM? Neuron Recognition and Adaptive Masking for Emotion-Rhetoric Prediction Steering. 4206-4216 - Yifan Yang, Bing Han, Hui Wang, Wei Wang, Ziyang Ma, Long Zhou, Zengrui Jin, Guanrou Yang, Tianrui Wang, Xu Tan, Xie Chen:

Towards Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training. 4217-4235 - Chenfei Liao, Wensong Wang, Zichen Wen, Xu Zheng, Yiyu Wang, Haocong He, Yuanhuiyi Lyu, Lutao Jiang, Xin Zou, Yuqian Fu, Bin Ren, Linfeng Zhang, Xuming Hu:

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods. 4236-4253 - Bo Li, Mingda Wang, Gexiang Fang, Shikun Zhang, Wei Ye:

Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning. 4254-4274 - Shei Pern Chua, Zhen Leng Thai, Kai Jun Teh, Xiao Li, Qibing Ren, Xiaolin Hu:

Between a Rock and a Hard Place: The Tension Between Ethical Reasoning and Safety Alignment in LLMs. 4275-4310 - Zhengyi Zhao, Shubo Zhang, Yiming Du, Bin Liang, Baojun Wang, Zhongyang Li, Binyang Li, Kam-Fai Wong:

EventWeave: A Dynamic Framework for Capturing Core and Supporting Events in Dialogue Systems. 4311-4339 - Zhenyun Yin, Shujie Wang, Xuhong Wang, Xingjun Ma, Yingchun Wang:

Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with Constraints. 4340-4354 - Zhiyuan Yu, Shijian Xiao, Cam-Tu Nguyen, Zhangyue Yin, Lekai Xing, Wenzhong Li, Sanglu Lu:

Thermometer of Thoughts: Enhancing LLM's Exploration via Attention Temperature Modulation. 4355-4368 - Sizhe Wang, Zhengren Wang, Dongsheng Ma, Yongan Yu, Rui Ling, Zhiyu Li, Feiyu Xiong, Wentao Zhang:

CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation. 4369-4402 - Chuyi Kong, Wei Gao, Jing Ma, Hongzhan Lin, Yuxi Sun:

REFLEX: Self-Refining Explainable Fact-Checking via Verdict-Anchored Style Control. 4403-4431 - Haoming Xu, Ningyuan Zhao, Yunzhi Yao, Weihong Xu, Hongru Wang, Xinle Deng, Shumin Deng, Jeff Z. Pan, Huajun Chen, Ningyu Zhang:

Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency. 4432-4457 - Sensen Gao, Shanshan Zhao, Xu Jiang, Lunhao Duan, Yong Xien Chng, Qing-Guo Chen, Weihua Luo, Kaifu Zhang, Jia-Wang Bian, Mingming Gong:

Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding. 4458-4489 - Yanzhi Tian, Cunxiang Wang, Zeming Liu, Heyan Huang, Wenbo Yu, Dawei Song, Jie Tang, Yuhang Guo:

Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation. 4490-4524 - Yongxin Guo, Wenbo Deng, Zhenglin Cheng, Xiaoying Tang:

G²RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance. 4525-4539 - Chen Zhang, Jiuheng Lin, Zhiyuan Liao, Yansong Feng:

Efficient Low-Resource Language Adaptation via Multi-Source Dynamic Logit Fusion. 4540-4557 - Yukun Jiang, Xinyue Shen, Michael Backes, Zheng Li, Yang Zhang:

Open Schrödinger's Closed Box: Identifying Retrieval Augmented Generation in API-Accessible Large Language Model Services. 4558-4580 - Yuhao Zhang, Liang Yan, Shaoming Duan, Xinyu Zha, Jinhang Su, Peiyi Han, Chuanyi Liu:

AFT-Tab: Adversarial Fine-Tuning for Tabular Data Synthesis with Long Text Columns. 4581-4594 - Yang Liu, Hongming Li, Melissa Xiaohui Qin, Chao Huang, Qiankun Liu:

Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models. 4595-4618 - Yichuan Ma, Linyang Li, Yongkang Chen, Peiji Li, Xiaozhe Li, Qipeng Guo, Dahua Lin, Kai Chen:

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic. 4619-4636 - Xiaoyu Liu, Yun Zhang, Wei Li, Simiao Li, Xudong Huang, Hanting Chen, Yehui Tang, Jie Hu, Zhiwei Xiong, Yunhe Wang:

Multi-Granularity Semantic Revision for Large Language Model Distillation. 4637-4658 - Zhijie Tan, Xu Chu, Guanyu Wang, Ziyu Li, Weiping Li, Tong Mo:

RADO: Reasoning Audit-Driven Optimization for Rigorous Reasoning in High-Stakes Domains. 4659-4683 - Bo Li, Mingda Wang, Shikun Zhang, Wei Ye:

Instruction Data Selection via Answer Divergence. 4684-4702 - Dianyun Wang, Qingsen Ma, Yuhu Shang, Zhifeng Lu, Zhenbo Xu, Lechen Ning, Huijia Wu, Zhaofeng He:

Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation. 4703-4721 - Kangcheng Luo, Tinglang Wu, Yansong Feng:

D²Plan: Dual-Agent Dynamic Global Planning for Complex Retrieval-Augmented Reasoning. 4722-4754 - Qingyu Ren, Qianyu He, Powei Chang, Jie Zeng, Zeye Sun, Fei Yu, Jiaqing Liang, Yanghua Xiao:

Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following. 4755-4776 - Zehua Pei, Hui-Ling Zhen, Lancheng Zou, Xianzhi Yu, Wulong Liu, Sinno Jialin Pan, Mingxuan Yuan, Bei Yu:

Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis. 4777-4789 - Xu Chu, Guanyu Wang, Zhijie Tan, Xinrong Chen, Ziyu Li, Tong Mo, Weiping Li:

Towards Order Fairness: Mitigating LLMs Order Sensitivity through Dual Group Advantage Optimization. 4790-4805 - Junhao Liu, Haonan Yu, Zhenyu Yan, Xin Zhang:

Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models. 4806-4844 - Yakun Zhu, Yutong Huang, Shengqian Qin, Zhongzhen Huang, Shaoting Zhang, Xiaofan Zhang:

MedMCP-Calc: Benchmarking LLMs for Realistic Medical Calculator Scenarios via MCP Integration. 4845-4873 - Tarek Mahmoud, Veronika Solopova, Premtim Sahitaj, Ariana Sahitaj, Max Upravitelev, Mervat Abassy, Hana Fatima Shaikh, Neda Foroutan, Vera Schmitt, Preslav Nakov:

Uncovering Temporal Framing in the News. 4874-4902

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














