


IJCNLP 2025: Mumbai, India - Findings
- Kentaro Inui, Sakriani Sakti, Haofen Wang, Derek F. Wong, Pushpak Bhattacharyya, Biplab Banerjee, Asif Ekbal, Tanmoy Chakraborty, Dhirendra Pratap Singh:

Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, IJCNLP-AACL 2025, Mumbai, India, December 20-24, 2025. The Asian Federation of Natural Language Processing and The Association for Computational Linguistics 2025, ISBN 979-8-89176-303-6 - Xin Guan, PeiHsin Lin, Zekun Wu, Ze Wang, Ruibo Zhang, Emre Kazim, Adriano S. Koshiyama:

MPF: Aligning and Debiasing Language Models post Deployment via Multi-Perspective Fusion. 1-27 - Philipp Seeberger, Steffen Freisinger, Tobias Bocklet, Korbinian Riedhammer:

Generalizing to Unseen Disaster Events: A Causal View. 28-37 - Yifeng Peng, Zhizheng Wu, Chen Chen:

Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs. 38-55 - Qingcheng Zeng, Guanhong Liu, Zhaoqian Xue, Diego Ford, Rob Voigt, Loni Hagen, Lingyao Li:

Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt. 56-68 - Anuj Attri, Arnav Attri, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Nikesh Garera, Pushpak Bhattacharyya:

LLMs as Architects and Critics for Multi-Source Opinion Summarization. 69-101 - Po-Chun Chen, Hen-Hsen Huang, Hsin-Hsi Chen:

Diverge to Induce Prompting: Multi-Rationale Induction for Zero-Shot Reasoning. 102-115 - Donghyeon Ko, Sohee Yang, Donghyun Kwak, Sang-Woo Lee:

Building Helpful-Only Large Language Models: A Complete Approach from Motivation to Evaluation. 116-131 - Abhinav Lalwani, Tasha Kim, Lovish Chopra, Christopher Hahn, Zhijing Jin, Mrinmaya Sachan:

Autoformalizing Natural Language to First-Order Logic: A Case Study in Logical Fallacy Detection. 132-147 - Caiqi Zhang, Ruihan Yang, Zhisong Zhang, Xinting Huang, Sen Yang, Dong Yu, Nigel Collier:

Atomic Calibration of LLMs in Long-Form Generations. 148-169 - Siyi Guo, Myrl G. Marmarelis, Fred Morstatter, Kristina Lerman:

Estimating Causal Effects of Text Interventions Leveraging LLMs. 170-190 - Tiankai Yang, Junjun Liu, Michael Siu, Jiahang Wang, Zhuangzhuang Qian, Chanjuan Song, Cheng Cheng, Xiyang Hu, Yue Zhao:

AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection. 191-205 - Kalyan Nakka, Jimmy Dani, Ausmit Mondal, Nitesh Saxena:

LiteLMGuard: Seamless and Lightweight On-Device Guardrails for Small Language Models against Quantization Vulnerabilities. 206-223 - Muhammad Haroon, Magdalena Wojcieszak, Anshuman Chhabra:

"Whose Side Are You On?" Estimating Ideology of Political and News Content Using Large Language Models and Few-shot Demonstration Selection. 224-243 - Jun Suzuki, Ryoma Ishigaki, Eisaku Maeda:

AnaToM: A Dataset Generation Framework for Evaluating Theory of Mind Reasoning Toward the Anatomy of Difficulty through Structurally Controlled Story Generation. 244-257 - Robin Young:

Information-theoretic Distinctions Between Deception and Confusion. 258-268 - Atanu Mandal, Madhusudan Ghosh, Pratick Maiti, Sudip Kumar Naskar:

Whispering in Ol Chiki: Cross-Lingual Transfer Learning for Santali Speech Recognition. 269-278 - Daniil Gurgurov, Katharina Trinley, Ivan Vykopal, Josef van Genabith, Simon Ostermann, Roberto Zamparelli:

Multilingual Political Views of Large Language Models: Identification and Steering. 279-298 - Iñigo Pikabea, Iñaki Lacunza, Oriol Pareras Velasco, Carlos Escolano, Aitor Gonzalez-Agirre, Javier Hernando, Marta Villegas:

Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization. 299-337 - Sachin Yadav, Dominik Schlechtweg:

XL-DURel: Finetuning Sentence Transformers for Ordinal Word-in-Context Classification. 338-351 - Ashok Urlana, Gopichand Kanumolu, Charaka Vinayak Kumar, Bala Mallikarjunarao Garlapati, Rahul Mishra:

HalluCounter: Reference-free LLM Hallucination Detection in the Wild! 352-383 - Md. Tanzib Hosain, Md. Kishor Morol:

Intrinsic Linguistic Bias in Formal vs. Informal Bengali Pragmatics with Progressive Context Inflation. 384-396 - Keisuke Mizutani, Koriki Ryonosuke, Kento Tokuyama:

Enhancing LLM-Based Molecular Captioning with Molecular Fingerprints. 397-410 - Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha:

SiLQ: Simple Large Language Model Quantization-Aware Training. 411-422 - Harshil Vejendla:

Teaching by Failure: Counter-Example-Driven Curricula for Transformer Self-Improvement. 423-431 - Xinye Zhao, Spyridon Mastorakis:

SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching. 432-445 - Harshil Vejendla:

RewriteNets: End-to-End Trainable String-Rewriting for Generative Sequence Modeling. 446-451 - Long Nguyen, Quynh Vo, Hung Luu, Tho Quan:

When in Doubt, Ask First: A Unified Retrieval Agent-Based System for Ambiguous and Unanswerable Question Answering. 452-472 - Vrund Dobariya, Jatayu Baxi, Bhavika Gambhava, Brijesh Bhatt:

Smruti: Grammatical Error Correction for Gujarati using LLMs with Non-Parametric Memory. 473-485 - Sumin Jo, Junseong Choi, Jiho Kim, Edward Choi:

R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs. 486-509 - Zhiqiang Shi:

Data Augmentation for Low-resource Neural Machine Translation: A Systematic Analysis. 510-522 - Xiaoxuan Li, Lin Ni, Xin Wang, Tang Yitong, Ruoxuan Li, Jiamou Liu, Zhongsheng Wang:

LLM-based Business Process Models Generation from Textual Descriptions. 523-533 - Roberto Ceraolo, Dmitrii Kharlapenko, Ahmad Khan, Amélie Reymond, Rada Mihalcea, Bernhard Schölkopf, Mrinmaya Sachan, Zhijing Jin:

Quriosity: Analyzing Human Questioning Behavior and Causal Inquiry through Curiosity-Driven Queries. 534-563 - Zhichao Xu, Zhiqi Huang, Shengyao Zhuang, Vivek Srikumar:

Distillation versus Contrastive Learning: How to Train Your Rerankers. 564-578 - Shanta Kallur, Basavaraj S. Anami:

A Word-Splitting Approach to Kannada Sanskrit Sandhi Words Useful in Effective English Translation. 579-588 - Kelvin Han, Claire Gardent:

Generating Questions Under Discussion with Reinforcement Learning using Ranking and Scoring for Reward and Evaluation. 589-615 - Alexander Nemecek, Yuzhou Jiang, Erman Ayday:

The Feasibility of Topic-Based Watermarking on Academic Peer Reviews. 616-634 - Girish, Mohd Mujtaba Akhtar, Farhan Sheth, Muskaan Singh:

Towards Attribution of Generators and Emotional Manipulation in Cross-Lingual Synthetic Speech using Geometric Learning. 635-645 - Ercong Nie, Shuzhou Yuan, Bolei Ma, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze:

Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models. 646-659 - Guangliang Liu, Zimo Qi, Xitong Zhang, Lu Cheng, Kristen Marie Johnson:

Moral Self-correction is Not An Innate Capability in Language Models. 660-683 - Ruosi Shao, Md Shamim Seraj, Kangyi Zhao, Yingtao Luo, Lincan Li, Bolin Shen, Averi Bates, Yue Zhao, Chongle Pan, Lisa Hightow-Weidman, Shayok Chakraborty, Yushun Dong:

LLM-Empowered Patient-Provider Communication: A Data-Centric Survey From a Clinical Perspective. 684-705 - Hanwen Shen, Jiajie Lu, Yupeng Cao, Xiaonan Yang:

Enhancing Scene Transition Awareness in Video Generation via Post-Training. 706-721 - Arie Cattan, Alon Jacovi, Alex Fabrikant, Jonathan Herzig, Roee Aharoni, Hannah Rashkin, Dror Marcus, Avinatan Hassidim, Yossi Matias, Idan Szpektor, Avi Caciularu:

DoubleDipper: Recycling Contexts for Efficient and Attributed In-Context Learning. 722-737 - Oshayer Siddique, J. M. Areeb Uzair Alam, Md Jobayer Rahman Rafy, Syed Rifat Raiyan, Hasan Mahmud, Md. Kamrul Hasan:

PhysicsEval: Inference-Time Techniques to Improve the Reasoning Proficiency of Large Language Models on Physics Problems. 738-760 - Damien Sileo:

Attention Overflow: Language Model Input Blur during Long-Context Missing Items Identification. 761-767 - Danial Namazifard, Lukas Galke Poech:

Isolating Culture Neurons in Multilingual Large Language Models. 768-785 - Diya Saha, Sudeshna Jana, Manjira Sinha, Tirthankar Dasgupta:

Benchmarking Bangla Causality: A Dataset of Implicit and Explicit Causal Sentences and Cause-Effect Relations. 786-794 - Farhad Nooralahzadeh, Yi Zhang, Jonathan Fürst, Kurt Stockinger:

Multi-Modal Data Exploration via Language Agents. 795-813 - Lekkala Sai Teja, Annepaka Yadagiri, Partha Pakray, Chukhu Chunka, Mangadoddi Srikar Vardhan:

Fine-Grained Detection of AI-Generated Text Using Sentence-Level Segmentation. 814-828 - Yuya Chiba, Ryuichiro Higashinaka:

Incorporating Dialogue State Tracking into Japanese Full-duplex Task-oriented Spoken Dialogue Model. 829-836 - Arjun T. D, Anand Kumar Madasamy, Sheela Ramanna:

SeqTNS: Sequential Tolerance-based Classifier for Identification of Rhetorical Roles in Indian Legal Documents. 837-847 - Junseok Kim, Nakyeong Yang, Kyomin Jung:

Persona is a Double-Edged Sword: Rethinking the Impact of Role-play Prompts in Zero-shot Reasoning Tasks. 848-862 - Danny Brahman, Mohammad Mahoor:

CodeEval: A pedagogical approach for targeted evaluation of code-trained Large Language Models. 863-883 - Braeden Sherritt, Isar Nejadgholi, Efstratios Aivaliotis, Khaled Mslmani, Marzieh Amini:

WildFireCan-MMD: A Multimodal Dataset for Classification of User-Generated Content During Wildfires in Canada. 884-902 - Sahil Bansal, Sai Shruthi Sistla, Aarti Arikatala, Sebastian Schreiber:

Planning Agents on an Ego-Trip: Leveraging Hybrid Ego-Graph Ensembles for Improved Tool Retrieval in Enterprise Task Planning. 903-918 - Inaya Rahmanisa, Lyzander Marciano Andrylie, Mahardika Krisna Ihsani, Alfan Farizki Wicaksono, Haryo Akbarianto Wibowo, Alham Fikri Aji:

Unveiling the Influence of Amplifying Language-Specific Neurons. 919-968 - Dohyeon Kim, Gayeon Jung, Jeongseon Cho, Jihoon Yang:

Enhancing Coreference Resolution with LLM-driven Data Augmentation and Adversarial Filtering. 969-984 - Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Shivam Mishra, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya:

TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context. 985-1002 - Darshita Rathore, Vineet Kumar, Chetna Bansal, Anindya Moitra:

How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness. 1003-1013 - Shuzhou Yuan, Ercong Nie, Lukas Kouba, Helmut Schmid, Hinrich Schütze, Michael Färber:

LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification. 1014-1027 - Shuzhou Yuan, Zhan Qu, Mario Tawfelis, Michael Färber:

From Monolingual to Bilingual: Investigating Language Conditioning in Large Language Models for Psycholinguistic Tasks. 1028-1040 - Abhishek Kuber, Enrico Liscio, Ruixuan Zhang, Caroline A. Figueroa, Pradeep K. Murukannaiah:

Signs of Struggle: Spotting Cognitive Distortions across Language and Register. 1041-1054 - Kushal Chawla, Alfy Samuel, Anoop Kumar, Daben Liu:

FB-RAG: Improving RAG with Forward and Backward Lookup. 1055-1071 - Kaushal Attaluri, Radhika Mamidi, Sireesha Chittepu, Anirudh Chebolu, Hitendra Sarma Thogarcheti:

Emotion-Aware Dysarthric Speech Reconstruction: LLMs and Multimodal Evaluation with MCDS. 1072-1080 - Veer Chheda, Avantika Sankhe, Aaditya Uday Ghaisas:

Iterative Critique-Driven Simplification: Targeted Enhancement of Complex Definitions with Small Language Models. 1081-1096 - Haein Kong, A M. Muntasir Rahman, Ruixiang Tang, Vivek Singh:

SafePersuasion: A Dataset, Taxonomy, and Baselines for Analysis of Rational Persuasion and Manipulation. 1097-1111 - Manveer Singh Tamber, Jimmy Lin:

Illusions of Relevance: Arbitrary Content Injection Attacks Deceive Retrievers, Rerankers, and LLM Judges. 1112-1127 - Jiadong Gary Liang, Adam Kabbara, Jiaying Liu, Ronaldo Luo, Kina Kim, Michael Guerzhoy:

Semantic, Orthographic, and Phonological Biases in Humans' Wordle Gameplay. 1128-1135 - Sora Kadotani, Kosuke Nishida, Kyosuke Nishida:

Learning from Hallucinations: Mitigating Hallucinations in LLMs via Internal Representation Intervention. 1136-1143 - Rhitabrat Pokharel, Yufei Tao, Ameeta Agrawal:

CAPO: Confidence Aware Preference Optimization Learning for Multilingual Preferences. 1144-1156 - Haau-Sing Li, Patrick Fernandes, Iryna Gurevych, André F. T. Martins:

Formalizing Test-Time Compute for Function-Level Code Generation. 1157-1170 - Ziwei Chen, Bernhard Bermeitinger, Christina Niklaus:

BioMistral-Clinical: A Scalable Approach to Clinical LLMs via Incremental Learning and RAG. 1171-1184 - Diego Alves, Sergei Bagdasarov, Elke Teich:

Surprisal Dynamics for the Detection of Multi-Word Expressions in English. 1185-1194 - Manon Reusens, Philipp Borchert, Jochen De Weerdt, Bart Baesens:

Native Design Bias: Studying the Impact of English Nativeness on Language Model Performance. 1195-1215 - Yuqi Liang, Wenjing Xu, Hongzhi Xu:

Improving Proficiency and Grammar Accuracy for Chinese Language Learners with Large Language Models. 1216-1232 - Raj Vardhan Tomar, Preslav Nakov, Yuxia Wang:

UnsafeChain: Enhancing Reasoning Model Safety via Hard Cases. 1233-1247 - Aneesha Sampath, Oya Aran, Emily Mower Provost:

SEER: The Span-based Emotion Evidence Retrieval Benchmark. 1248-1267 - Bhavana Akkiraju, Srihari Bandarupalli, Swathi Sambangi, Vijaya Saraswathi R, Vasavi Ravuri, Anil Vuppala:

TeluguST-46: A Benchmark Corpus and Comprehensive Evaluation for Telugu-English Speech Translation. 1268-1275 - Sourava Kumar Behera, Rohit Saluja:

HiLearners: Non-Native Spoken Hindi Error Correction. 1276-1288 - Rhitabrat Pokharel, Ameeta Agrawal:

MTQ-Eval: Multilingual Text Quality Evaluation for Language Models. 1289-1304 - Rahul Ghosh, Chun-Hao Liu, Gaurav Rele, Vidya Sagar Ravipati, Hazar Aouad:

TelcoAI: Advancing 3GPP Technical Specification Search through Agentic Multi-Modal Retrieval-Augmented Generation. 1305-1317 - Cheng Zhang, Rajasekhar Kakarla, Kangda Wei, Ruihong Huang:

ENG-DRB: PDTB-style Discourse Relation Bank on Engineering Tutorial Video Scripts. 1318-1330 - Aitaro Yamamoto, Hiroki Ouchi, Kota Tsubouchi, Tatsuo Yamashita, Ryo Tsujimoto, Yuki Matsuda, Hirohiko Suwa:

Did the Writer Actually Visit the Location? Analysis of Location Reviews from Visit Experience. 1331-1337 - Soumyadeep Jana, Sanasam Ranbir Singh:

Teaching Sarcasm: Few-Shot Multimodal Sarcasm Detection via Distillation to a Parameter-Efficient Student. 1338-1349 - Nidhi Gupta, Qinghua Li:

Seeing Through the Mask: AI-Generated Text Detection with Similarity-Guided Graph Reasoning. 1350-1360 - Yuanjun Shi, Zhaopeng Qiu:

Reasoning Enhanced Missing Knowledge Retrieval Augmented Generation Framework for Domain Specific Question Answering. 1361-1379 - Tanja Baeumel, Daniil Gurgurov, Yusser Al Ghussin, Josef van Genabith, Simon Ostermann:

Modular Arithmetic: Language Models Solve Math Digit by Digit. 1380-1409 - Leon Hammerla, Alexander Mehler, Giuseppe Abrami:

Standardizing Heterogeneous Corpora with DUUR: A Dual Data- and Process-Oriented Approach to Enhancing NLP Pipeline Integration. 1410-1425 - Soham Chaudhuri, Dipanjan Saha, Dipankar Das:

LLMForum-RAG: A Multilingual, Multi-domain Framework for Factual Reasoning via Weighted Retrieval and LLM Collaboration. 1426-1431 - Leon Hammerla, Andy Lücking, Carolin Reinert, Alexander Mehler:

D-Neg: Syntax-Aware Graph Reasoning for Negation Detection. 1432-1454 - Abeer Aldayel, Areej Alokaili:

EMBRACE: Shaping Inclusive Opinion Representation by Aligning Implicit Conversations with Social Norms. 1455-1472 - Lei Sheng, Xu Shuai Shuai:

CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning. 1473-1496 - Lei Sheng, Xu Shuai Shuai:

SLM-SQL: An Exploration of Small Language Models for Text-to-SQL. 1497-1512 - Sher Badshah, Moamen Moustafa, Hassan Sajjad:

CLEV: LLM-Based Evaluation Through Lightweight Efficient Voting for Free-Form Question-Answering. 1513-1531 - Yaxuan Ren, Krithika Ramesh, Yaxing Yao, Anjalie Field:

How do we measure privacy in text? A survey of text anonymization metrics. 1532-1544 - Saurabh Kumar Pandey, Sougata Saha, Monojit Choudhury:

To Generate or Discriminate? Methodological Considerations for Measuring Cultural Alignment in LLMs. 1545-1562 - Ritesh Goru, Shanay Mehta, Prateek Jain:

One-Pass to Reason: Token Duplication and Block-Sparse Mask for Efficient Fine-Tuning on Multi-Turn Reasoning. 1563-1574 - Chaohao Lin, Kaida Wu, Peihao Xiang, Yanzhao Wu, Ou Bai:

CLL-RetICL: Contrastive Linguistic Label Retrieval-based In-Context Learning for Text Classification via Large Language Models. 1575-1590 - Seyoung Song, Haneul Yoo, Jiho Jin, Kyunghyun Cho, Alice Oh:

Shared Heritage, Distinct Writing: Rethinking Resource Selection for East Asian Historical Documents. 1591-1610 - Srihari Bandarupalli, Bhavana Akkiraju, Sri Charan Devarakonda, Vamshi Raghu Simha Narasinga, Anil Vuppala:

Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data. 1611-1617 - Shadab Hafiz Choudhury, Asha Kumar, Lara J. Martin:

Evaluating Human-LLM Representation Alignment: A Case Study on Affective Sentence Generation for Augmentative and Alternative Communication. 1618-1637 - T. Karthikeyan Himanshu Wadhwa, Manish Gupta Mausam:

Towards Multimodal Question Answering in Educational Domain. 1638-1649 - Tuan-Dung Le, Shohreh Haddadan, Thanh Thieu:

ACE-ICD: Acronym Expansion As Data Augmentation For Automated ICD Coding. 1650-1662 - Lena Trigg, Dean F. Hougen:

Logical Table-to-Text Generation: Challenges, Methods, and Reasoning. 1663-1677 - Christos-Nikolaos Zacharopoulos, Revekka Kyriakoglou:

Decoding Emergent Big Five Traits in Large Language Models: Temperature-Dependent Expression and Architectural Clustering. 1678-1685 - Zony Yu, Yuqiao Wen, Lili Mou:

Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much). 1686-1694 - Takumi Takahashi, Tomoki Taniguchi, Chencheng Zhu, Tomoko Ohkuma:

Can LLMs Learn from Their Mistakes? Self-Correcting Instruction Tuning for Named Entity Recognition. 1695-1712 - Zhenyu Bi, Meng Lu, Yang Li, Swastik Roy, Weijie Guan, Morteza Ziyadi, Xuan Wang:

OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning. 1713-1728 - Harsh Kohli, Helian Feng, Lenon Minorics, Bhoomit Vasani, Xin He, Ali Kebarighotbi:

EWoRA: Expert Weighted Low-Rank Adaptation for Heterogeneous Data. 1729-1737 - Harshita Narnoli, Mihai Surdeanu:

The Alchemy of Thought: Understanding In-Context Learning Through Supervised Classification. 1738-1757 - Sidharth Pulipaka, Ashwin Sankar, Raj Dabre:

Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts. 1758-1776 - Matan Avitan, Moran Baruch, Nir Drucker, Itamar Zimerman, Yoav Goldberg:

Efficient Decoding Methods for Language Models on Encrypted Data. 1777-1794 - Zihan Wang, Naoki Yoshinaga:

Commentary Generation from Multimodal Game Data for Esports Moments in Multiplayer Strategy Games. 1795-1807 - Yingqi Hu, Zhuo Zhang, Jingyuan Zhang, Jinghua Wang, Qifan Wang, Lizhen Qu, Zenglin Xu:

Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models. 1808-1827 - Rohit Saxena, Pasquale Minervini, Frank Keller:

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization. 1828-1844 - Maxim Gordeev, Aleksandr Zuev, Mikhail Bakulin, Andrey Latyshev, Dmitry Kozlov, Yiwu Yao, Voronova Anastasia:

Hypercomplex Transformer: Novel Attention Mechanism. 1845-1851 - Christopher Rashidian, Sabine Brunswicker:

Merging Two Grammar Worlds: Exploring the Relationship between Universal Dependencies and Signal Temporal Logic. 1852-1866 - Debarchan Basu, Shashwat Bhardwaj, Vaibhav Sharma, Pooja Singh, Sandeep Kumar:

GARuD: Guided Alignment of Representations using Distillation for Ultra-Low-Resource Languages. 1867-1880 - Akhilesh Aravapalli, Mounika Marreddy, Radhika Mamidi, Manish Gupta, Subba Reddy Oota:

IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages? 1881-1905 - Jinu Nyachhyon, Mridul Sharma, Prajwal Thapa, Bal Krishna Bal:

Consolidating and Developing Benchmarking Datasets for the Nepali Natural Language Understanding Tasks. 1906-1925 - Abhinav P. M, Priyanka Dasari, Nagaraju Vuppala, Parameswari Krishnamurthy:

Family helps one another: Dravidian NLP suite for Natural Language Understanding. 1926-1941 - Haoran Wang, Kai Shu:

Spatial-Aware Visual Program Guided Reasoning for Answering Complex Visual Questions. 1942-1953 - Jintao Liang, Gang Su, Huifeng Lin, You Wu, Rui Zhao, Ziyue Li:

Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges. 1954-1966 - Amar Parajuli, Koninika Pal:

Extracting Numeric Assertions from Text. 1967-1977 - Maya Srikanth, Run Chen, Julia Hirschberg:

Mixed Signals: Understanding Model Disagreement in Multimodal Empathy Detection. 1978-1991 - Bhoomit Vasani, Jack FitzGerald, Anjie Fang, Sushmit Vaish:

PHLoRA: data-free Post-hoc Low-Rank Adapter extraction from full-rank checkpoint. 1992-1999 - Sayantan Pal, Souvik Das, Rohini K. Srihari:

Harmonious Minds: Benchmarking Intertwined Reasoning of Human Personality and Musical Preference. 2000-2018 - Blessed Guda, Lawrence Francis, Gabrial Zencha Ashungafac, Carlee Joe-Wong, Moise Busogi:

Quantifying and Mitigating Selection Bias in LLMs: A Transferable LoRA Fine-Tuning and Efficient Majority Voting Approach. 2019-2038 - Pritish Sahu, Anirudh Som, Ajay Divakaran, Dimitra Vergyri:

MINDS: A Cross-Cultural Dialogue Corpus for Social Norm Classification and Adherence Detection. 2039-2052 - Raavi Gupta, Pranav Hari Panicker, Sumit Bhatia, Ganesh Ramakrishnan:

Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts. 2053-2068 - Wenqi Pei, Hailing Xu, Henry Hengyuan Zhao, Shizheng Hou, Chen Han, Zining Zhang, Pingyi Luo, Bingsheng He:

Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models. 2069-2086 - Ram Mohan Rao Kadiyala, Siddhant Gupta, Jebish Purbey, Srishti Yadav, Suman Debnath, Alejandro Salamanca, Desmond Elliott:

Uncovering Cultural Representation Disparities in Vision-Language Models. 2087-2117 - Shifali Agrahari, Sujit Kumar, Sanasam Ranbir Singh:

Can You Really Trust That Review? ProtoFewRoBERTa and DetectAIRev: A Prototypical Few-Shot Method and Multi-Domain Benchmark for Detecting AI-Generated Reviews. 2118-2140 - Sazia Tabasum Mim, Jack Morris, Manish Dhakal, Yanming Xiu, Maria Gorlatova, Yi Ding:

Can a Unimodal Language Agent Provide Preferences to Tune a Multimodal Vision-Language Model? 2141-2156 - Fariha Anjum Shifa, Muhtasim Ibteda Shochcho, Abdullah Ibne Hanif Arean, Mohammad Ashfaq Ur Rahman, Akm Moshiur Rahman Mazumder, Ahaj Mahhin Faiak, Md Fahim, M. Ashraful Amin, Amin Ahsan Ali, A. K. M. Mahbubur Rahman:

SOMAJGYAAN: A Dataset for Evaluating LLMs on Bangla Culture, Social Knowledge, and Low-Resource Language Adaptation. 2157-2177 - Newaz Ben Alam, Akm Moshiur Rahman Mazumder, Mir Sazzat Hossain, Mysha Samiha, Md Alvi Noor Hossain, Md Fahim, Amin Ahsan Ali, Ashraful Islam, M. Ashraful Amin, A. K. M. Mahbubur Rahman:

CMBan: Cartoon-Driven Meme Contextual Classification Dataset for Bangla. 2178-2194 - Bassamtiano Renaufalgi Irnawan, Yoshimi Suzuki, Noriko Tomuro, Fumiyo Fukumoto:

Multi-Agent Cross-Lingual Veracity Assessment for Explainable Fake News Detection. 2195-2213 - Nihar Sanda, Rajat Shinde, Sumit Nawathe, William Seawright, Shaona Ghosh, Manil Maskey:

GeoSAFE - A Novel Geospatial Artificial Intelligence Safety Assurance Framework and Evaluation for LLM Moderation. 2214-2237 - Mana Makinae, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe:

Investigating Omission as a Latency Reduction Strategy in Simultaneous Speech Translation. 2238-2258 - Adrita Anika, Md Messal Monem Miah:

Evaluating LLMs' Reasoning Over Ordered Procedural Steps. 2259-2267 - Arka Mukherjee, Shreya Ghosh:

mmJEE-Eval: A Bilingual Multimodal Benchmark for Evaluating Scientific Reasoning in Vision-Language Models. 2268-2290 - Krithi Shailya, Akhilesh Kumar Mishra, Gokul S. Krishnan, Balaraman Ravindran:

Where Should I Study? Biased Language Models Decide! Evaluating Fairness in LMs for Academic Recommendations. 2291-2317 - Nirvan Patil, Malhar Abhay Inamdar, Agnivo Gosai, Guruprasad Pathak, Anish Joshi, Anish Joshirao, Raj Dandekar, Rajat Dandekar, Sreedath Panat:

Regional-TinyStories: A Small Language Model Framework for Evaluating Language Learning, Tokenizers, and Datasets. 2318-2367 - Yuchen Zhang, Yuze Gao, Bin Chen, Wenfeng Li, Shuo Sun, Jian Su:

High-Quality Complex Text-to-SQL Data Generation through Chain-of-Verification. 2368-2379 - Beso Mikaberidze, Temo Saghinadze, Simon Ostermann, Philipp Matthias Müller:

Cross-Prompt Encoder for Low-Performing Languages. 2380-2393 - Telem Joyson Singh, Sanasam Ranbir Singh, Priyankoo Sarmah:

An Information-Theoretic Approach to Reducing Fertility in LLMs for Manipuri Machine Translation. 2394-2404 - Dina Pisarevskaya, Arkaitz Zubiaga:

Agent-based Automated Claim Matching with Instruction-following LLMs. 2405-2414 - Ghazal Zamaninejad, MohammadAli SadraeiJavaheri, Farnaz Aghababaloo, Hamideh Rafiee, Milad Molazadeh Oskuee, Amirmohammad Salehoof:

Tooka-SBERT: Lightweight Sentence Embedding models for Persian. 2415-2425















