


31st COLING 2025: Abu Dhabi, UAE - Industry Track
- Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert, Kareem Darwish, Apoorv Agarwal:

Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025 - Industry Track, Abu Dhabi, UAE, January 19-24, 2025. Association for Computational Linguistics 2025, ISBN 979-8-89176-197-1

- Minjia Wang, Pingping Lin, Siqi Cai, Shengnan An, Shengjie Ma, Zeqi Lin, Congrui Huang, Bixiong Xu:

STAND-Guard: A Small Task-Adaptive Content Moderation Model. 1-20

- Hai Zhu, Yuankai Guo, Ronggang Dou, Kai Liu:

Query-LIFE: Query-aware Language Image Fusion Embedding for E-Commerce Relevance. 21-28

- Mohammad Kachuee, Sarthak Ahuja, Vaibhav Kumar, Puyang Xu, Xiaohu Liu:

Improving Tool Retrieval by Leveraging Large Language Models for Query Generation. 29-38

- Rafael Teixeira de Lima, Shubham Gupta, Cesar Berrospi Ramis, Lokesh Mishra, Michele Dolfi, Peter W. J. Staar, Panagiotis Vagenas:

Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems. 39-57

- David Farr, Nico Manzonelli, Iain Cruickshank, Jevin West:

RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Linguistic Classifiers. 58-67

- Harsha Vardhan Khurdula, Basem Rizk, Indus Khaitan:

Beyond Visual Understanding Introducing PARROT-360V for Vision Language Model Benchmarking. 68-75

- Yiwen Duan, Yonghong Yu, Xiaoming Zhao, Yichang Wu, Wenbo Liu:

PDC & DM-SFT: A Road for LLM SQL Bug-Fix Enhancing. 76-90

- Sanjay Agrawal, Deep Nayak, Vivek Sembium:

Multilingual Continual Learning using Attention Distillation. 91-99

- Amit Agarwal, Srikant Panda, Kulbhushan Pachauri:

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding. 100-114

- Zhao Wang, Briti Gangopadhyay, Mengjie Zhao, Shingo Takamatsu:

OKG: On-the-Fly Keyword Generation in Sponsored Search Advertising. 115-127

- Dezhi Ye, Junwei Hu, Jiabin Fan, Bowen Tian, Jie Liu, Haijin Liang, Jin Ma:

Best Practices for Distilling Large Language Models into BERT for Web Search Ranking. 128-135

- Sanjay Agrawal, Faizan Ahemad, Vivek Sembium:

Rationale-Guided Distillation for E-Commerce Relevance Classification: Bridging Large Language Models and Lightweight Cross-Encoders. 136-148

- Diya Li, Asim Kadav, Aijing Gao, Rui Li, Richard Bourgon:

Automated Clinical Data Extraction with Knowledge Conditioned LLMs. 149-162

- Seyed Amin Tabatabaei, Sarah Fancher, Michael Parsons, Arian Askari:

Can Large Language Models Serve as Effective Classifiers for Hierarchical Multi-Label Classification of Scientific Documents at Industrial Scale? 163-174

- Elie Dina, Rania Ayachi Kibech, Miguel Couceiro:

EDAR: A pipeline for Emotion and Dialogue Act Recognition. 175-186

- Ashok Urlana, Charaka Vinayak Kumar, Bala Mallikarjunarao Garlapati, Ajeet Kumar Singh, Rahul Mishra:

No Size Fits All: The Perils and Pitfalls of Leveraging LLMs Vary with Company Size. 187-203

- Toshiki Kuramoto, Jun Suzuki:

Predicting Fine-tuned Performance on Larger Datasets Before Creating Them. 204-212

- Navid Madani, Anusha Bagalkotkar, Supriya Anand, Gabriel Arnson, Rohini K. Srihari, Kenneth Joseph:

A Recipe For Building a Compliant Real Estate Chatbot. 213-235

- Uddeshya Singh, Ravi Shankar Devanapalli, Gowtham Bellala, Vikas Goel:

Geo-Spatially Informed Models for Geocoding Unstructured Addresses. 236-242

- Tobias Deußer, Max Hahnbück, Tobias Uelwer, Cong Zhao, Christian Bauckhage, Rafet Sifa:

Resource-Efficient Anonymization of Textual Data via Knowledge Distillation from Large Language Models. 243-250

- Maia Aguirre, Ariane Méndez, Arantza del Pozo, María Inés Torres, Manuel Torralbo:

Fine-Tuning Medium-Scale LLMs for Joint Intent Classification and Slot Filling: A Data-Efficient and Cost-Effective Solution for SMEs. 251-262

- Zusheng Tan, Xinyi Zhong, Jing-Yu Ji, Wei Jiang, Billy Chiu:

Enhancing Large Language Models for Scientific Multimodal Summarization with Multimodal Output. 263-275

- Mireia Hernandez Caralt, Ivan Sekulic, Filip Carevic, Nghia Khau, Diana Nicoleta Popa, Bruna Guedes, Victor Guimarães, Zeyu Yang, André Ferreira Manso, Meghana M. Reddy, Paolo Rosso, Roland Mathis:

"Stupid robot, I want to speak to a human!" User Frustration Detection in Task-Oriented Dialog Systems. 276-285

- Harsh Saini, Md. Tahmid Rahman Laskar, Cheng Chen, Elham Mohammadi, David Rossouw:

LLM Evaluate: An Industry-Focused Evaluation Tool for Large Language Models. 286-294

- Gilchan Park, Paul Baity, Byung-Jun Yoon, Adolfy Hoisie:

Enhancing Future Link Prediction in Quantum Computing Semantic Networks through LLM-Initiated Node Features. 295-304

- Hunter Heidenreich, Ratish Dalvi, Nikhil Verma, Yosheb Getachew:

Page Stream Segmentation with LLMs: Challenges and Applications in Insurance Document Automation. 305-317

- Xiaoping Shen, Yekun Chai:

Graph-Augmented Open-Domain Multi-Document Summarization. 318-330

- Jing Wu, Shushu Wang, Kai Fan, Wei Luo, Minpeng Liao, Zhongqiang Huang:

Improve Speech Translation Through Text Rewrite. 331-342

- Johannes Kirmayr, Lukas Stappen, Phillip Schneider, Florian Matthes, Elisabeth André:

CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding. 343-357

- Riyaz Ahmad Bhat, Jaydeep Sen:

XTR meets ColBERTv2: Adding ColBERTv2 Optimizations to XTR. 358-365

- Dahyun Kim, Yungi Kim, Wonho Song, Hyeonwoo Kim, Yunsu Kim, Sanghoon Kim, Chanjun Park:

sDPO: Don't Use Your Data All at Once. 366-373

- Yuya Asano, Sabit Hassan, Paras Sharma, Anthony B. Sicilia, Katherine Atwell, Diane J. Litman, Malihe Alikhani:

Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI. 374-386

- Parshin Shojaee, Sai Sree Harsha, Dan Luo, Akash Maharaj, Tong Yu, Yunyao Li:

Federated Retrieval Augmented Generation for Multi-Product Question Answering. 387-397

- Masha Belyi, Robert Friel, Shuai Shao, Atindriyo Sanyal:

Luna: A Lightweight Evaluation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost. 398-409

- Boqi Chen, Anuj Khare, Gaurav Kumar, Arjun R. Akula, Pradyumna Narayana:

Seeing Beyond: Enhancing Visual Question Answering with Multi-Modal Retrieval. 410-421

- Yungeng Liu, Zan Chen, Yuguang Wang, Yiqing Shen:

AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering. 422-430

- Yuanhao Yue, Chengyu Wang, Jun Huang, Peng Wang:

Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud. 431-444

- Hancheol Park, Geonmin Kim:

Where do LLMs Encode the Knowledge to Assess the Ambiguity? 445-452

- Paramita Das, Amartya Roy, Ritabrata Chakraborty, Animesh Mukherjee:

On the effective transfer of knowledge from English to Hindi Wikipedia. 453-465

- Shaowei Zhang, Deyi Xiong:

BackMATH: Towards Backward Reasoning for Solving Math Problems Step by Step. 466-482

- Yincen Qu, Hengyue Liu, Kun Wang, Xiangying Dai, Xiaoou Lu, Hui Zhou, Chao Ma:

Deploying Multi-task Online Server with Large Language Model. 483-495

- Hanchen Su, Wei Luo, Yashar Mehdad, Wei Han, Elaine Liu, Wayne Zhang, Mia Zhao, Joy Zhang:

LLM-Friendly Knowledge Representation for Customer Support. 496-504

- Divesh R. Kubal, Apurva Nagvenkar:

Leveraging Multilingual Models for Robust Grammatical Error Correction Across Low-Resource Languages. 505-510

- Yiran Xie, Debin Xiao, Ping Wang, Shuming Liu:

A Simple yet Efficient Prompt Compression Method for Text Classification Data Annotation Using LLM. 511-521

- Jeehyun Lee, Seung-Moo Yang, Won Ik Cho:

AMAN: Agent for Mentoring and Assisting Newbies in MMORPG. 522-532

- Elena Senger, Yuri Campbell, Rob van der Goot, Barbara Plank:

KARRIEREWEGE: A large scale Career Path Prediction Dataset. 533-545

- Baban Gain, Dibyanayan Bandyopadhyay, Samrat Mukherjee, Aryan Sahoo, Saswati Dana, Palanivel A. Kodeswaran, Sayandeep Sen, Asif Ekbal, Dinesh Garg:

Transforming Code Understanding: Clustering-Based Retrieval for Improved Summarization in Domain-Specific Languages. 546-560

- Frederic Thomas Kirstein, Terry Lima Ruas, Bela Gipp:

Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator. 561-574

- Mengtian Guo, Mutasem Al-Darabsah, Choon Hui Teo, Jonathan May, Tarun Agarwal, Rahul Bhagat:

Learning to Rewrite Negation Queries in Product Search. 575-582

- William Watson, Nicole Cho, Nishan Srishankar, Zhen Zeng, Lucas Cecchi, Daniel Scott, Suchetha Siddagangappa, Rachneet Kaur, Tucker Balch, Manuela Veloso:

LAW: Legal Agentic Workflows for Custody and Fund Services Contracts. 583-594

- Riyaz Ahmad Bhat, Jaydeep Sen, Rudra Murthy, Vignesh P:

UR2N: Unified Retriever and ReraNker. 595-602

- Chi Zhang, Vivek V. Datla, Aditya Shrivastava, Alfy Samuel, Zhiqi Huang, Anoop Kumar, Daben Liu:

An Automatic Method to Estimate Correctness of RAG. 603-611

- Junghoon Kang, Keunjoo Tak, Joungsu Choi, Myunghyun Kim, Junyoung Jang, Youjin Kang:

DaCoM: Strategies to Construct Domain-specific Low-resource Language Machine Translation Dataset. 612-624

- Ahmed Masry, Megh Thakkar, Aayush Bajaj, Aaryaman Kartha, Enamul Hoque, Shafiq Joty:

ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild. 625-643

- Akshara Prabhakar, Yuanzhi Li, Karthik Narasimhan, Sham M. Kakade, Eran Malach, Samy Jelassi:

LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks. 644-655

- Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T. Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Minh Chien Vu, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin P. Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Barbosa Junior, Aleksandr Drozd, Jordan Clive, Kshitij Gupta, Liangyu Chen, Qi Sun, Ken Tsui, Nour Moustafa-Fahmy, Nicolo Monti, Tai Dang, Ziyang Luo, Tien-Tung Bui, Roberto Navigli, Virendra Mehta, Matthew Blumberg, Victor May, Hiep Nguyen, Sampo Pyysalo:

Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code. 656-678

- Zhipeng Li, Shuang Zheng, Jiaping Xiao, Xianneng Li, Lei Wang:

UCTG: A Unified Controllable Text Generation Framework for Query Auto-Completion. 679-688

- Aaron Zheng, Mansi Rana, Andreas Stolcke:

Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings. 689-696

- Mansi Rana, Kadri Hacioglu, Sindhuja Gopalan, Maragathamani Boothalingam:

Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems. 697-706

- Jessica Foo, Shaun Khoo:

LionGuard: A Contextualized Moderation Classifier to Tackle Localized Unsafe Content. 707-731

- Sayantan Adak, Pauras Mangesh Meher, Paramita Das, Animesh Mukherjee:

REVerSum: A Multi-staged Retrieval-Augmented Generation Method to Enhance Wikipedia Tail Biographies through Personal Narratives. 732-750

- Lorenzo Malandri, Fabio Mercorio, Mario Mezzanzanica, Filippo Pallucchini:

RE-FIN: Retrieval-based Enrichment for Financial data. 751-759

- Cheoneum Park, Seohyeong Jeong, Minsang Kim, KyungTae Lim, Yong-Hun Lee:

SCV: Light and Effective Multi-Vector Retrieval with Sequence Compressive Vectors. 760-770

- Yuta Nozaki, Dai Nakashima, Ryo Sato, Naoki Asaba, Shintaro Kawamura:

Efficient Vocabulary Reduction for Small Language Models. 771-783

- Zeyuan Chen, Haiyan Wu, Kaixin Wu, Wei Chen, Mingjie Zhong, Jia Xu, Zhongyi Liu, Wei Zhang:

Towards Boosting LLMs-driven Relevance Modeling with Progressive Retrieved Behavior-augmented Prompting. 784-793

- Changwoo Chun, Daniel Rim, Juhee Park:

LLM ContextBridge: A Hybrid Approach for Intent and Dialogue Understanding in IVSR. 794-806

- Saeed Abbasi, Aijun An, Heidar Davoudi, Ronald Di Carlantonio, Gary Farmaner:

Neural Document Segmentation Using Weighted Sliding Windows with Transformer Encoders. 807-816

- Shuxi Guo, Qi Qi, Haifeng Sun, Jianxin Liao, Jingyu Wang:

RecStream: Graph-aware Stream Management for Concurrent Recommendation Model Online Serving. 817-826

- Wenting Tan, Dongxiao Chen, Jieting Xue, Zihao Wang, Taijie Chen:

Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models. 827-839















