


default search action
19th ICDAR 2025: Wuhan, China - Part III
- Xu-Cheng Yin

, Dimosthenis Karatzas
, Daniel Lopresti
:
Document Analysis and Recognition - ICDAR 2025 - 19th International Conference, Wuhan, China, September 16-21, 2025, Proceedings, Part III. Lecture Notes in Computer Science 16025, Springer 2026, ISBN 978-3-032-04623-9
Poster Papers
- Caroline Koudoro-Parfait, Marceau Hernandez, Gaël Lejeune, Yoann Dupont:

Epimethee - A Workflow from OCR to Spatial Mapping. 3-21 - Sandeep Khanna

, Atanu Saha
, Rahul Kumar Ray
, Rakesh Patibanda
, Chiranjoy Chattopadhyay
:
From Notes to Keys: A VR Learning Environment for Sheet Music Interpretation. 22-39 - Runqing Yan, Jianye An:

UniOne: A Document Parsing Dataset for Cross Task Association Modeling. 40-55 - Kehinde Ajayi

, Yi He, Jian Wu:
Uncertainty-Aware Complex Scientific Table Data Extraction. 56-73 - Tri-Cong Pham

, Mickaël Coustaty
, Aurélie Joseph
, Gaspar Deloin
, Vincent Poulain D'Andecy, Antoine Doucet
:
Few-Shot Document Classification in Real Applications: Boosting Precision with Novelty Detection. 74-98 - Liu Yong, Yuanshuang Miao, Lyuwen Huang:

Att-BiGRU-MulCNN: A New Approach for Intent Classification in Apple Pest and Disease. 99-114 - Kumari Priya, Suraj Kumar

, Aritra Dey
, Chandranath Adak
, Soumi Chattopadhyay
, Sukalpa Chanda
, Simone Marinai
:
Graph Convolutional Teacher-Student Framework for Writer Inspection from Intra-variable Handwritten Words. 115-129 - Somraj Gautam

, Nachiketa Purohit, Gaurav Harit
:
Table Detection with Active Learning. 130-146 - Lei Hu, Dongwei Liu, Yujia Chen, Zhenwei Wang, Yamin Li:

ExamCleaner: Examination-Paper Handwritten Text Erasure via Large Receptive Field Context Anchor Attention. 147-162 - Jing-Yao Zhang, Heng Zhang

, Fei Yin
:
CHSAM: Efficient Scene Text Segmentation via SAM with Convolutional Adapters and Hierarchical Decoding. 163-179 - Arnab Halder

, Shivakumara Palaiahnakote
, Umapada Pal
, Michael Blumenstein
, Yue Lu
:
A New Fourier-Attention Guided Approach for Domain-Agnostic Text Localization. 180-199 - Yiheng Huang

, Shuang She, Zewei Wei, Jianmin Lin
, Ming Yang, Wenyin Liu
:
StrokeNet: Unveiling How to Learn Fine-Grained Interactions in Online Handwritten Stroke Classification. 200-217 - Kai Ding, Sheng Jian, Lianwen Jin:

HisDoc-DETR: Integrating Semantic Learning and Feature Fusion for Historical Document Layout Analysis. 218-237 - Huiying Hu

, Zhicheng He
, Yixiao Zhou
, Tongwei Zhang, Xiaoqing Lyu
:
Multimodal Content Alignment with LLM for Visual Presentation of Papers. 238-256 - Yiming Wang

, Hongxi Wei
, Heng Wang
, Bo Sun:
VMF-Net: Visual-Aware Multi-representation Fusion Network for Artifact-Free Handwritten Mathematical Expressions Generation. 257-269 - Minsoo Khang

, Sang Chul Jung, Sungrae Park
, Teakgyu Hong:
KIEval: Evaluation Metric for Document Key Information Extraction. 270-286 - Jan Kohút

, Martin Docekal
, Michal Hradis
, Marek Vasko
:
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction. 287-304 - Giang Tran Thi Cam

, Cam-Nguyen Tran-Nhu
, Thuyen Tran Doan
, Thanh Duc Ngo
:
Towards Understanding the Logical Layout of Scene Text in Signboard Images. 305-322 - Glen Pouliquen

, Joseph Chazalon
, Guillaume Chiron, Thierry Géraud, Ahmad Montaser Awal
:
Verification of Dynamic Holographic Behavior in Identity Documents. 323-339 - Mariona Coll Ardanuy

, Iban Berganzo-Besga
, Ramon Sarobe
, Coral Cuadrada
:
Evaluating Handwritten Text Recognition in Medieval Notarial Manuscripts: A New Dataset and Comprehensive Analysis. 340-357 - Yaowei Yang, Zhonghua Sun

, Kaisaier Tuerxun
, Kurban Ubul
:
Scene Script Identification Using Dense Hierarchical Semantic Fusion. 358-374 - Nam Van Hai Phan

, Khoa Minh Nguyen, Thanh Trung Nguyen
, Trung Thanh Pham
, Phuong-Nam Tran
, Duc Ngoc Minh Dang
:
Mask CoMER: Enhancing Handwritten Mathematical Expression Recognition with Masked Language Pretraining and Regularization. 375-390 - Lisa Koopmans

, Maruf A. Dhali
, Lambert Schomaker
:
Self-HTR: A Novel Self-supervised Handwritten Text Recognition Framework Using Generative Adversarial Networks. 391-407 - Qiuming Luo

, Tao Zeng
, Xuan Wei, Chang Kong
:
Radical Sequence Encoding with Fine-Tuned CLIP for Handwritten Chinese Character Recognition. 408-424 - Rafael Sterzinger, Marco Peer

, Robert Sablatnig
:
Few-Shot Segmentation of Historical Maps via Linear Probing of Vision Foundation Models. 425-442 - Benjamin Kiessling

:
Version 5 of the Kraken ATR Engine for the Humanities. 443-458 - Hetao Wu, Kunchi Li

, Xu-Yao Zhang, Qiufeng Wang, Da-Han Wang:
OracleGCD: Generalized Category Discovery for Oracle Bone Scripts. 459-475 - Wenjun Sun

, Nancy Girdhar
, Tran Thi Hong Hanh
, Carlos-Emiliano González-Gallardo
, Mickaël Coustaty
, Antoine Doucet
:
Ar-Q-Former: Historical Newspaper Article Separation Based on Multimodal Transformer Structure. 476-492 - Baharan Pourahmadi

, Morten Sielnik Andersen
, Jakob Povl Holck
, Mogens Kragsig Jensen
, Mads Toudal Frandsen
, René Lynge Eriksen
:
Challenges in Revealing Readable Text from Fragments Hidden in Book Bindings: A Case Study from the Herlufsholm Collection. 493-508 - Yu Zhou

, Zhengxu Jin, Jianjun He
, Baochun Wu, Xinshu Cui, Ruirui Zheng
:
VLMAWR: A Method for Manchu Archives Word Recognition Based on Vision-Language Model. 509-525 - Sattaya Singkul

, Atthakorn Petchsod, Panya Sunantasaengtong, Theerat Sakdejayont, Tawunrat Chalothorn
:
Optimizing Thai-English Spoken Question Answering Interaction for Open Environments with Limited Resources. 526-544 - Suqiong Zhang, Dongyi Fan, Yi Liu, Lili He, Zuohua Ding:

Large Language Models for Online Log Parsing in AIOps. 545-562 - Siddartha Reddy, Harikrishnan P. M.

, Goutham Vignesh
, Varun V
, Vishal Vaddina
:
DocAnnot - Accelerating the Creation of Key Information Extraction Datasets with GenAI-Powered Auto-annotation. 563-576 - Shaon Bhattacharyya, Souvik Ghosh

, Prantik Deb
, Ajoy Mondal
, C. V. Jawahar
:
Adapting Vision-Language Models for Hindi OCR. 577-594 - Chang Liu, Elisa H. Barney Smith:

Watch and Act: Multi-orientation Open-Set Scene Text Recognition via Dynamic Expert Routing. 595-612 - Lei Kang

, Xuanshuo Fu
, Oriol Ramos Terrades
, Javier Vazquez-Corral
, Ernest Valveny
, Dimosthenis Karatzas
:
LLM-Driven Medical Document Analysis: Enhancing Trustworthy Pathology and Differential Diagnosis. 613-628 - Ruofan Li

, Wei Zhang
, Yong Liu
:
DAFSVFND: Dual Attention Fusion Network for Fake News Detection on Short Video Platforms. 629-646

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














