


default search action
20th ICDAR 2025: Wuhan, China - Workshops Part I
- Lianwen Jin

, Richard Zanibbi
, Veronique Eglin
:
Document Analysis and Recognition - ICDAR 2025 Workshops - Wuhan, China, September 20-21, 2025, Proceedings, Part I. Lecture Notes in Computer Science 16225, Springer 2026, ISBN 978-3-032-09367-7
The Fifth ICDAR International Workshop on Machine Learning (WML 2025)
- Gonzalo Mancera, Aythami Morales

, Julian Fierrez
, Ruben Tolosana
, Alejandro Peña
, Miguel Lopez-Duran
, Francisco Jurado
, Alvaro Ortigosa
:
PBa-LLM: Privacy- and Bias-Aware NLP Using Named-Entity Recognition (NER). 3-20 - Miguel Lopez-Duran

, Julian Fierrez
, Aythami Morales
, Ruben Tolosana
, Oscar Delgado-Mohatar
, Alvaro Ortigosa
:
Benchmarking Graph Neural Networks for Document Layout Analysis in Public Affairs. 21-35 - Zi-Rui Wang:

Improving Handwritten Text Recognition via 3D Attention and Multi-scale Training. 36-52 - Martin Kiss

, Michal Hradis
:
Masked Self-supervised Pre-training for Text Recognition Transformers on Large-Scale Datasets. 53-70 - Surajit Mukherjee, Shivakumara Palaiahnakote, Sukalpa Chanda, Umapada Pal, Tong Lu:

Text Prompt to Image Generation for Classification of Similar and Non-similar Scene Images to Improve Text Spotting Performance. 71-91 - Eric López, Artemis Llabrés

, Ernest Valveny
:
Enhancing Document VQA Models via Retrieval-Augmented Generation. 92-107 - Shashwat Sarkar, Kunal Purkayastha

, Shivakumara Palaiahnakote, Umapada Pal, Muhammad Hammad Saleem, Palash Ghosal:
A New Multimodal Cross-Domain Network for Classification of Challenging Scene Images. 108-123 - Martin Kostelník, Karel Benes

, Michal Hradis
:
TextBite: A Historical Czech Document Dataset for Logical Page Segmentation. 124-140 - Masaki Akiba, Shumpei Takezaki, Daichi Haraguchi, Seiichi Uchida:

Few-Part-Shot Font Generation. 141-157 - Ansh Kushwaha

, Sandeep Khanna
, Lenin Khangjrakpam, Chiranjoy Chattopadhyay
, Gaurav Bhatnagar
:
Non-linear Audio-Visual Storytelling from Scanned Comics: A Character-Centric Approach. 158-174 - Jun Muraoka, Daichi Haraguchi

, Naoto Inoue
, Wataru Shimoda
, Kota Yamaguchi
, Seiichi Uchida
:
Automatic Text Box Placement for Supporting Typographic Design. 175-191 - Lucas De Almeida Bandeira Macedo

, João Paulo Vieira Costa
, João Pedro Felix de Almeida
, Pedro Garcia Freitas
, Weigang Li
:
Visual Document Matching for Zero-Shot Document Classification. 192-208 - Mathias Seuret

, Oliver Traub, Ning Guo
, Florian Kordon
, Thomas Gorges
, Vincent Christlein
:
Evaluating Popular Scene Text Detection and Recognition Methods on Tombstones. 209-225 - Jun Xie, Yiming Xia, Sailong Wu, Ruiqing Wu, Yirong Chen:

Deep Learning for Defect Detection in Answer Document Image. 226-244 - Aniket Gurav, Sukalpa Chanda

, Marius Pedersen:
ResNet-TPP: A Parallel PHOC-PHOS Framework for Zero-Shot Handwritten Word Recognition in Low-Resource Scripts. 245-259 - Adnan Ben Mansour

, Ayoub Karine
, David Naccache
:
Interpret, Prune and Distill Donut: Towards Lightweight VLMs for VQA on Documents. 260-278 - Cuong Tuan Nguyen

, Ngoc Tuan Nguyen
, Triet Hoang Minh Dao
, Nhat Huy Nguyen Minh, Huy Truong Dinh
:
Link Prediction Graph Neural Networks for Structure Recognition of Handwritten Mathematical Expressions. 279-291 - Michael Jungo, Andreas Fischer:

Rule-Based Reinforcement Learning for Document Image Classification with Vision Language Models. 292-309
ICDAR 2025 Workshop on Multi-modal Mathematical Reasoning in Documents (M3RD 2025)
- Merouane Zouaid, Yejing Xie, Harold Mouchère:

Boosting Handwritten Mathematical Expression Recognition Through Contextual Reasoning with Vision Large Language Models (vLLMs). 313-326 - Zhi Chen, Yuhan Yang, Xiangdong Su, Haoran Zhang, Xinxiang Zhou, Wei Chen, Guanglai Gao:

SCANS: An Efficient Geometric Problem Solver with Content-Aware Attention and Adaptive Fusion. 327-343 - Kecheng Liang, Xinyu Li, Weixing Chen, Yang Liu:

GeoGRPO: Investigating the Stepwise-GRPO Enhancement in RLHF Framework. 344-361 - Yaqi Wang, Hui Wang, Yuanping Zhu:

Offline Handwritten Mathematical Formula Recognition Based on Primitive Representation. 362-377 - Changwei Li, Guangping Huang, Zihao Zhou, Qiufeng Wang:

Long Math Reasoning Problem Generation. 378-393

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














