


26th DICTA 2025: Adelaide, Australia
- International Conference on Digital Image Computing: Techniques and Applications, DICTA 2025, Adelaide, Australia, December 3-5, 2025. IEEE 2025, ISBN 979-8-3315-7145-0

- Gia Khanh Nguyen, Yifeng Huang, Minh Hoai:

Can Current AI Models Count What We Mean, Not What They See? A Benchmark and Systematic Evaluation. 1-8 - Muhammad Umer Ramzan, Ali Zia, Abdelwahed Khamis, Noman Ali, Usman Ali, Wei Xiang:

Split-Fuse-Transport: Annotation-Free Saliency via Dual Clustering and Optimal Transport Alignment. 1-8 - Anwaar Ulhaq, Khizer Ali, Jahan Hassan, Sajid Javed, Louise Hardman:

Plastic-GPT: Multi-Hop Visual Reasoning for Marine Plastic Detection, Polymer Classification, and Recyclability Assessment. 1-8 - Nilesh Ramgolam:

Learning to Defer to A Population with Limited Demonstrations. 1-7 - Usman Ali, Ali Zia, Abdul Rehman, Muhammad Umer Ramzan, Zohaib Hassan, Talha Sattar, Jing Wang, Wei Xiang:

2D-3D Feature Fusion via Cross-Modal Latent Synthesis and Attention-Guided Restoration for Industrial Anomaly Detection. 1-8 - Ahalya Ravendran, Madhawa Perera, Feng Xu, Lars Petersson, Dadong Wang, Xun Li:

IntentFuse: Language-Guided 3D Scene Understanding via Prompt Filtering and Fusion. 1-7 - Sonain Jamil, Damien Muselet, Alain Trémeau, Philippe Colantoni:

Dance3DRelight: Single-Image 3D Reconstruction and Physically-Based Relighting of Dance Performances. 1-8 - Mohammad Al-Fawa'reh, Brandon Abela, Luke Kelly, Jumana Abu-Khalaf, Martin Masek, David Suter, Ashu Gupta:

Estimating Femoral Neck Bone Mineral Density from Distal Radius X-Rays Using Deep Learning. 1-10 - Abderraouf Amrani, Hamid Laga, Volker Framenau, Melissa L. Thomas:

Dynamic Adaptive Sampling for Accurate Image-Based 3D Insect Reconstruction Using Neural Implicit Surfaces. 1-8 - Kavindya Imbulgoda, Steven Korevaar, Ruwan B. Tennakoon, Alireza Bab-Hadiashar:

Variance-Penalized Robust Learning for Pareto-Optimal Group Robustness Under Clean and Noisy Annotations. 1-8 - Proloy Kumar Mondal, Chayan Mondal, Md Ariful Islam Mozumder, Duc-Son Pham, Hee-Cheol Kim:

MedAttnNet: A Novel Architecture for Precise Brain Cancer Classification. 1-8 - Ufaq Khan, Umair Nawaz, Mustaqeem Khan, Abdulmotaleb El Saddik:

Robosurg: Resilience of Vision-Language Models Against Adversarial Attacks in Robotic Surgery. 1-8 - Ankit Yadav, Lingqiao Liu, Yuankai Qi:

Exploring Primitive Visual Measurement Understanding and the Role of Output Format in Learning in Vision-Language Models. 1-8 - Anirudh Atmakuru, Pronab Sarker, Antu Chowdhury, Prabal Datta Barua, U. Rajendra Acharya, Abdul Hafeez-Baig, Subrata Chakraborty:

Understanding Non-Verbal Vocalizations from Minimally-Verbal Autistic Individuals: A Transfer Learning Approach. 1-8 - Chaohan Wang, Qi Chen, Yutong Xie, Qi Wu:

Filling in the Missing Piece: Advancing Automated Radiology Report Generation with Clinical Insights. 1-8 - Md Ismail Hossen, Mohammad Aminul Islam, Mohammad Awrangjeb, Shirui Pan:

FDR-CultivarNet: Feature Decomposition and Reconstruction Network for Few-Shot Cultivar Leaf Classification. 1-8 - Mehwish Mehmood, Shahzaib Iqbal, Tariq Mahmood Khan, Ivor T. A. Spence, Muhammad Fahim:

LFRA-Net: A Lightweight Focal and Region-Aware Attention Network for Retinal Vessel Segmentation. 1-8 - Chayan Mondal, Chai Ken Kai, Duc-Son Pham, Tele Tan, Tom Gedeon, Ashu Gupta:

Generating Clinically Relevant Reports from Chest X-Rays for Cardiomegaly Diagnosis. 1-8 - Roland Croft, Brian Du, Darcy Joseph, Sharath Kumar:

Investigating Adversarial Robustness Against Preprocessing Used in Blackbox Face Recognition. 1-8 - Feng Chen, Bohan Zhuang, Qi Wu:

Streaming Video Diffusion: Online Video Editing with Diffusion Models. 1-9 - Sadia Islam Sharmi, Sajib Saha, G. M. Atiqur Rahaman:

Resvit-Ippm: A Hybrid CNN-Transformer Architecture for Accurate and Explainable Diabetic Foot Ulcer Classification. 1-7 - Muhammad Zeeshan Khan, Anuroop Gaddam:

PointQA: Multi-Modality Guided Cross-Attention for 3D Visual Question Answering on Point Clouds. 1-7 - Nadhir Hassen, Johan Verjans:

Flow-Selectivity SSM: A Generative State-Space Model with History-Aware GFlowNet Policies. 1-21 - Tonmay Sen, G. M. Atiqur Rahaman, Sajib Saha:

A Lightweight CNN for Brain Tumor Classification in Resource-Constrained Settings. 1-8 - Chaohan Wang, Qi Chen, Minh-Son To, Numan Kutaiba, Jae-Gon Yoo, Yutong Xie, Qi Wu:

X-Gen: Enhancing Radiology Report Generation via LLM-Driven Data Augmentation and Decoupled Training. 1-8 - Siddharath Malavalli Nagesh, Rohil Sagar, Stellin John George, Elakkiya Rajasekar, J. Angel Arul Jothi:

Intelligent Prescription Digitization: Leveraging Multi-Engine OCR and Deep Contextual NLP for Reliable Medical Data Structuring. 1-8 - Ryan Faulkner, Anh-Dzung Doan, Simon Ratcliffe, Luke Haub, Ian D. Reid, Tat-Jun Chin:

Simultaneous Diffusion Sampling for Conditional LiDAR generation. 1-8 - Xinyu Huo, Yige Peng, Yupeng Xu, Jinman Kim, Suqin Yu, Lei Bi:

mmDFC: Multi-Modality Dynamic Fusion and Modal-Decoupled Alternating Optimization for Retinal Vessel Occlusion Prognosis. 1-6 - Md. Asikuzzaman, Katerina Biron, Tri-Tan Cao:

Maritime Vessel Classification in Low-Resolution Synthetic Aperture Radar Imagery. 1-8 - Farhana Yasmin, Mahade Hasan, Shahab A. Abdulla, Md. Mehedi Hassan, Radjabov Sukhrob Radjabovich, Abdolraheem Khader, Ali Ahmed, Yu Xue:

Optimizing Brain Tumor Segmentation Networks Through Evolutionary Neural Architecture Search. 1-8 - Xuanhua Yin, Dingxin Zhang, Yu Feng, Shunqi Mao, Jianhui Yu, Weidong Cai:

Beyond Random Masking: A Dual-Stream Approach for Rotation-Invariant Point Cloud Masked Autoencoders. 1-8 - Liqian Feng, Lintao Wang, Kun Hu, Dehui Kong, Zhiyong Wang:

Text2Sign Diffusion: A Generative Approach for Gloss-Free Sign Language Production. 1-8 - Angus Maiden, Trong Viet Anh Nguyen, Bahareh Nakisa:

Complex Facial Expression Recognition Using Deep Knowledge Distillation of Basic Features. 1-8 - Nhi Kieu, Kien Nguyen, Arnold Wiliem, Clinton Fookes, Sridha Sridharan:

Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation. 1-8 - Mahade Hasan, Farhana Yasmin, Haipeng Liu, Md. Mehedi Hassan, Radjabov Sukhrob Radjabovich, Yu Xue:

Automated Polyp Segmentation Using Evolutionary Neural Architecture Search. 1-7 - Ryan Chappell, Chayan Banerjee, Kien Nguyen, Clinton Fookes:

Physics-Informed Operator Learning for Hemodynamic Modeling. 1-8 - Zechao Sun, Shuying Piao, Haolin Jin, Chang Dong, Lin Yue, Weitong Chen, Luping Zhou:

AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation. 1-9 - Yu Ding, Lingqiao Liu, Peng Wang, Lei Wang:

Evaluating Forgetting in Pretrained Robotic Policy Networks: A Continual Learning Study with Octo. 1-8 - Stephen Tierney:

Single Image Blind Parameter Recovery. 1-8 - Yichen Wu, Xu Liu, Chenxuan Zhao, Xinyu Wu:

Prompt-Guided Dual Latent Steering for Inversion Problems. 1-8 - Muhammad Zafar Iqbal, Anwar Ul Haq, Srimannarayana Grandhi:

Implicit Visual Chain-of-Thought Reasoning for Anatomically Consistent Unsupervised 4D CT Deformable Image Registration. 1-8 - Anwaar Ulhaq, Johan Sebastian Ramirez Vallejo:

Low-Resource Vision-Language Learning for Brain Organoid Mitotic Classification. 1-8 - Haoran Pei, Yuguang Yang, Kexin Liu, Baochang Zhang:

Causally Guided Gaussian Perturbations for Out-of-Distribution Generalization in Medical Imaging. 1-7 - Md. Asikuzzaman, Tristrom Cooke, Syed Imranul Islam, Kazi Yasin Islam, Connor Luckett, Jerome Williams, Ben Yip, Tri-Tan Cao, Sebastien Wong:

Hierarchical Cross-Modal Attention Network for Target Re-Identification. 1-8 - Akito Shinohara, Kohei Fukuda, Hiroaki Aizawa:

Logit Mixture Outlier Exposure for Fine-Grained Out-Of-Distribution Detection. 1-7 - Mario de Jesus da Graca, Jörg Dahlkemper, Peer Stelldinger:

Diffusion-Based Synthetic Brightfield Microscopy Images for Enhanced Single Cell Detection. 1-8 - Mohammad Mahbub Alam:

Predictive Imbalance: Bipartite Matching in DETR-Like Detectors. 1-9 - Chuong Nguyen, Fahira Afzal Maken, Changming Sun, Yulia Arzhaeva, Sundaram Muthu, Michael Salim, Lars Andersson, Brett Downing, Jinguang Tong, Russell Tsuchida, Shayan Azizi, Lachlan Hetherton, Matt Bolger, Shaun Howard, Lars Petersson, Simon Dunstall:

ROSELLA: An Open Computer Vision and Measurement Toolkit for Advanced Manufacturing. 1-8 - Lynn Ricky Jude, J. Angel Arul Jothi, Elakkiya Rajasekar:

A Novel Adversarial Patch Attack on YOLOv8-based Brain Tumor Detection Model. 1-7 - Pengyu Wang, Shuchang Ye, Usman Naseem, Jinman Kim:

MRGAgents: A Multi-Agent Framework for Improved Medical Report Generation with Med-LVLMs. 1-7 - Yixin Wang, Xinyu Wang:

Rethinking Agentic and End-to-End Large Multimodal Models for Vision Tasks. 1-10 - Yuming Chen, Qi Wu, Yutong Xie:

MoE-Enhanced-TTT: Advancing Medical Image Segmentation. 1-8 - Derrick Effah, Ali Zia, Mohammad Awrangjeb, Yongsheng Gao, Sarpong Kwabena:

Weakly Supervised Pixel-Wise Classification of Hyperspectral Images with Noise-Adaptive Hybrid Attention and Triple Contrastive Learning. 1-8 - Mohammad Javad Shokri, Nandakishor Desai, Aravinda S. Rao, Yohanna Kusuma, Bernard Yan, Marimuthu Palaniswami:

Entropy-Guided Slice Selection for Weakly Supervised Binary ASPECTS Classification from Non-Contrast CT. 1-8 - Che-Wei Lee, Yi Chen, Chih-Yuan Hsu, Hsin-Tung Ma, Pei-Yung Hsiao, Li-Chen Fu:

HintOcc: Enhancing BEV-to-3D Reconstruction in Occupancy Prediction with Spatial-Awareness and Dynamic Class Balancing. 1-8 - Nanyu Dong, Townim F. Chowdhury, Hieu Phan, Mark Jenkinson, Johan Verjans, Zhibin Liao:

From Healthy Scans to Annotated Tumors: A Tumor Fabrication Framework for 3D Brain MRI Synthesis. 1-8 - Natthanich Hirunchavarod, Natnicha Sributsayakarn, Suchaya Pornprasertsuk-Damrongsri, Varangkanar Jirarattanasopha, Thanapong Intharah:

Through an AI's Looking Glass: Discovering Dental Sexual Dimorphism with Explainable AI. 1-8 - Salma P. González-Sabbagh, Antonio Robles-Kelly, Shang Gao:

DichroGAN: Towards Restoration of in-Air Colours of Seafloor from Satellite Imagery. 1-8 - Sirui Liu, Jinman Kim:

MedGemma-Critic: Fine-Tuning Medical Language Models for Domain-Specialised Text Evaluation. 1-5 - Ziqi Li, Abderraouf Amrani, Shri Rai, Hamid Laga:

Training-Free Non-Rigid Registration of Articulated Animal Bodies via Vision Features and Anatomical Priors. 1-8 - Sheikh Ridwan Raihan Kabir, G. M. Atiqur Rahaman, Sajib Saha:

RenalGraphNet: A Graph-Enhanced Hybrid CNNs for Kidney Structure Segmentation in CT Imaging. 1-8 - Qi Tan, Rong Wei, Zhiyu Xi, Jingqing Yang:

Consistent3D: Diffusion-Driven Sparse View Completion and 3D Reconstruction with Geometric Priors. 1-8















