


default search action
8th PRCV 2025: Shanghai, China - Part VI
- Josef Kittler, Hongkai Xiong

, Jian Yang
, Xilin Chen
, Jiwen Lu
, Weiyao Lin
, Jingyi Yu
, Weishi Zheng
:
Pattern Recognition and Computer Vision - 8th Chinese Conference, PRCV 2025, Shanghai, China, October 15-18, 2025, Proceedings, Part VI. Lecture Notes in Computer Science 16277, Springer 2026, ISBN 978-981-95-5678-6
Multi-modal Information Processing
- Jianfei Liu, Yi Li, Fuxin Yu, Haiyan Fu, Yanqing Guo:

Reawakening Intra-modality Discrimination for Image-Text Matching. 3-16 - YiYang Tang

, Ning Luo
, Qian Chen
, NanJie Zheng:
Overcoming Feature Missing: Joint Reconstruction and Prior Semantics Transmission for Robust Multimodal Sentiment Analysis. 17-31 - Feng Zhang, Zhenming Chen, Hao Feng, Biao Guo, Yao Lu, Ming Jiang:

TMFit: Enhancing Fashion Image Editing Precision via Text-Driven Mask Generation. 32-46 - Dongjin Huang, Jiyu Qian, Yufei Liu, Wenyun Tu, Yichuan Liu:

DreamDancer: Music-Driven Dance Video Intelligent Generation. 47-61 - Jiawei Shi

, Dawei Liu, Huiyang Shi:
VSE-MQA: A Semantic Grouping and Multi-modal Approach for Accurate Video Quality Assessment. 62-75 - Shun Qian, Bingquan Liu, Chengjie Sun, Zhen Xu, Baoxun Wang:

Enhancing Compositional Reasoning in Multimodal Large Language Models. 76-90 - Siyi Liu, Weiming Chen

, Yushun Tang
, Zhihai He
:
LatentEdit: Adaptive Latent Control for Consistent Semantic Editing. 91-105 - Jianjing Wei, Wuman Luo

, Bidong Chen:
ClinCoCoOp: An Interpretable Prompt Learning Framework with Clinical Concept Guidance for Context Optimization. 106-119 - Lixuan Wei, Yichen Liu, Kejing Xia, Lei Yu:

All-in-Focus Seeing Through Occlusions with Event and Frame. 120-134 - Shun Qian, Bingquan Liu, Chengjie Sun, Zhen Xu, Baoxun Wang:

Capturing Cross-Modal Semantics by Generating Comments for Image-Text Contents. 135-148 - Heng-yang Lu, Yiyang Sung:

False Negatives Do Matter: A Novel Soft Label and Reranking Based Plug-in Method for Image-Text Retrieval. 149-163 - Xiaodi Yu, Yaoming Cai, Zijia Zhang, Yao Ding, Xiaobo Liu, Fei Li:

Uncertainty-Aware Deep Anchor Graph Learning for Multimodal Remote Sensing Image Clustering. 164-178 - Xiaoxiao Yan, Zuheng Wang, Yun Shuai, Jun Hu, Zhihao Deng, Quanyu Wang

, Guanyu Chen
:
CFANet: Cross-Frequency Adaptive Fusion with Frequency-Modulated Attention for Multi-focus Image Fusion. 179-193 - Aimei Dong, Yezou Zhou, Yongxing Cai:

Adaptive Multimodal Fusion for Graph Learning in Brain Disease Prediction. 194-207 - Xilai Xu, Zilin Zhao, Chengye Song, Zining Wang, Jinhe Qiang, Jiongrui Yan, Yuhuai Lin:

SentiMM: A Multimodal Multi-agent Framework for Sentiment Analysis in Social Media. 208-221 - Yilin Fan, Wenzhong Yang, Yabo Yin, Fuyuan Wei, Xiaoming Tao, Liejun Wang:

Bias-Unlearning in MABSA: A Causal Framework with Cross-Modal Counterfactual Inference. 222-236 - Yulong Yang, Shaoguo Cui, Chuan Sun, Linfeng Gong, Wei Xia, Sifan Zhao:

Mapping Semantic, Unmasking Falsehoods: Topic-Driven Hierarchical Graph Network for Short Video Fake News Detection. 237-251
Vision Applications and Systems
- Qunyi Zhang, Jiaqi Liu, Guoyang Xie, Liewen Liao, Yongming Chen, Xiaoning Lei, Annan Shu, Guannan Jiang, Songan Zhang:

Revisiting Symmetric Teacher-Student Network Distillation for Anomaly Detection. 255-269 - Xinyi Li, Xinyu Yang

, Shuo Zhang, Jiazhe Sun:
User-Adjustable Image Cropping Based on Visual Semantic Awareness. 270-284 - Manyu Wang, Xiang Zhang, Xinjue Hu, Fan Wang, Xu Cheng, Zhangjie Fu:

Multi-source Domain Adaptation Image Steganalysis for Cover Source Mismatch. 285-299 - Meiheng Wang

, Di Wu
, Chengqun Song, Jun Cheng:
PhoneGuide-SLAM: Low-Cost Smartphone Navigation for the Visually Impaired Using Visual Semantic SLAM. 300-314 - Yu Han, Wenhao Li, Feng Yang, Shan You, Yi Chen, Chang Xu, Xiu Su:

FairMoE: Decoupled Expert Learning for Unbiased Customized Face Generation. 315-329 - Yingying Sun, Zhiguang Chen, Nong Xiao:

StrategyAdapter: One-Shot Learning for Unseen-Domain Procedural Sequence Generation. 330-342 - Xiaolei Wei, Yi Ouyang, Haibo Ye:

Divide, Weight, and Route: Difficulty-Aware Optimization with Dynamic Expert Fusion for Long-Tailed Recognition. 343-357 - Zhonghua Sun

, Yaowei Yang, Kaisaier Tuerxun
, Alimjan Aysa, Kurban Ubul:
C3F: A Coarse-to-Fine Feature Fusion Approach for Scene Script Identification. 358-372 - Haomiao Tang, Wenjie Li, Yixiang Qiu, Genping Wang, Shu-Tao Xia:

Secure and Scalable Face Retrieval via Cancelable Product Quantization. 373-386 - Yan He, Xiang Xiang

, Xiaofei Liao:
Few-Shot Font Generation via Attribute-Guided Diffusion with Style Contrastive Learning. 387-402 - Jingyi Wang, Hanwei Gao, Zhidong Deng:

Rethinking the Evaluation of Scene Graph Generation. 403-417 - Fanxiang Zhou, Ziyue Wang

, Xina Cheng
, Takeshi Ikenaga
:
Geometry-Aware Contextual Reasoning-Based Indoor Accessibility Detection System for Visually Impaired Wheelchair Users. 418-431 - Minghong Sun, Lingye Zhao

, Luojun Lin:
Probabilistic Visual Prompt Tuning. 432-445 - Jiayi Deng, Shuai Tang, Ke Xu, Peisong He:

Identity-Agnostic Incremental Learning Framework for Face Forgery Detection. 446-459 - Haifeng Ni, Ming Xu:

ITVTON: Virtual Try-On Diffusion Transformer Based on Integrated Image and Text. 460-474 - Yuantao Jia, Feng Zhang, Bin Wang, Haonan Yan, Xing Wang, Zhangyu Gu, Shaopeng Zhou, Chaohao Li:

BTCD: Enabling Balanced Toxic Content Detection by Collaborating VLMs and CNNs. 475-488 - Shifa Tang, Jinlai Zhang, Shuimiao Yu, Xi Chen, Sheng Wu, Tiefang Zou, Qiqi Li, Lin Hu:

Hierarchical Multimodal Feature Learning and 3D Convolution for Rail Defect Detection. 489-503 - Ziyi Li, Qingyu Mao, Shuai Liu, Qilei Li, Fanyang Meng, Yongsheng Liang:

Boosting Neural Video Representation via Online Structural Reparameterization. 504-518 - Canzhi Chen, Weiqi Huang, Jiaxin Li, Zan Wang, Huijun Di, Wei Liang:

Emergency Evacuation Map Guided Navigation via Topological Alignment and VLM Reasoning. 519-533 - Wuti Xiong, Valentina Kuklina:

DF-CLIP: Adapting Visual-Language Models for Generalizable Deepfake Detection Using Multi-Modal Prompt Tuning. 534-548

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














