


8th PRCV 2025: Shanghai, China - Part V
- Josef Kittler, Hongkai Xiong, Jian Yang, Xilin Chen, Jiwen Lu, Weiyao Lin, Jingyi Yu, Weishi Zheng: Pattern Recognition and Computer Vision - 8th Chinese Conference, PRCV 2025, Shanghai, China, October 15-18, 2025, Proceedings, Part V. Lecture Notes in Computer Science 16276, Springer 2026, ISBN 978-981-95-5566-6
Multi-modal Information Processing
- Zebao Zhang, Wenlong Niu, Yue Yang: Dual-text Guided Cascading Visual Prompt for Vision-Language Models. 3-17
- Yulin Chen, Zeyuan Wang, Tianyuan Yu, Yingmei Wei, Liang Bai: FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and Detection. 18-32
- Siqi Yang, Na Lu, Benqi Wang, Yikun Li, Runxi Cui: SegCL: Segmented Reasoning with Global Visual-Audio Knowledge for Complex Long Video Understanding. 33-46
- Zebing Yao, Hao Fu, Yuhao Liu, Guanghua Gu: Consistency Aware Representation Learning for Unsupervised Cross-Domain Image Retrieval. 47-60
- Pengfei Yue, Xiaokang Jiang, Yilin Lu, Jianghang Lin, Shengchuan Zhang, Liujuan Cao: Referring Industrial Anomaly Segmentation. 61-75
- Ping Hu, Junjie Cao, Jingyi Li, TongQing Zhu, Jian Zhao: BF-HFD: Hidden Follower Detection Based on Behavioral Features. 76-90
- Zheling Meng, Bo Peng, Xiaochuan Jin, Yueming Lyu, Wei Wang, Jing Dong, Tieniu Tan: Concept Corrector: Erase Concepts on the Fly for Text-to-Image Diffusion Models. 91-105
- Jiayi Wu, Siyu Zhang, Muchen Lan, Yaoru Sun: CAHN: Category-Aware Hypergraph Network for Multimodal Aspect-Based Sentiment Analysis. 106-120
- Chuanjiang Zhang, Tianyang Xu, Zhangyong Tang, Xiaojun Wu: Cross-Modal Supervised Contrastive Learning for RGB-T Semantic Segmentation. 121-135
- Huadong Chen, Xiaoyan Yu, Bang Li, Ming Yin, Taisong Jin: HyperKGC: Hypergraph-Enhanced Multimodal Knowledge Graph Completion with Dynamic Fusion. 136-150
- Yijun Bei, Ke Wang, Bin Zhao: STDC: Sparse Transformer Deep Collaboration Prompt Tuning for Industrial Multimodal Large Models. 151-164
- Yuqiu Kong, Wenjie Wu, Zijian Wang, Shenglan Liu: PMCFNet: Prompt-Guided Multi-scale Cross-Modal Fusion Network for Referring Remote Sensing Image Segmentation. 165-181
- Hailin Wang, Sheng Huang, Jiexuan Yan, Xin Zhang, Nankun Mu: Semantic Guided Dual-Branch Co-inference for Few-Shot 3D Point Cloud Classification. 182-196
- Chengsong Sun, Weiping Li, Xiang Yuan, Yuankun Liu: GMM-Based Comprehensive Feature Extraction and Relative Distance Preservation for Few-Shot Cross-Modal Retrieval. 197-211
- Guanxi Liu, Zunwang Ke, Gang Wang, Hongyan Zhao, Yugui Zhang, Chunbao Lu: AdCache-CLIP: Adaptive Dynamic Feature Caching and Cross-Modal Alignment for Zero-Shot Anomaly Detection. 212-225
- Yifan Xu, Kaiwen Qian, Yuchun Fang: DiscoIB: Disentangled Subject Customization via Information Bottleneck. 226-240
- Jinhong Li, Leheng Zhang, Hui Cui, Jingxian Wang, Rui Li: Multimodal Sentiment Analysis via Spatio-Temporal Decoupling and Language-Focused Fusion. 241-255
- Songqian Zhang, Weijian Su, Lei Meng, Yuqi Han, Jinli Suo, Qiang Zhang: HeRIF: A Mixture-of-Experts Framework for Infrared and Visible Image Fusion with Heterogeneous Resolutions. 256-270
- Jingyi Zhang, Hefeng Wu, Liang Lin: Improving Spatio-Temporal Awareness of Multimodal Large Language Models via Reinforcement Fine-Tuning. 271-283
- Tichao Wang, Ziliang Ren, Qieshi Zhang, Yimin Zhou, Jun Cheng: Two-Stage Modal Feature Enhancement for Multispectral Object Detection. 284-297
- Siying Xu, Gang Wang: PCFusion: A Unified Image Fusion Network with Perception-Driven Cross-Domain Learning. 298-312
- Jiawei Feng, Haiyu Song, Yun Mao, Jiayu Wang, Mingyu Ge, Zhengchi Du, Jialiang Chen, Zeyu Wang: KMMF-Net: Implicit Fusion with KAN-Guided Mamba Modeling. 313-326
- Shenao Shao, Liejun Wang, Shaochen Jiang, Beibei Gao: Dual Fusion with Auxiliary Loss Hashing for Cross-Modal Retrieval. 327-341
- Minghui Ding, Kaibing Zhang, Hui Zhang, Xuan Zhou, Junwei Wang: Text2Printing: Controllable Textile Digital Printing Pattern Generation with Attention Modulation. 342-357
- Chenlin Meng, Zhaoyong Mao, Chi Zhang, Kai Jiang, Junge Shen: Boosting Weakly Supervised Video Anomaly Detection with Generative Description. 358-372
- Hannan Bai, Haoyuan Sun, Yuncheng Du: Entropy-Aware Preference Alignment for Diffusion-Based Text-to-Image Generation. 373-387
- Lipeng Wang, Hongxing Fan, Zehuan Huang, Lu Sheng: Absolute Story: Visual Storytelling with Consistent Subject and Style. 388-402
- Xing Tan, Meng Yang: Visible-Infrared Person Re-identification via Counterfactual Intervention Learning. 403-417
- Hongbin Chen, Rui Feng, Jie Li, Wei Wang, Jianqing Li, Wentao Xiang: MOFA: Modality-Orthogonalized Fusion Architecture for Multimodal Emotion Recognition. 418-433
- Chenxu Wang, Xiaojin Gong: Leveraging Vision Foundation Models for RGB-Thermal Semantic Segmentation. 434-448
- Haohuinan Zhang: Strengthened Node and Edge Generation for Enhanced Information Interaction in Infrared-Visible Fusion. 449-463
- Jiaxing Yang, Lihe Zhang, Jiayu Sun, Huchuan Lu: Bidirectional Spatial Semantics Correlation for Referring Image Segmentation. 464-478
- Hongyang Zhu, Haipeng Liu, Bo Fu, Yang Wang: Masked Dual-Editing Diffusion Models for Multi-object Image Editing. 479-493
- Haoyu Cao, Anqi Gou, Haobin Cao: DiffCTE: Consistent Visual Text Editing with High Style Fidelity via Diffusion Model. 494-507
- Haoliang Feng, Haoran Li, Lan Tang: Diffusion-Based Cross-Modal Denoising and Reliability-Aware Deep Matching for Robust Radar Odometry. 508-522
- Xue Wang, Huijie Zhang, Jialu Dong, Yiming Lin, Xin Liu: Towards Better Image-Text Matching: Concept-Guided Alignment for Vision-Language Models. 523-537
- Yang Yu, Xin Sun, Qijun Hu, Bopeng Fang, Leping He, Qijie Cai, Yu Bai: An EEG-Driven Multi-branch Framework for Joint Spatio-Temporal-Spectral Modeling in Fatigue Recognition. 538-552
- Jiale Sun, Yongxing Cai, Bin Liu, Yezou Zhou, Aimei Dong: MANGL: Multimodal Feature Alignment and Masked Random Noise Perturbation for Graph Learning in Disease Prediction. 553-566















