


default search action
6th PRCV 2023: Xiamen, China - Part I
- Qingshan Liu

, Hanzi Wang
, Zhanyu Ma
, Weishi Zheng
, Hongbin Zha
, Xilin Chen
, Liang Wang, Rongrong Ji
:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part I. Lecture Notes in Computer Science 14425, Springer 2024, ISBN 978-981-99-8428-2
Action Recognition
- Chengguo Yuan, Yu Jin, Zongzhen Wu, Fanting Wei, Yangzirui Wang, Lan Chen, Xiao Wang:

Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion Based Classification. 3-15 - Yang Shu, Wanggen Li, Doudou Li, Kun Gao, Biao Jie:

Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition. 16-28 - Wentian Xin, Yi Liu, Ruyi Liu, Qiguang Miao, Cheng Shi, Chi-Man Pun:

Auto-Learning-GCN: An Ingenious Framework for Skeleton-Based Action Recognition. 29-42 - Xiaowei Zhu, Qian Huang, Chang Li

, Jingwen Cui, Yingying Chen:
Skeleton-Based Action Recognition with Combined Part-Wise Topology Graph Convolutional Networks. 43-59 - Mingliang Xue

, Siwei Wang, Bing Fu, Zhengyang Zhao, Tao Liu, Lingfeng Lai:
Segmenting Key Clues to Induce Human-Object Interaction Detection. 60-71 - Teng Huang

, Weiqing Kong
, Jiaming Liang
, Ziyu Ding
, Hui Li
, Xi Zhang:
Lightweight Multispectral Skeleton and Multi-stream Graph Attention Networks for Enhanced Action Prediction with Multiple Modalities. 72-83 - Wanchuan Yu, Hanyu Guo, Yan Yan, Jie Li, Hanzi Wang:

Spatio-Temporal Self-supervision for Few-Shot Action Recognition. 84-96 - Jiulin Li, Mengyu Yang, Yang Liu, Gongli Xi, Lanshan Zhang, Ye Tian:

A Fuzzy Error Based Fine-Tune Method for Spatio-Temporal Recognition Model. 97-108 - Jinzhao Luo, Lu Zhou, Guibo Zhu, Guojing Ge, Beiying Yang, Jinqiao Wang:

Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition. 109-119 - Ying Zhou, Yana Zhang, Aiqiu Wu

:
HFGCN-Based Action Recognition System for Figure Skating. 120-130
Multi-modal Information Processing
- Zhengyu Li, Yao Wu

, Yanyun Qu:
Image Priors Assisted Pre-training for Point Cloud Shape Analysis. 133-145 - Wei Yue:

AMM-GAN: Attribute-Matching Memory for Person Text-to-Image Generation. 146-158 - Liucun Lu

, Jinghui Qin
, Zequn Jie
, Lin Ma
, Liang Lin
, Xiaodan Liang
:
RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog. 159-171 - Jiancheng Huang, Yifan Liu, Jin Qin, Shifeng Chen:

KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing. 172-184 - Jiaer Xia, Haozhe Yang, Yan Zhang, Pingyang Dai

:
Enhancing Text-Image Person Retrieval Through Nuances Varied Sample. 185-196 - Yi Zhang

, Ce Zhang
, Xueting Hu, Zhihai He:
Unsupervised Prototype Adapter for Vision-Language Models. 197-209 - Wenjun Feng, Dazhen Lin, Donglin Cao:

Multimodal Causal Relations Enhanced CLIP for Image-to-Text Retrieval. 210-221 - Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Siqi Wang:

Exploring Cross-Modal Inconsistency in Entities and Emotions for Multimodal Fake News Detection. 222-234 - Mengluan Li, Yanqing Guo, Haiyan Fu

, Yi Li, Hong Su:
Deep Consistency Preserving Network for Unsupervised Cross-Modal Hashing. 235-246 - Mintu Yang, Xianxu Hou

, Hao Li, Linlin Shen, Lixin Fan:
Learning Adapters for Text-Guided Portrait Stylization with Pretrained Diffusion Models. 247-258 - Zikun Song, Pinle Qin, Jianchao Zeng, Shuangjiao Zhai, Rui Chai, JunYi Yan:

EdgeFusion: Infrared and Visible Image Fusion Algorithm in Low Light. 259-270 - Yuanyuan Qiu, Zhenning Yu, Zhenguo Gao:

An Efficient Momentum Framework for Face-Voice Association Learning. 271-283 - Yuan Qing

, Naixing Wu, Shaohua Wan
, Lixin Duan:
Multi-modal Instance Refinement for Cross-Domain Action Recognition. 284-296 - Yang Xu, Junyi Wu

, Yan Yan, Xinsheng Du, Huiji Zhang, Jianqiang Zhao, Zhipeng Gao:
Modality Interference Decoupling and Representation Alignment for Caricature-Visual Face Recognition. 297-308 - Jie Wang, Yixiao Zheng, Ruoyi Du, Yiming Zhang, Kongming Liang, Zhanyu Ma:

Plugging Stylized Controls in Open-Stylized Image Captioning. 309-320 - Taoying Zhang, Hesong Li, Qiankun Liu, Xiaoyong Wang, Ying Fu:

MGT: Modality-Guided Transformer for Infrared and Visible Image Fusion. 321-332 - Chenyu Zhou

, Xiuhong Li
, Zhe Li
, Fan Chen, Xiaofan Wang, Dan Yang, Bin Chen
, Songlin Li
:
Multimodal Rumor Detection by Using Additive Angular Margin with Class-Aware Attention for Hard Samples. 333-344 - Lingfeng Hu, Si Liu, Hanzi Wang:

An Effective Dynamic Reweighting Method for Unbiased Scene Graph Generation. 345-356 - Zejun Wang

, Xinglong Wu
, Hongwei Yang
, Hui He
, Yu Tai
, Weizhe Zhang
:
Multi-modal Graph and Sequence Fusion Learning for Recommendation. 357-369 - Guoyong Cai

, Shunjie Wang, Guangrui Lv:
Co-attention Guided Local-Global Feature Fusion for Aspect-Level Multimodal Sentiment Analysis. 370-382 - Qing Zhang, Haocheng Lv, Jie Liu, Zhiyun Chen, Jianyong Duan, Mingying Xv, Hao Wang:

Discovering Multimodal Hierarchical Structures with Graph Neural Networks for Multi-modal and Multi-hop Question Answering. 383-394 - Chengjie Sun, Weiwei Chen, Lei Lin, Lili Shan:

Enhancing Recommender System with Multi-modal Knowledge Graph. 395-407 - Guoqing Xu, Min Hu

, Xiaohua Wang
, Jiaoyun Yang
, Nan Li
, Qingyu Zhang:
Location Attention Knowledge Embedding Model for Image-Text Matching. 408-421 - Dan Liu, Wei Song, Xiaobing Zhao:

Pedestrian Attribute Recognition Based on Multimodal Transformer. 422-433 - Xinyi Wu

, Xia Yuan
, YanChao Cui
, Chunxia Zhao:
RGB-D Road Segmentation Based on Geometric Prior Information. 434-445 - Tingting Han

, Yuanxin Lv, Zhou Yu, Jun Yu
, Jianping Fan, Liu Yuan:
Contrastive Perturbation Network for Weakly Supervised Temporal Sentence Grounding. 446-460 - Feng Li, Enguang Zuo, Chen Chen, Cheng Chen, Mingrui Ma, Yunling Wang, Xiaoyi Lv, Min Li:

MLDF-Net: Metadata Based Multi-level Dynamic Fusion Network. 461-473 - Ran Yan

, Ruiying Du
, Kun He
, Jing Chen
:
Efficient Adversarial Training with Membership Inference Resistance. 474-486 - Hongyu Wang, Pengpeng Qiang

, Hongye Tan, Jingchang Hu:
Enhancing Image Comprehension for Computer Science Visual Question Answering. 487-498 - Wei Bao, Jingjing Hu, Meiyu Huang, Xueshuang Xiang:

Cross-Modal Attentive Recalibration and Dynamic Fusion for Multispectral Pedestrian Detection. 499-510

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














