


default search action
18th ECCV 2024: Milan, Italy - Part XXII
- Ales Leonardis

, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXII. Lecture Notes in Computer Science 15080, Springer 2025, ISBN 978-3-031-72669-9 - Kai Zhang, Sai Bi, Hao Tan, Yuanbo Xiangli, Nanxuan Zhao, Kalyan Sunkavalli, Zexiang Xu:

GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting. 1-19 - Runyi Hu

, Jie Zhang
, Ting Xu, Jiwei Li, Tianwei Zhang
:
Robust-Wide: Robust Watermarking Against Instruction-Driven Image Editing. 20-37 - Qiao Mo

, Yukang Ding
, Jinhua Hao
, Qiang Zhu
, Ming Sun
, Chao Zhou
, Feiyu Chen
, Shuyuan Zhu:
OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal. 38-56 - Ryosuke Yamada

, Kensho Hara
, Hirokatsu Kataoka
, Koshi Makihara
, Nakamasa Inoue
, Rio Yokota
, Yutaka Satoh
:
Formula-Supervised Visual-Geometric Pre-training. 57-74 - Yue Fan

, Xiaojian Ma
, Rujie Wu
, Yuntao Du
, Jiaqi Li
, Zhi Gao
, Qing Li
:
🤖 VideoAgent: A Memory-Augmented Multimodal Agent for Video Understanding. 75-92 - Guanghao Zheng

, Yuchen Liu, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong:
Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-spoofing. 93-110 - Shangquan Sun

, Wenqi Ren, Xinwei Gao, Rui Wang
, Xiaochun Cao
:
Restoring Images in Adverse Weather Conditions via Histogram Transformer. 111-129 - Tongkun Guan

, Chengyu Lin
, Wei Shen, Xiaokang Yang:
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer. 130-147 - Yubin Hu

, Xiaoyang Guo, Yang Xiao, Jingwei Huang, Yong-Jin Liu:
NGP-RT: Fusing Multi-level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis. 148-165 - Han Wang, Yongjie Ye, Yanjie Wang, Yuxiang Nie, Can Huang:

Elysium: Exploring Object-Level Perception in Videos via MLLM. 166-185 - Shuxiang Xie, Shuyi Zhou, Ken Sakurada, Ryoichi Ishikawa, Masaki Onishi, Takeshi Oishi:

G2fR: Frequency Regularization in Grid-Based Feature Encoding Neural Radiance Fields. 186-203 - Agneet Chatterjee

, Gabriela Ben Melech Stan
, Estelle Aflalo
, Sayak Paul
, Dhruba Ghosh
, Tejas Gokhale
, Ludwig Schmidt, Hannaneh Hajishirzi
, Vasudev Lal
, Chitta Baral
, Yezhou Yang
:
Getting it Right: Improving Spatial Consistency in Text-to-Image Models. 204-222 - Xueqi Ma

, Yilin Liu
, Wenjun Zhou
, Ruowei Wang
, Hui Huang
:
Generating 3D House Wireframes with Semantics. 223-240 - Xiao Fu

, Wei Yin
, Mu Hu
, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin
, Xiaoxiao Long
:
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image. 241-258 - Yiyao Ma, Kai Chen

, Hon-Sing Tong
, Ruofeng Wei
, Yui-Lun Ng, Ka-Wai Kwok, Qi Dou:
Shape-Guided Configuration-Aware Learning for Endoscopic-Image-Based Pose Estimation of Flexible Robotic Instruments. 259-276 - Jianan Wei, Tianfei Zhou, Yi Yang, Wenguan Wang:

Nonverbal Interaction Detection. 277-295 - Jian Zou

, Tianyu Huang
, Guanglei Yang
, Zhenhua Guo
, Tao Luo
, Chun-Mei Feng
, Wangmeng Zuo
:
UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving. 296-313 - Minheng Ni, Yeli Shen, Lei Zhang, Wangmeng Zuo:

Responsible Visual Editing. 314-330 - Weijia Wu

, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang:
DragAnything: Motion Control for Anything Using Entity Representation. 331-348 - Shuting He

, Henghui Ding
, Xudong Jiang
, Bihan Wen
:
🤖 SegPoint: Segment Any Point Cloud via Large Language Model. 349-367 - Sheng Fan, Rui Liu, Wenguan Wang, Yi Yang:

Navigation Instruction Generation with BEV Perception and Large Language Models. 368-387 - Taemin Park

, Hyuck Lee
, Heeyoung Kim
:
Rebalancing Using Estimated Class Distribution for Imbalanced Semi-supervised Learning Under Class Distribution Mismatch. 388-404 - Qiuhong Shen, Xingyi Yang, Michael Bi Mi, Xinchao Wang

:
Vista3D: Unravel the 3D Darkside of a Single Image. 405-421 - Yi Yao

, Chan-Feng Hsu, Jhe-Hao Lin, Hongxia Xie
, Terence Lin, Yi-Ning Huang, Hong-Han Shuai
, Wen-Huang Cheng
:
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation. 422-438 - Junjie Huang

, Yun Ye, Zhujin Liang, Yi Shan, Dalong Du:
Detecting as Labeling: Rethinking LiDAR-Camera Fusion in 3D Object Detection. 439-455 - Qiuhong Shen, Xingyi Yang, Xinchao Wang

:
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally. 456-472 - Guanting Dong, Yueyi Zhang, Xiaoyan Sun, Zhiwei Xiong:

Exploiting Dual-Correlation for Multi-frame Time-of-Flight Denoising. 473-489

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














