


18th ECCV 2024: Milan, Italy - Part LXIV
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXIV. Lecture Notes in Computer Science 15122, Springer 2025, ISBN 978-3-031-73038-2
- Anita Rau, Josiah Aklilu, F. Christopher Holsinger, Serena Yeung-Levy:
Depth-Guided NeRF Training via Earth Mover's Distance. 1-17
- Ji Ha Jang, Hoigi Seo, Se Young Chun:
INTRA: Interaction Relationship-Aware Weakly Supervised Affordance Grounding. 18-34
- Sarah Jabbour, Gregory Kondas, Ella Kazerooni, Michael W. Sjoding, David Fouhey, Jenna Wiens:
DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks. 35-51
- Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha:
MEERKAT: Audio-Visual Large Language Model for Grounding in Space and Time. 52-70
- Yake Wei, Siwei Li, Ruoxuan Feng, Di Hu:
Diagnosing and Re-learning for Balanced Multimodal Learning. 71-86
- Dongwon Park, Hayeon Kim, Se Young Chun:
Contribution-Based Low-Rank Adaptation with Pre-training Model for Real Image Restoration. 87-105
- Lucas Stoffl, Andy Bonnetto, Stéphane d'Ascoli, Alexander Mathis:
Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders. 106-125
- Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun:
BeyondScene: Higher-Resolution Human-Centric Scene Generation with Pretrained Diffusion. 126-142
- Chao Xu, Ang Li, Linghao Chen, Yulin Liu, Ruoxi Shi, Hao Su, Minghua Liu:
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views. 143-163
- Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke, Serge J. Belongie, Christian Igel, Nico Lang:
MMEarth: Exploring Multi-modal Pretext Tasks for Geospatial Representation Learning. 164-182
- Mia Chiquier, Utkarsh Mall, Carl Vondrick:
Evolving Interpretable Visual Classifiers with Large Language Models. 183-201
- De-An Huang, Shijia Liao, Subhashree Radhakrishnan, Hongxu Yin, Pavlo Molchanov, Zhiding Yu, Jan Kautz:
LITA: Language Instructed Temporal-Localization Assistant. 202-218
- Timothy Chase Jr., Karthik Dantu:
MARs: Multi-view Attention Regularizations for Patch-Based Feature Recognition of Space Terrain. 219-239
- Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeffrey Nichols, Yinfei Yang, Zhe Gan:
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs. 240-255
- Zhengfeng Lai, Joohi Chauhan, Brittany N. Dugger, Chen-Nee Chuah:
Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data. 256-273
- Yangchao Wu, Tian Yu Liu, Hyoungseob Park, Stefano Soatto, Dong Lao, Alex Wong:
AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation. 274-293
- Wei-Yu Lee, Martin D. Dimitrievski, David Van Hamme, Jan Aelterman, Ljubomir Jovanov, Wilfried Philips:
CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection. 294-310
- Haijin Zeng, Yuxi Liu, Yongyong Chen, Youfa Liu, Chong Peng, Jingyong Su:
SAH-SCI: Self-supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging. 311-328
- Jeremy Klotz, Shree K. Nayar:
Minimalist Vision with Freeform Pixels. 329-346
- Seongho Kim, Byung Cheol Song:
All You Need Is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation. 347-363
- Umar Khalid, Hasan Iqbal, Nazmul Karim, Muhammad Tayyab, Jing Hua, Chen Chen:
LatentEditor: Text Driven Local Editing of 3D Scenes. 364-380
- Kaustubh Sadekar, David Maier, Atul Ingle:
Single-Photon 3D Imaging with Equi-Depth Photon Histograms. 381-398
- Sanket Kachole, Hussain M. Sajwani, Fariborz Baghaei Naeini, Dimitrios Makris, Yahya H. Zweiri:
Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision. 399-415
- James Burgess, Kuan-Chieh Wang, Serena Yeung-Levy:
Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models. 416-435
- Prachi Garg, K. J. Joseph, Vineeth N. Balasubramanian, Necati Cihan Camgöz, Chengde Wan, Kenrick Kin, Weiguang Si, Shugao Ma, Fernando De la Torre:
POET: Prompt Offset Tuning for Continual Human Action Adaptation. 436-455
- Shuangzhi Li, Lei Ma, Xingyu Li:
Domain Generalization of 3D Object Detection by Density-Resampling. 456-473
- Chenglin Yang, Siyuan Qiao, Yuan Cao, Yu Zhang, Tao Zhu, Alan L. Yuille, Jiahui Yu:
IG Captioner: Information Gain Captioners Are Strong Zero-Shot Classifiers. 474-490















