"Look Before You Speak: Visually Contextualized Utterances."

Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid (2021)

Details and statistics

DOI: 10.1109/CVPR46437.2021.01660

access: open

type: Conference or Workshop Paper

metadata version: 2022-07-18