"NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation."

Jiazhao Zhang et al. (2024)

Details and statistics

DOI: 10.15607/RSS.2024.XX.079

access: closed

type: Conference or Workshop Paper

metadata version: 2025-01-28

dblp key: conf/rss/ZhangWXZHF0ZW24

persistent URL: https://dblp.org/rec/conf/rss/ZhangWXZHF0ZW24

Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang:
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation. Robotics: Science and Systems 2024
