default search action

combined dblp search
author search
venue search
publication search

ask others

Mu Cai

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[j1]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/ShaoTZFCSYQSW26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/ShaoTZFCSYQSW26
Kele Shao, Keda Tao, Kejia Zhang, Sicheng Feng, Mu Cai, Yuzhang Shang, Haoxuan You, Can Qin, Yang Sui, Huan Wang:
A Survey of Token Compression for Efficient Multimodal Large Language Models. Trans. Mach. Learn. Res. 2026 (2026)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2603-06561
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2603-06561
Fangrui Zhu, Yunfeng Xi, Jianmo Ni, Mu Cai, Boqing Gong, Long Zhao, Chen Qu, Ian Miao, Yi Li, Cheng Zhong, Huaizu Jiang, Shwetak Patel:
EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking. CoRR abs/2603.06561 (2026)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2603-25744
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2603-25744
Bocheng Zou, Mu Cai, Mark Stanley, Dingfu Lu, Yong Jae Lee:
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models. CoRR abs/2603.25744 (2026)
2025
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/YangTWZPLGCYJD025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/YangTWZPLGCYJD025
Jianwei Yang, Reuben Tan, Qianhui Wu, Ruijie Zheng, Baolin Peng, Yongyuan Liang, Yu Gu, Mu Cai, Seonghyeon Ye, Joel Jang, Yuquan Deng, Jianfeng Gao:
Magma: A Foundation Model for Multimodal AI Agents. CVPR 2025: 14203-14214
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0109M0KJSRBCLR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0109M0KJSRBCLR25
Xiang Li, Cristina Mata, Jongwoo Park, Kumara Kahatapitiya, Yoo Sung Jang, Jinghuan Shang, Kanchana Ranasinghe, Ryan D. Burgert, Mu Cai, Yong Jae Lee, Michael S. Ryoo:
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy. ICLR 2025
[c15]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/CaiY0L25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/CaiY0L25
Mu Cai, Jianwei Yang, Jianfeng Gao, Yong Jae Lee:
Matryoshka Multimodal Models. ICLR 2025
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/CaiHLOWL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/CaiHLOWL25
Mu Cai, Zeyi Huang, Yuheng Li, Utkarsh Ojha, Haohan Wang, Yong Jae Lee:
An Investigation on LLMs' Visual Understanding Ability Using SVG for Image-Text Bridging. WACV 2025: 5377-5386
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-13130
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-13130
Jianwei Yang, Reuben Tan, Qianhui Wu, Ruijie Zheng, Baolin Peng, Yongyuan Liang, Yu Gu, Mu Cai, Seonghyeon Ye, Joel Jang, Yuquan Deng, Lars Liden, Jianfeng Gao:
Magma: A Foundation Model for Multimodal AI Agents. CoRR abs/2502.13130 (2025)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-20021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-20021
Hyunsik Chae, Seungwoo Yoon, Jaden Park, Chloe Yewon Chun, Yongin Cho, Mu Cai, Yong Jae Lee, Ernest K. Ryu:
Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models. CoRR abs/2505.20021 (2025)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-20198
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-20198
Kele Shao, Keda Tao, Kejia Zhang, Sicheng Feng, Mu Cai, Yuzhang Shang, Haoxuan You, Can Qin, Yang Sui, Huan Wang:
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios. CoRR abs/2507.20198 (2025)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-03774
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-03774
Jaden Park, Mu Cai, Feng Yao, Jingbo Shang, Soochahn Lee, Yong Jae Lee:
Contamination Detection for VLMs using Multi-Modal Semantic Perturbation. CoRR abs/2511.03774 (2025)
2024
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZhangCXL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangCXL24
Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee:
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples. ACL (Findings) 2024: 15481-15495
[c12]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/cpal/ZhaiTLCQLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cpal/ZhaiTLCQLM24
Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma:
Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning. CPAL 2024: 202-227
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/CaiLMMCPL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/CaiLMMCPL24
Mu Cai, Haotian Liu, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Dennis Park, Yong Jae Lee:
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts. CVPR 2024: 12914-12923
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/LiLCLSLLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LiLCLSLLS24
Yuheng Li, Haotian Liu, Mu Cai, Yijun Li, Eli Shechtman, Zhe Lin, Yong Jae Lee, Krishna Kumar Singh:
Removing Distributional Discrepancies in Captions Improves Image-Text Alignment. ECCV (21) 2024: 405-422
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZouCZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZouCZL24
Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee:
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation. EMNLP 2024: 3647-3659
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/CaiLL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/CaiLL024
Mu Cai, Chenxu Luo, Yong Jae Lee, Xiaodong Yang:
Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds. IROS 2024: 9468-9475
[c7]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/NguyenLLCOL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NguyenLLCOL24
Thao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee:
Yo'LLaVA: Your Personalized Language and Vision Assistant. NeurIPS 2024
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-13254
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-13254
Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee:
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples. CoRR abs/2402.13254 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-15388
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-15388
Yuzhang Shang, Mu Cai, Bingxin Xu, Yong Jae Lee, Yan Yan:
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models. CoRR abs/2403.15388 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17430
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17430
Mu Cai, Jianwei Yang, Jianfeng Gao, Yong Jae Lee:
Matryoshka Multimodal Models. CoRR abs/2405.17430 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-09400
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-09400
Thao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee:
Yo'LLaVA: Your Personalized Language and Vision Assistant. CoRR abs/2406.09400 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-20095
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-20095
Xiang Li, Cristina Mata, Jongwoo Park, Kumara Kahatapitiya, Yoo Sung Jang, Jinghuan Shang, Kanchana Ranasinghe, Ryan D. Burgert, Mu Cai, Yong Jae Lee, Michael S. Ryoo:
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy. CoRR abs/2406.20095 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-10972
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10972
Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee:
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation. CoRR abs/2407.10972 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-06827
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-06827
Mu Cai, Chenxu Luo, Yong Jae Lee, Xiaodong Yang:
Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds. CoRR abs/2409.06827 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-12963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-12963
Yuzhang Shang, Bingxin Xu, Weitai Kang, Mu Cai, Yuheng Li, Zehao Wen, Zhen Dong, Kurt Keutzer, Yong Jae Lee, Yan Yan:
Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner. CoRR abs/2409.12963 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-00905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-00905
Yuheng Li, Haotian Liu, Mu Cai, Yijun Li, Eli Shechtman, Zhe Lin, Yong Jae Lee, Krishna Kumar Singh:
Removing Distributional Discrepancies in Captions Improves Image-Text Alignment. CoRR abs/2410.00905 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-02763
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-02763
Jianrui Zhang, Mu Cai, Yong Jae Lee:
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos. CoRR abs/2410.02763 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-10818
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-10818
Mu Cai, Reuben Tan, Jianrui Zhang, Bocheng Zou, Kai Zhang, Feng Yao, Fangrui Zhu, Jing Gu, Yiwu Zhong, Yuzhang Shang, Yao Dou, Jaden Park, Jianfeng Gao, Yong Jae Lee, Jianwei Yang:
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models. CoRR abs/2410.10818 (2024)
2023
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/HuangZLCWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/HuangZLCWL23
Zeyi Huang, Andy Zhou, Zijian Lin, Mu Cai, Haohan Wang, Yong Jae Lee:
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance. ICCV 2023: 11651-11661
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/CaiL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/CaiL23
Mu Cai, Yixuan Li:
Out-of-distribution Detection via Frequency-regularized Generative Models. WACV 2023: 5510-5519
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-06094
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-06094
Mu Cai, Zeyi Huang, Yuheng Li, Haohan Wang, Yong Jae Lee:
Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding. CoRR abs/2306.06094 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10313
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10313
Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma:
Investigating the Catastrophic Forgetting in Multimodal Large Language Models. CoRR abs/2309.10313 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12530
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12530
Zeyi Huang, Andy Zhou, Zijian Lin, Mu Cai, Haohan Wang, Yong Jae Lee:
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance. CoRR abs/2309.12530 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-00784
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-00784
Mu Cai, Haotian Liu, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Dennis Park, Yong Jae Lee:
Making Large Multimodal Models Understand Arbitrary Visual Prompts. CoRR abs/2312.00784 (2023)
2022
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/LiuCL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LiuCL22
Haotian Liu, Mu Cai, Yong Jae Lee:
Masked Discrimination for Self-supervised Learning on Point Clouds. ECCV (2) 2022: 657-675
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DuWCL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DuWCL22
Xuefeng Du, Zhaoning Wang, Mu Cai, Yixuan Li:
VOS: Learning What You Don't Know by Virtual Outlier Synthesis. ICLR 2022
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01197
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01197
Xuefeng Du, Zhaoning Wang, Mu Cai, Yixuan Li:
VOS: Learning What You Don't Know by Virtual Outlier Synthesis. CoRR abs/2202.01197 (2022)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-11183
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11183
Haotian Liu, Mu Cai, Yong Jae Lee:
Masked Discrimination for Self-Supervised Learning on Point Clouds. CoRR abs/2203.11183 (2022)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-09083
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-09083
Mu Cai, Yixuan Li:
Out-of-distribution Detection via Frequency-regularized Generative Models. CoRR abs/2208.09083 (2022)
2021
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/CaiZHGLH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/CaiZHGLH21
Mu Cai, Hong Zhang, Huijuan Huang, Qichuan Geng, Yixuan Li, Gao Huang:
Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving. ICCV 2021: 13910-13920
2020
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/SunCZT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/SunCZT20
Liting Sun, Mu Cai, Wei Zhan, Masayoshi Tomizuka:
A Game-Theoretic Strategy-Aware Interaction Algorithm with Validation on Real Traffic Data. IROS 2020: 11038-11044
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-13611
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-13611
Mu Cai, Hong Zhang, Huijuan Huang, Qichuan Geng, Gao Huang:
Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving. CoRR abs/2011.13611 (2020)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.