


default search action
Yuxiang Huang 0001
Person information
- affiliation: Tsinghua University, BNRist, Beijing, China
Other persons with the same name
- Yuxiang Huang — disambiguation page
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
2020 – today
- 2026
[i13]Yuxiang Huang, Mingye Li, Xu Han
, Chaojun Xiao, Weilin Zhao, Ao Sun, Ziqi Yuan, Hao Zhou, Fandong Meng, Zhiyuan Liu:
Spava: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention. CoRR abs/2601.21444 (2026)
[i12]Yuxiang Huang, Nuno Gonçalves, Federico Alvetreti, Lei Li, Xu Han, Edoardo M. Ponti, André F. T. Martins, Marcos V. Treviso:
DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention. CoRR abs/2605.18753 (2026)- 2025
[j4]Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Xuanhe Zhou
, Yufei Huang, Chaojun Xiao, Chi Han, Yi R. Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang
, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Guoliang Li, Zhiyuan Liu, Maosong Sun:
Tool Learning with Foundation Models. ACM Comput. Surv. 57(4): 101:1-101:40 (2025)
[j3]Yuxiang Huang, Binhang Yuan, Xu Han, Chaojun Xiao, Zhiyuan Liu:
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices. Trans. Mach. Learn. Res. 2025 (2025)
[c4]Weilin Zhao, Tengyu Pan, Xu Han, Yudi Zhang, Sun Ao, Yuxiang Huang, Kaihuo Zhang, Weilun Zhao, Yuxuan Li, Jie Zhou, Hao Zhou, Jianyong Wang, Maosong Sun, Zhiyuan Liu:
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling. ACL (1) 2025: 3909-3921
[c3]Yuxiang Huang, Mingye Li, Xu Han, Chaojun Xiao, Weilin Zhao, Sun Ao, Hao Zhou, Jie Zhou, Zhiyuan Liu, Maosong Sun:
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs. ACL (1) 2025: 10708-10727
[c2]Ziqi Yuan
, Jun Li
, Yanghao Li
, Yuxiang Huang
, Chi Chen
, Shuo Wang
, Zhinan Gou
:
CITR: Efficient Long Video Understanding Needs Causal Importance. ACM Multimedia 2025: 4068-4076
[i11]Yuxiang Huang, Mingye Li, Xu Han
, Chaojun Xiao, Weilin Zhao, Sun Ao, Hao Zhou, Jie Zhou, Zhiyuan Liu, Maosong Sun:
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs. CoRR abs/2502.12085 (2025)
[i10]Weilin Zhao, Tengyu Pan, Xu Han
, Yudi Zhang, Ao Sun, Yuxiang Huang, Kaihuo Zhang, Weilun Zhao, Yuxuan Li, Jianyong Wang, Zhiyuan Liu, Maosong Sun:
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling. CoRR abs/2502.14856 (2025)
[i9]Chaojun Xiao, Yuxuan Li, Xu Han, Yuzhuo Bai, Jie Cai, Haotian Chen, Wentong Chen, Xin Cong, Ganqu Cui, Ning Ding, Shengda Fan, Yewei Fang, Zixuan Fu, Wenyu Guan, Yitong Guan, Junshao Guo, Yufeng Han, Bingxiang He, Yuxiang Huang, Cunliang Kong, Qiuzuo Li, Siyuan Li, Wenhao Li, Yanghao Li, Yishan Li, Zhen Li, Dan Liu, Biyuan Lin, Yankai Lin, Xiang Long, Quanyu Lu, Yaxi Lu, Peiyan Luo, Hongya Lyu, Litu Ou, Yinxu Pan, Zekai Qu, Qundong Shi, Zijun Song, Jiayuan Su, Zhou Su, Ao Sun, Xianghui Sun, Peijun Tang, Fangzheng Wang, Feng Wang, Shuo Wang, Yudong Wang, Yesai Wu, Zhenyu Xiao, Jie Xie, Zihao Xie, Yukun Yan, Jiarui Yuan, Kaihuo Zhang, Lei Zhang, Linyue Zhang, Xueren Zhang, Yudi Zhang, Hengyu Zhao, Weilin Zhao, Weilun Zhao, Yuanqian Zhao, Zhi Zheng, Ge Zhou, Jie Zhou, Wei Zhou, Zihan Zhou, Zixuan Zhou, Zhiyuan Liu, Guoyang Zeng, Chao Jia, Dahai Li, Maosong Sun:
MiniCPM4: Ultra-Efficient LLMs on End Devices. CoRR abs/2506.07900 (2025)
[i8]Tianyu Yu, Zefan Wang, Chongyi Wang, Fuwei Huang, Wenshuo Ma, Zhihui He, Tianchi Cai, Weize Chen, Yuxiang Huang, Yuanqian Zhao, Bokai Xu, Junbo Cui, Yingjing Xu, Liqing Ruan, Luoyuan Zhang, Hanyu Liu, Jingkun Tang, Hongyuan Liu, Qining Guo, Wenhao Hu, Bingxiang He, Jie Zhou, Jie Cai, Ji Qi, Zonghao Guo, Chi Chen, Guoyang Zeng, Yuxuan Li, Ganqu Cui, Ning Ding, Xu Han, Yuan Yao, Zhiyuan Liu, Maosong Sun:
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe. CoRR abs/2509.18154 (2025)
[i7]Weilin Zhao, Zihan Zhou, Zhou Su, Chaojun Xiao, Yuxuan Li, Yanghao Li, Yudi Zhang, Weilun Zhao, Zhen Li, Yuxiang Huang, Ao Sun, Xu Han
, Zhiyuan Liu:
InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation. CoRR abs/2509.24663 (2025)
[i6]Yuxiang Huang, Chaojun Xiao, Xu Han
, Zhiyuan Liu:
NOSA: Native and Offloadable Sparse Attention. CoRR abs/2510.13602 (2025)- 2024
[j2]Tianrui Xia, Jinzhao Xiao, Yuxiang Huang
, Changyu Hu, Shaoxu Song
, Xiangdong Huang, Jian-min Wang:
Time series data encoding in Apache IoTDB: comparative analysis and recommendation. VLDB J. 33(3): 727-752 (2024)
[c1]Weilin Zhao, Yuxiang Huang, Xu Han
, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun:
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding. EMNLP 2024: 13378-13393
[i5]Weilin Zhao, Yuxiang Huang, Xu Han
, Chaojun Xiao, Zhiyuan Liu, Maosong Sun:
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting. CoRR abs/2402.13720 (2024)
[i4]Shengding Hu, Yuge Tu, Xu Han
, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zhen Leng Thai, Kai Zhang, Chongyi Wang, Yuan Yao, Chenyang Zhao, Jie Zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun:
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies. CoRR abs/2404.06395 (2024)
[i3]Yuxiang Huang, Binhang Yuan, Xu Han
, Chaojun Xiao, Zhiyuan Liu:
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads. CoRR abs/2410.01805 (2024)- 2023
[i2]Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han
, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun:
Tool Learning with Foundation Models. CoRR abs/2304.08354 (2023)
[i1]Weilin Zhao, Yuxiang Huang, Xu Han
, Zhiyuan Liu, Zhengyan Zhang, Maosong Sun:
CPET: Effective Parameter-Efficient Tuning for Compressed Large Language Models. CoRR abs/2307.07705 (2023)- 2022
[j1]Jinzhao Xiao, Yuxiang Huang
, Changyu Hu, Shaoxu Song
, Xiangdong Huang, Jianmin Wang
:
Time Series Data Encoding for Efficient Storage: A Comparative Analysis in Apache IoTDB. Proc. VLDB Endow. 15(10): 2148-2160 (2022)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-06-12 21:32 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







