


default search action
Wenpin Tang
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[i22]Yilie Huang, Wenpin Tang, Xun Yu Zhou:
ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule. CoRR abs/2601.18681 (2026)
[i21]Zhengyi Guo, Wenpin Tang, Renyuan Xu:
Conditional Diffusion Guidance under Hard Constraint: A Stochastic Analysis Approach. CoRR abs/2602.05533 (2026)- 2025
[j10]Genta Indra Winata, Hanyang Zhao, Anirban Das, Wenpin Tang, David D. Yao, Shi-Xiong Zhang, Sambit Sahu:
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey. J. Artif. Intell. Res. 82: 2595-2661 (2025)
[j9]Wenpin Tang
, David D. Yao:
Polynomial Voting Rules. Math. Oper. Res. 50(1): 90-106 (2025)
[j8]Wenpin Tang
, Hung V. Tran
, Yuming Paul Zhang
:
Policy Iteration for the Deterministic Control Problems - A Viscosity Approach. SIAM J. Control. Optim. 63(1): 375-401 (2025)
[j7]Wenpin Tang
, Yuming Paul Zhang
:
The Convergence Rate of Vanishing Viscosity Approximations for Mean Field Games. SIAM J. Math. Anal. 57(3): 3217-3254 (2025)
[c6]Haoxian Chen, Hanyang Zhao, Henry Lam, David D. Yao, Wenpin Tang:
MallowsPO: Fine-Tune Your LLM with Preference Dispersions. ICLR 2025
[c5]Hanyang Zhao, Genta Indra Winata, Anirban Das, Shi-Xiong Zhang, David D. Yao, Wenpin Tang, Sambit Sahu:
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization. ICLR 2025
[c4]Hanyang Zhao, Haoxian Chen, Ji Zhang, David D. Yao, Wenpin Tang:
Score as Action: Fine Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning. ICML 2025
[i20]Hanyang Zhao, Haoxian Chen, Ji Zhang, David D. Yao, Wenpin Tang:
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning. CoRR abs/2502.01819 (2025)
[i19]Hanyang Zhao, Haoxian Chen, Yucheng Guo, Genta Indra Winata, Tingting Ou, Ziyu Huang, David D. Yao, Wenpin Tang:
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization. CoRR abs/2503.11720 (2025)
[i18]Zhengyi Guo, Jiatu Li, Wenpin Tang, David D. Yao:
Diffusion Generative Models Meet Compressed Sensing, with Applications to Imaging and Finance. CoRR abs/2509.03898 (2025)
[i17]Hanyang Zhao, Dawen Liang, Wenpin Tang, David D. Yao, Nathan Kallus:
DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning. CoRR abs/2510.02212 (2025)
[i16]Jiayuan Sheng, Hanyang Zhao, Haoxian Chen, David D. Yao, Wenpin Tang:
Understanding Sampler Stochasticity in Training Diffusion Models for RLHF. CoRR abs/2510.10767 (2025)
[i15]Haoting Zhang, Haoxian Chen, Donglin Zhan, Hanyang Zhao, Henry Lam, Wenpin Tang, David D. Yao, Zeyu Zheng:
SOCRATES: Simulation Optimization with Correlated Replicas and Adaptive Trajectory Evaluations. CoRR abs/2511.00685 (2025)- 2024
[i14]Wenpin Tang, Hanyang Zhao:
Contractive Diffusion Probabilistic Models. CoRR abs/2401.13115 (2024)
[i13]Wenpin Tang, Hanyang Zhao:
Score-based Diffusion Models via Stochastic Differential Equations - a Technical Tutorial. CoRR abs/2402.07487 (2024)
[i12]Wenpin Tang:
Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond. CoRR abs/2403.06279 (2024)
[i11]Haoxian Chen, Hanyang Zhao, Henry Lam, David D. Yao, Wenpin Tang:
Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions. CoRR abs/2405.14953 (2024)
[i10]Hanyang Zhao, Haoxian Chen, Ji Zhang, David D. Yao, Wenpin Tang:
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning. CoRR abs/2409.08400 (2024)
[i9]Genta Indra Winata, Hanyang Zhao, Anirban Das, Wenpin Tang, David D. Yao, Shi-Xiong Zhang, Sambit Sahu:
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey. CoRR abs/2409.11564 (2024)
[i8]Hanyang Zhao, Genta Indra Winata, Anirban Das, Shi-Xiong Zhang, David D. Yao, Wenpin Tang, Sambit Sahu:
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization. CoRR abs/2410.04203 (2024)
[i7]Wenpin Tang, Xun Yu Zhou:
Regret of exploratory policy improvement and q-learning. CoRR abs/2411.01302 (2024)- 2023
[j6]Didong Li, Wenpin Tang, Sudipto Banerjee:
Inference for Gaussian Processes with Matern Covariogram on Compact Riemannian Manifolds. J. Mach. Learn. Res. 24: 101:1-101:26 (2023)
[c3]Hanyang Zhao, Wenpin Tang, David D. Yao:
Policy Optimization for Continuous Reinforcement Learning. NeurIPS 2023
[i6]Wenpin Tang, Hung Vinh Tran, Yuming Paul Zhang:
Policy iteration for the deterministic control problems - a viscosity approach. CoRR abs/2301.00419 (2023)
[i5]Hanyang Zhao, Wenpin Tang, David D. Yao:
Policy Optimization for Continuous Reinforcement Learning. CoRR abs/2305.18901 (2023)
[i4]Wenpin Tang, David D. Yao:
Transaction fee mechanism for Proof-of-Stake protocol. CoRR abs/2308.13881 (2023)- 2022
[j5]Wenpin Tang
, Xiao Xu
, Xun Yu Zhou
:
Asset selection via correlation blockmodel clustering. Expert Syst. Appl. 195: 116558 (2022)
[j4]Xin Guo
, Wenpin Tang
, Renyuan Xu
:
A Class of Stochastic Games and Moving Free Boundary Problems. SIAM J. Control. Optim. 60(2): 758-785 (2022)
[j3]Wenpin Tang
, Yuming Paul Zhang, Xun Yu Zhou:
Exploratory HJB Equations and Their Convergence. SIAM J. Control. Optim. 60(6): 3191-3216 (2022)- 2021
[j2]Xiao Fang, Han L. Gan, Susan P. Holmes
, Haiyan Huang, Erol Peköz, Adrian Röllin
, Wenpin Tang:
Arcsine laws for random walks generated from random permutations with applications to genomics. J. Appl. Probab. 58(4): 851-867 (2021)- 2020
[c2]Wenpin Tang, Xin Guo, Fengmin Tang:
The Buckley-Osthus model and the block preferential attachment model: statistical analysis and application. ICML 2020: 9377-9386
[i3]Xin Guo, Jiequn Han, Wenpin Tang:
Perturbed gradient descent with occupation time. CoRR abs/2005.04507 (2020)
[i2]Wenpin Tang:
Learning an arbitrary mixture of two multinomial logits. CoRR abs/2007.00204 (2020)
2010 – 2019
- 2019
[j1]Wenpin Tang
:
Exponential ergodicity and convergence for generalized reflected Brownian motion. Queueing Syst. Theory Appl. 92(1-2): 83-101 (2019)
[c1]Wenpin Tang:
Mallows ranking models: maximum likelihood estimate and regeneration. ICML 2019: 6125-6134
[i1]Xin Guo, Fengmin Tang, Wenpin Tang:
Consistency of the Buckley-Osthus model and the hierarchical preferential attachment model. CoRR abs/1910.07698 (2019)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-20 23:52 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







