Follow
Zhaoran Wang
Title
Cited by
Cited by
Year
Provably Efficient Reinforcement Learning with Linear Function Approximation
C Jin, Z Yang, Z Wang, MI Jordan
Annual Conference on Learning Theory, 2020
3452020
A Theoretical Analysis of Deep Q-Learning
J Fan, Z Wang, Y Xie, Z Yang
Learning for Dynamics and Control, 2020
3232020
Optimal Computational and Statistical Rates of Convergence for Sparse Nonconvex Learning Problems
Z Wang, H Liu, T Zhang
Annals of Statistics, 2014
1792014
A Nonconvex Optimization Framework for Low Rank Matrix Estimation
T Zhao, Z Wang, H Liu
Advances in Neural Information Processing Systems, 2015
175*2015
A Strictly Contractive Peaceman--Rachford Splitting Method for Convex Programming
B He, H Liu, Z Wang, X Yuan
SIAM Journal on Optimization, 2014
1612014
Provably Efficient Exploration in Policy Optimization
Q Cai, Z Yang, C Jin, Z Wang
International Conference on Machine Learning, 2020
1552020
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization
HT Wai, Z Yang, Z Wang, M Hong
Advances in Neural Information Processing Systems, 2018
1422018
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
L Wang, Q Cai, Z Yang, Z Wang
International Conference on Learning Representations, 2020
1372020
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
B Liu, Q Cai, Z Yang, Z Wang
Advances in Neural Information Processing Systems, 2019
130*2019
High-Dimensional Expectation-Maximization Algorithm: Statistical Optimization and Asymptotic Normality
Z Wang, Q Gu, Y Ning, H Liu
Advances in Neural Information Processing Systems, 2015
1022015
Is Pessimism Provably Efficient for Offline RL?
Y Jin, Z Yang, Z Wang
International Conference on Machine Learning, 2021
962021
Symmetry, Saddle Points, and Global Optimization Landscape of Nonconvex Matrix Factorization
X Li, J Lu, R Arora, J Haupt, H Liu, Z Wang, T Zhao
IEEE Transactions on Information Theory, 2019
90*2019
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
M Hong, HT Wai, Z Wang, Z Yang
SIAM Journal on Optimization, 2022
832022
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima
Q Cai, Z Yang, JD Lee, Z Wang
Advances in Neural Information Processing Systems, 2019
832019
Nonconvex Statistical Optimization: Minimax-Optimal Sparse PCA in Polynomial Time
Z Wang, H Lu, H Liu
Advances in Neural Information Processing Systems, 2014
82*2014
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Z Yang, Y Chen, M Hong, Z Wang
Advances in Neural Information Processing Systems, 2019
752019
Low-Rank and Sparse Structure Pursuit via Alternating Minimization
Q Gu, Z Wang, H Liu
International Conference on Artificial Intelligence and Statistics, 2016
732016
Sparse Nonlinear Regression: Parameter Estimation and Asymptotic Inference under Nonconvexity
Z Yang, Z Wang, H Liu, YC Eldar, T Zhang
International Conference on Machine Learning, 2016
71*2016
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
D Ding, X Wei, Z Yang, Z Wang, MR Jovanović
International Conference on Artificial Intelligence and Statistics, 2021
692021
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Q Xie, Y Chen, Z Wang, Z Yang
Annual Conference on Learning Theory, 2020
622020
The system can't perform the operation now. Try again later.
Articles 1–20