Thinh T. Doan
Zitiert von
Zitiert von
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation for Multi-Agent Reinforcement Learning
TT Doan, ST Maguluri, J Romberg
International Conference on Machine Learning, 2019
Performance of Q-learning with Linear Function Approximation: Stability and Finite-Time Analysis
Z Chen, S Zhang, TT Doan, ST Maguluri, JP Clarke
arXiv preprint arXiv:1905.11425, 2019
Fast Convergence Rates of Distributed Subgradient Methods with Adaptive Quantization
TT Doan, ST Maguluri, J Romberg
arXiv preprint arXiv:1810.13245, 2018
Distributed resource allocation on dynamic networks in quadratic time
TT Doan, A Olshevsky, 2015
Convergence of the iterates in mirror descent methods
TT Doan, S Bose, DH Nguyen, CL Beck
IEEE control systems letters 3 (1), 114-119, 2018
Convergence rates of distributed gradient methods under random quantization: A stochastic approximation approach
TT Doan, ST Maguluri, J Romberg
IEEE Transactions on Automatic Control 66 (10), 4469-4484, 2020
Distributed Lagrangian Methods for Network Resource Allocation
TT Doan, CL Beck
arXiv preprint arXiv:1609.06287, 2016
Finite-time performance of distributed temporal-difference learning with linear function approximation
TT Doan, ST Maguluri, J Romberg
SIAM Journal on Mathematics of Data Science 3 (1), 298-320, 2021
On the convergence rate of distributed gradient methods for finite-sum optimization under communication delays
TT Doan, CL Beck, R Srikant
arXiv preprint arXiv:1708.03277, 2017
On the Convergence Rate of Distributed Gradient Methods for Finite-Sum Optimization under Communication Delays
TT Doan, CL Beck, R Srikant
Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (2 …, 2017
On the geometric convergence rate of distributed economic dispatch/demand response in power networks
TT Doan, A Olshevsky
arXiv preprint arXiv:1609.06660, 2016
Finite-time analysis and restarting scheme for linear two-time-scale stochastic approximation
TT Doan
SIAM Journal on Control and Optimization 59 (4), 2798-2819, 2021
Distributed resource allocation over dynamic networks with uncertainty
TT Doan, CL Beck
IEEE Transactions on Automatic Control, 2020
Finite sample analysis of two-time-scale natural actor-critic algorithm
S Khodadadian, TT Doan, J Romberg, ST Maguluri
IEEE Transactions on Automatic Control, 2022
A decentralized policy gradient approach to multi-task reinforcement learning
S Zeng, MA Anwar, TT Doan, A Raychowdhury, J Romberg
Uncertainty in Artificial Intelligence, 1002-1012, 2021
Convergence rates of accelerated markov gradient descent with applications in reinforcement learning
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2002.02873, 2020
Nonlinear Two-Time-Scale Stochastic Approximation Convergence and Finite-Time Performance
TT Doan
IEEE Transactions on Automatic Control, 2022
Finite-time analysis of stochastic gradient descent under markov randomness
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2003.10973, 2020
Distributed two-time-scale methods over clustered networks
TV Pham, TT Doan, DH Nguyen
2021 American Control Conference (ACC), 4625-4630, 2021
Linear two-time-scale stochastic approximation a finite-time analysis
TT Doan, J Romberg
2019 57th Annual Allerton Conference on Communication, Control, and …, 2019
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20