Policy teaching via environment poisoning: Training-time adversarial attacks against reinforcement learning A Rakhsha, G Radanovic, R Devidze, X Zhu, A Singla International Conference on Machine Learning, 7974-7984, 2020 | 143 | 2020 |
Reward poisoning in reinforcement learning: Attacks against unknown learners in unknown environments A Rakhsha, X Zhang, X Zhu, A Singla arXiv preprint arXiv:2102.08492, 2021 | 42 | 2021 |
Policy teaching in reinforcement learning via environment poisoning attacks A Rakhsha, G Radanovic, R Devidze, X Zhu, A Singla Journal of Machine Learning Research 22 (210), 1-45, 2021 | 28 | 2021 |
Operator splitting value iteration A Rakhsha, A Wang, M Ghavamzadeh, A Farahmand Advances in Neural Information Processing Systems 35, 38373-38385, 2022 | 6 | 2022 |
Deflated dynamics value iteration J Lee, A Rakhsha, EK Ryu, A Farahmand arXiv preprint arXiv:2407.10454, 2024 | 1 | 2024 |
Maximum Entropy Model Correction in Reinforcement Learning A Rakhsha, M Kemertas, M Ghavamzadeh, A Farahmand arXiv preprint arXiv:2311.17855, 2023 | 1 | 2023 |
PID Accelerated Temporal Difference Algorithms M Bedaywi, A Rakhsha, A Farahmand arXiv preprint arXiv:2407.08803, 2024 | | 2024 |