Bhatnagar, S., & Lakshmanan, K. (2012). An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes. Journal of optimization theory and applications, 153(3), 688-708. https://doi.org/10.1007/s10957-012-9989-5
Chicago-Zitierstil (17. Ausg.)Bhatnagar, Shalabh, und K. Lakshmanan. "An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes." Journal of Optimization Theory and Applications 153, no. 3 (2012): 688-708. https://doi.org/10.1007/s10957-012-9989-5.
MLA-Zitierstil (9. Ausg.)Bhatnagar, Shalabh, und K. Lakshmanan. "An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes." Journal of Optimization Theory and Applications, vol. 153, no. 3, 2012, pp. 688-708, https://doi.org/10.1007/s10957-012-9989-5.