Bhatnagar, S., & Lakshmanan, K. (2012). An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes. Journal of Optimization Theory and Applications, 153(3), 688-708. https://doi.org/10.1007/s10957-012-9989-5
Chicago (17th ed.) citation: Bhatnagar, Shalabh, and K. Lakshmanan. "An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes." Journal of Optimization Theory and Applications 153, no. 3 (2012): 688-708. https://doi.org/10.1007/s10957-012-9989-5.
MLA (9th ed.) citation: Bhatnagar, Shalabh, and K. Lakshmanan. "An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes." Journal of Optimization Theory and Applications, vol. 153, no. 3, 2012, pp. 688-708, https://doi.org/10.1007/s10957-012-9989-5.