Mallick, P., Chen, Z., & Zamani, M. (2022). Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics. Neurocomputing (Amsterdam), 484, 79-88. https://doi.org/10.1016/j.neucom.2021.01.142
Chicago citation style (17th ed.): Mallick, Prakash, Zhiyiong Chen, and Mohsen Zamani. "Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics." Neurocomputing (Amsterdam) 484 (2022): 79-88. https://doi.org/10.1016/j.neucom.2021.01.142.
MLA citation style (9th ed.): Mallick, Prakash, et al. "Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics." Neurocomputing (Amsterdam), vol. 484, 2022, pp. 79-88, https://doi.org/10.1016/j.neucom.2021.01.142.