On the Convergence Rate of MCTS for the Optimal Value Estimation in Markov Decision Processes
A recent theoretical analysis of a Monte-Carlo tree search (MCTS) method properly modified from the "upper confidence bound applied to trees" (UCT) algorithm established a surprising result, due to a great deal of empirical successes reported from heuristic usage of UCT with relevant adjus...
Saved in:
| Published in: | IEEE transactions on automatic control Vol. 70; no. 7; pp. 4788 - 4793 |
|---|---|
| Main Author: | |
| Format: | Journal Article |
| Language: | English |
| Published: |
New York
IEEE
01.07.2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Subjects: | |
| ISSN: | 0018-9286, 1558-2523 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!