Castellini, A.; Bianchi, F.; Zorzi, E.; Simao, T. D.; Farinelli, A.; Spaan, M. T. J.,
Scalable Safe Policy Improvement via Monte Carlo Tree Search
in «Proceedings of Machine Learning Research» (PMLR),
in Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 23-29 July 2023,
2023,
pp. 3732-3756