Modeling Strong and Human-Like Gameplay with KL-Regularized Search.

Modeling Strong and Human-Like Gameplay with KL-Regularized ...

Dec 14, 2021 · We consider the task of building strong but human-like policies in multi-agent decision-making problems, given examples of human behavior.

Modeling Strong and Human-Like Gameplay with KL-Regularized ...

proceedings.mlr.press › ...

We show that regularized search algorithms that penalize KL divergence from an imitation-learned policy yield higher prediction accuracy of strong humans.

[PDF] Modeling Strong and Human-Like Gameplay with KL-Regularized ...

arxiv.org › pdf

In this paper, we study the problem of producing policies that are both strong and human-like in games with complex strategic planning like chess, Go, Hanabi, ...

Modeling Strong and Human-like Gameplay with KL-Regularized ...

openreview.net › forum

Apr 25, 2022 · We show in chess and Go that regularizing search based on the KL divergence from an imitation-learned policy results in higher human prediction accuracy and ...

Modeling Strong and Human-Like Gameplay with KL ... - NASA ADS

ui.adsabs.harvard.edu › abs › abstract

We show in chess and Go that regularizing search based on the KL divergence from an imitation-learned policy results in higher human prediction accuracy and ...

ICML 2022 Modeling Strong and Human-Like Gameplay with KL ...

icml.cc › virtual › spotlight

Modeling Strong and Human-Like Gameplay with KL-Regularized Search. Athul ... In chess and Go, we show that regularized search algorithms that penalize KL ...

[PDF] Modeling Strong and Human-Like Gameplay with KL-Regularized ...

icml.cc › media › icml-2022 › Slides › 16682_BRLGuFL

Modeling Strong and Human-Like Gameplay with KL-Regularized Search. International Conference on Machine Learning 2022. Poster details: HALL E #816. 6:30PM - 8: ...

Gabriele Farina - Modeling Strong and Human-Like Gameplay ... - MIT

www.mit.edu › ~gfarina › human_like_pikl_icml22

We show in chess and Go that regularizing search based on the KL divergence from an imitation-learned policy results in higher human prediction accuracy and ...

Modeling Strong and Human-Like Gameplay with KL-Regularized ...

www.researchgate.net › publication › 357046443_Modeling_Strong_and_...

Dec 14, 2021 · We consider the task of building strong but human-like policies in multi-agent decision-making problems, given examples of human behavior.

Noam Brown - Google Scholar

scholar.google.com › citations

Modeling strong and human-like gameplay with KL-regularized search. AP Jacob, DJ Wu, G Farina, A Lerer, H Hu, A Bakhtin, J Andreas, N Brown. International ...
