UPDF AI

Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm

Junling Hu,Michael P. Wellman

1998 · DBLP: conf/icml/HuW98
International Conference on Machine Learning · 944 citazioni

TLDR

A multiagent Q-learning method is designed under general-sum stochastic games, and it is proved that it converges to a Nash equilibrium under speci ed conditions.