FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game
This academic article is about the Factorized Multi-Agent MiniMax Q-Learning (FM3Q), a new framework for two-team zero-sum Markov games. The authors identify inefficiencies in existing methods and propose the individual-global-minimax…
Continue reading