Why do you assume this has been tried? It's not even clear what the game is. In this setting, what state and actions would the algorithm have access to?
In some games it could find an equilibrium where it keeps the game going indefinitely, for example by moving back and forth (though that won't work in a game like Go[1]).
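
To make the "back and forth" point concrete, here is a toy sketch of my own, not anything from the original setup. The track game, the `play` function and its parameters are all made up for illustration: an agent walks on cells `0..n_cells-1`, the game ends when it reaches the last cell, and the agent's only goal is to stall. The `forbid_repeats` flag is a crude stand-in for a no-repetition rule, loosely in the spirit of the repetition rules in Go, which apply to whole-board positions rather than to a single coordinate.

```python
def play(n_cells, forbid_repeats, max_steps=1000):
    """Run a stalling agent on a track of cells 0..n_cells-1.

    The game ends when the agent reaches the last cell or has no legal move.
    Returns the number of moves played, or None if max_steps was reached
    without the game ending (i.e. the agent stalled successfully).
    """
    pos = 0
    seen = {pos}  # every position visited so far
    for step in range(max_steps):
        if pos == n_cells - 1:
            return step  # terminal cell reached
        # Candidate moves: one cell left or right, staying on the track.
        moves = [p for p in (pos - 1, pos + 1) if 0 <= p < n_cells]
        if forbid_repeats:
            # Hypothetical repetition rule: never revisit a position.
            moves = [p for p in moves if p not in seen]
        if not moves:
            return step  # no legal move left, game over
        # Stalling policy: always pick the move farthest from the terminal cell.
        pos = min(moves)
        seen.add(pos)
    return None


print(play(4, forbid_repeats=False))  # agent oscillates between cells 0 and 1
print(play(4, forbid_repeats=True))   # agent is forced forward to the end
```

Without the rule the agent just bounces between the first two cells until the step limit; with it, each position can be used only once, so the game has to end in a bounded number of moves.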