Independent reinforcement learners in cooperative Markov