Palabras claves: compact state representation, discrete-time Markov decision process, ENERGY SAVING, heterogeneous cellular networks, reinforcement learning, team Markov game, traffic load balancing, traffic offloading, wireless networks
Autores: Cai Y., Chen T., Chen X., Jinsong Wu, Zhang H.