Manuel M. H. Roth, Anupama Hegde, Thomas Delamotte, Andreas Knopp: Shaping Rewards, Shaping Routes: On Multi-Agent Deep Q-Networks for Routing in Satellite Constellation Networks. CoRR abs/2408.01979 (2024)