Physics-augmented safe reinforcement learning for overload mitigation
In this section, each algorithm was trained for 4000 episodes to learn overload prevention control strategies for the power distribution network, with each episode comprising 96 control interactions