Synthesis of Fault-Tolerant Reliable Broadcast Algorithms With Reinforcement Learning