Return to Article Details Multi-Agent Post-Co-Training of Large Language Models via Reinforcement Learning Download Download PDF