Paper Reading #4: POMO
Last updated on August 19, 2025
This post is a close reading of the paper "POMO: Policy Optimization with Multiple Optima for Reinforcement Learning" by Kwon et al. (2020), available at arXiv:2010.16011.
https://cny123222.github.io/2025/08/19/Paper-Reading-4-POMO/