A collection of papers and projects that train flow matching models and policies via reinforcement learning (RL). We focus on applications to computer vision (CV) and robotics. The list is updated on a regular basis.
Please give it a star ⭐ if you like this project!
Contributors: Tonghe Zhang, Kang Chen, Zeyue Xue
Method | Paper | Code | Website | Domain | Online/Offline | On-policy/Off-policy | Pre-train/Fine-tune |
---|---|---|---|---|---|---|---|
FQL | arXiv | GitHub | Link | Robotics | Off2On | Off-policy | Pre-train + Fine-tune |
ReinFlow | arXiv | GitHub | Link | Robotics | Online | On-policy | Fine-tune |
FPO | arXiv | GitHub | Link | Robotics | Online | On-policy | Pre-train |
DSRL | arXiv | GitHub | Link | Robotics | Online | Off-policy | Fine-tune |
Flow-GRPO | arXiv | GitHub | Link | CV | Online | On-policy | Fine-tune |
DanceGRPO | arXiv | GitHub | Link | CV | Online | On-policy | Fine-tune |
Mix-GRPO | arXiv | GitHub | Link | CV | Online | On-policy | Fine-tune |
TempFlow-GRPO | arXiv | GitHub | Link | CV | Online | On-policy | Fine-tune |
DSRL-pi0 | arXiv | GitHub | N/A | Robotics | Online/Offline/Off2On | Off-policy | Fine-tune |
FPMD | arXiv | N/A | N/A | Robotics | Online | Off-policy | Pre-train |
RLFM | arXiv | GitHub | N/A | Robotics | Online | On-policy | Fine-tune |