Releases: om-ai-lab/VLM-R1
v0.2.1
What's Changed
- Develop/v0.2.0 by @SZhanZ in #177
- Develop/v0.2.0 by @SZhanZ in #186
- Develop/v0.2.0 by @SZhanZ in #187
- Develop/v0.2.0 by @SZhanZ in #188
- update README by @SZhanZ in #189
- add findings info in README by @SZhanZ in #190
- fix mcq reward by @Amos1109 in #144
- Feat: Clip Higher by @SabaPivot in #199
- Feature/v0.2.1/odLength_reward by @P3ngLiu in #207
- sync to main by @SZhanZ in #208
- Add more rewards to Develop/v0.2.1 by @zhangqianqianhzlh in #209
- Develop/v0.2.1 by @SZhanZ in #216
- fix processing_class bug by @SZhanZ in #223
- release tech report by @SZhanZ in #228
- update bibtex by @SZhanZ in #230
- add zero2 json by @SZhanZ in #231
New Contributors
- @SabaPivot made their first contribution in #199
- @P3ngLiu made their first contribution in #207
Full Changelog: v0.2.0...v0.2.1
v0.2.0
v0.1.0
What's Changed
- Ruox/main jsonl dataloader by @xrc10 in #14
- docs: update README.md by @eltociear in #27
- fix model torch_dtype setting by @zhangqianqianhzlh in #49
- custom reward by @zhangqianqianhzlh in #55
- multi-node GRPO recipe by @xrc10 in #59
- add epsilon clipping for GRPO by @xrc10 in #61
- formats are not unique by @Amos1109 in #65
- Add num_iterations from original GRPO algorithm by @xrc10 in #78
- Update grpo_trainer.py by @davidluciolu in #58
- add yes_no_reward function by @KingSan666888 in #88
- fix default reward method by @zhangqianqianhzlh in #87
- fix multi match case by @zhangqianqianhzlh in #94
- convert other types of answers by @xrc10 in #95
- fix data loader and batching by @xrc10 in #98
- fix: pin transformers to v4.49.0 to resolve model loading issues by @chaoyuhao in #105
- llm reward by @Amos1109 in #115
- math_reward by @Amos1109 in #127
- support language training data by @zhangqianqianhzlh in #130
New Contributors
- @xrc10 made their first contribution in #14
- @eltociear made their first contribution in #27
- @zhangqianqianhzlh made their first contribution in #49
- @Amos1109 made their first contribution in #65
- @davidluciolu made their first contribution in #58
- @chaoyuhao made their first contribution in #105
Full Changelog: https://github.com/om-ai-lab/VLM-R1/commits/v0.1.0