Releases · om-ai-lab/VLM-R1 · GitHub

15 Apr 06:09

SZhanZ

v0.2.1 Latest

Latest

What's Changed

Develop/v0.2.0 by @SZhanZ in #177
Develop/v0.2.0 by @SZhanZ in #186
Develop/v0.2.0 by @SZhanZ in #187
Develop/v0.2.0 by @SZhanZ in #188
update README by @SZhanZ in #189
add findings info in README by @SZhanZ in #190
fix mcq reward by @Amos1109 in #144
Feat: Clip Higher by @SabaPivot in #199
Feature/v0.2.1/odLength_reward by @P3ngLiu in #207
sync to main by @SZhanZ in #208
Add more rewards to Develop/v0.2.1 by @zhangqianqianhzlh in #209
Develop/v0.2.1 by @SZhanZ in #216
fix processing_class bug by @SZhanZ in #223
release tech report by @SZhanZ in #228
update bibtex by @SZhanZ in #230
add zero2 json by @SZhanZ in #231

New Contributors

@SabaPivot made their first contribution in #199
@P3ngLiu made their first contribution in #207

Full Changelog: v0.2.0...v0.2.1

Contributors

P3ngLiu, SZhanZ, and 3 other contributors

Assets 2

24 Mar 11:50

SZhanZ

v0.2.0

What's Changed

add test_od_r1 by @KingSan666888 in #157
Develop/v0.2.0 by @SZhanZ in #165
add math model in README by @SZhanZ in #166
add features in README by @SZhanZ in #170
Sync the blog content. by @SZhanZ in #176

Full Changelog: v0.1.0...v0.2.0

Contributors

KingSan666888 and SZhanZ

Assets 2

17 Mar 08:16

SZhanZ

v0.1.0

What's Changed

Ruox/main jsonl dataloader by @xrc10 in #14
docs: update README.md by @eltociear in #27
fix model torch_dtype setting by @zhangqianqianhzlh in #49
custom reward by @zhangqianqianhzlh in #55
multi-node GPRO recipe by @xrc10 in #59
add epsilon clipping for GRPO by @xrc10 in #61
formats are not unique by @Amos1109 in #65
Add num_iterations from original GRPO algorithm by @xrc10 in #78
Update grpo_trainer.py by @davidluciolu in #58
add yes_no_reward function by @KingSan666888 in #88
fix default reward method by @zhangqianqianhzlh in #87
fix multi match case by @zhangqianqianhzlh in #94
convert other types of answers by @xrc10 in #95
fix data loader and batching by @xrc10 in #98
fix: pin transformers to v4.49.0 to resolve model loading issues by @chaoyuhao in #105
llm reward by @Amos1109 in #115
math_reward by @Amos1109 in #127
support language training data by @zhangqianqianhzlh in #130

New Contributors

@xrc10 made their first contribution in #14
@eltociear made their first contribution in #27
@zhangqianqianhzlh made their first contribution in #49
@Amos1109 made their first contribution in #65
@davidluciolu made their first contribution in #58
@chaoyuhao made their first contribution in #105

Full Changelog: https://github.com/om-ai-lab/VLM-R1/commits/v0.1.0

Contributors

xrc10, eltociear, and 5 other contributors

Assets 2