Feature/v0.2.1/odLength_reward #207

P3ngLiu · 2025-04-01T02:17:54Z

Update the grpo_jsonl.py file to add the functionality for calculating mAP rewards, supporting length penalties and the selection of different scoring types.
Fix the handling logic for ref_per_token_logps in grpo_trainer.py to ensure beta=0 works for KL setting.

…_reward Feature/v0.2.1/odLength_reward

add od_ap, od_ap50 and odLength reward

b8a6462

SZhanZ merged commit 8a0af96 into om-ai-lab:develop/v0.2.1 Apr 1, 2025

IANNXANG pushed a commit to IANNXANG/VLM-R1 that referenced this pull request May 20, 2025

Merge pull request om-ai-lab#207 from P3ngLiu/feature/v0.2.1/odLength…

1b890cc

…_reward Feature/v0.2.1/odLength_reward

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/v0.2.1/odLength_reward #207

Feature/v0.2.1/odLength_reward #207

Uh oh!

P3ngLiu commented Apr 1, 2025

Uh oh!

Uh oh!

Feature/v0.2.1/odLength_reward #207

Feature/v0.2.1/odLength_reward #207

Uh oh!

Conversation

P3ngLiu commented Apr 1, 2025

Uh oh!

Uh oh!