Qwen2.5-VL-3B & 7B models trained with PC-GRPO in the paper: Puzzle Curriculum GRPO for Vision-Centric Reasoning
-
armenjeddi/PCGRPO-Qwen2.5-VL-3B-Jigsaw-Base-plus-curriculum-plus-CARE
4B • Updated • 45 -
armenjeddi/PCGRPO-Qwen2.5-VL-3B-MixPuzzles-Base-plus-curriculum-plus-CARE
4B • Updated • 26 -
armenjeddi/PCGRPO-Qwen2.5-VL-7B-Jigsaw-Base
8B • Updated • 18 -
armenjeddi/PCGRPO-Qwen2.5-VL-7B-Jigsaw-Base-plus-CARE
8B • Updated • 48