CodeGoat24's Collections: UnifiedReward Training Data
Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • 123
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Paper • 2505.03318 • 92
CodeGoat24/UnifiedReward-2.0-T2X-score-data
Viewer • 337k • 277
CodeGoat24/ImageGen-CoT-Reward-5K
Viewer • 5.54k • 97 • 1
CodeGoat24/LLaVA-Critic-113k
Preview • 187
CodeGoat24/ShareGPTVideo-DPO
Viewer • 101k • 66