Ray2333/Qwen3-VL-3B-sft-reasoning_and_grounding_changecoord_mixnoreasoning_cpt636 4B • Updated 3 days ago • 26
Ray2333/Qwen3-VL-7B-sft-reasoning_and_grounding_changecoord_mixnoreasoning_cpt636 8B • Updated 3 days ago • 24
Ray2333/Qwen3-VL-4B-weighted_sft_ratio2-reasoning_and_grounding_changecoord_mixnoreasoning_cpt637 4B • Updated 3 days ago • 36
Ray2333/Qwen3-VL-8B-weighted_sft_ratio2-reasoning_and_grounding_changecoord_mixnoreasoning_cpt637 9B • Updated 3 days ago • 24
Ray2333/Qwen3-VL-4B-sft-reasoning_and_grounding_changecoord_mixnoreasoning_cpt637 4B • Updated 3 days ago • 18
Ray2333/Qwen3-VL-8B-sft-reasoning_and_grounding_changecoord_mixnoreasoning_cpt637 9B • Updated 3 days ago • 26
Ray2333/Qwen2.5-VL-3B-weighted_sft_ratio2-reasoning_and_grounding_changecoord_mixnoreasoning_cpt636 4B • Updated 3 days ago • 17
Ray2333/Qwen2.5-VL-7B-weighted_sft_ratio2-reasoning_and_grounding_changecoord_mixnoreasoning_cpt636 8B • Updated 3 days ago • 26
Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback Text Classification • 7B • Updated Feb 5, 2025 • 886 • 11
Ray2333/gpt2-large-helpful-reward_model Text Classification • 0.8B • Updated Jun 2, 2024 • 26.6k • 13
Ray2333/gpt2-large-harmless-reward_model Text Classification • 0.8B • Updated Jun 2, 2024 • 25.6k • 4