mssfj/Qwen2.5-7B-Instruct_dbbench_grpo_dataset_react-2 • Text Generation • 8B • Updated 2 days ago • 24
mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset-2 • Text Generation • 8B • Updated 3 days ago • 25
mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset • Text Generation • 8B • Updated 3 days ago • 75
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-15 • Text Generation • 8B • Updated 5 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-14 • Text Generation • 8B • Updated 5 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-13 • Text Generation • 8B • Updated 5 days ago