YWZBrandon/verl_agent_swebench_sum_reward_v2_grpo_qwen3_4b_qmax_pv5_2048_sft_v2_grp16_max40_s35 Updated 5 days ago
YWZBrandon/verl_agent_swebench_sum_reward_v2_grpo_qwen3_4b_glm_pv5_2048_sft_v2_grp16_max40_s35 Updated 5 days ago
YWZBrandon/verl_agent_swebench_t100_sum_v1_grpo_qwen3_4b_temp1_pv5_2048_sft_v2_grp16_s150 Updated 5 days ago
YWZBrandon/google_flan-t5-base_temporal_10_clusters_6_full_upsample1000 0.2B • Updated May 15, 2025 • 3
YWZBrandon/wikidyk-scope-clf-deberta-v3-large-temporal_10_clusters Text Classification • 0.4B • Updated May 15, 2025 • 1
YWZBrandon/google_flan-t5-large_temporal_10_clusters_9_full_upsample1000 0.8B • Updated May 15, 2025 • 1
YWZBrandon/google_flan-t5-large_temporal_10_clusters_4_full_upsample1000 0.8B • Updated May 14, 2025 • 4
YWZBrandon/google_gemma-3-1b-pt_qa_ds100_upsample1000 Text Generation • 1.0B • Updated May 14, 2025 • 3
YWZBrandon/google_flan-t5-large_temporal_10_clusters_3_full_upsample1000 0.8B • Updated May 14, 2025 • 1
YWZBrandon/google_flan-t5-large_temporal_10_clusters_7_full_upsample1000 0.8B • Updated May 14, 2025 • 1
YWZBrandon/meta-llama_Llama-3.2-1B_qa_ds100_upsample1000 Text Generation • 1B • Updated May 14, 2025
YWZBrandon/google_flan-t5-large_temporal_10_clusters_5_full_upsample1000 0.8B • Updated May 14, 2025 • 1
YWZBrandon/google_flan-t5-large_temporal_10_clusters_2_full_upsample1000 0.8B • Updated May 14, 2025 • 1
YWZBrandon/google_flan-t5-large_temporal_3_clusters_1_full_upsample1000 0.8B • Updated May 14, 2025 • 2
YWZBrandon/google_flan-t5-large_temporal_10_clusters_1_full_upsample1000 0.8B • Updated May 14, 2025 • 1
YWZBrandon/downloaded_models_Llama-3.2-1B_qa_ds3500_upsample1000 Text Generation • 1B • Updated May 14, 2025 • 2
YWZBrandon/google_flan-t5-large_temporal_10_clusters_0_full_upsample1000 0.8B • Updated May 14, 2025 • 3
YWZBrandon/google_flan-t5-base_temporal_3_clusters_2_full_upsample1000 0.2B • Updated May 14, 2025 • 2
YWZBrandon/meta-llama_Llama-3.2-1B_full_upsample1000 Text Generation • 1B • Updated May 14, 2025 • 3