payelb/HHRLHF_roberta-base_1k_fixed_MARS_semantic_distance_synth Text Classification • 0.1B • Updated 1 day ago • 18
payelb/PKUSafeRLHF_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_refined Text Classification • 0.2B • Updated 2 days ago • 100
payelb/PKUSafeRLHF_roberta-base_1k_fixed_MARS_semantic_refined Text Classification • 0.1B • Updated 3 days ago • 159
payelb/UltraFeedback_openbmb_roberta-base_1k_fixed_MARS_semantic_refined Text Classification • 0.1B • Updated 4 days ago • 34
payelb/UltraFeedback_openbmb_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_refined Text Classification • 0.2B • Updated 5 days ago • 51
payelb/HHRLHF_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_distance_synth Text Classification • 0.2B • Updated 7 days ago • 29