This collection contains all the GRPO-trained models for our paper "A Rising Tide Lifts All Boats". Please consider citing us!
Ishika Agarwal
ishikaa
·
AI & ML interests
active learning, reinforcement learning, reasoning, planning, NLP
Recent Activity
updated a model about 5 hours ago
ishikaa/UAS_student_qwen7b_only_alpaca_expweak2 published a model about 6 hours ago
ishikaa/UAS_student_qwen7b_only_alpaca_expweak2 updated a model about 7 hours ago
ishikaa/UAS_student_qwen7b_only_medmcqa_expweak2