Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
uavleeva
/
grpo_merged_math_sql_code_ties_001
like
0
Text Generation
PEFT
Safetensors
Transformers
lora
unsloth
conversational
Model card
Files
Files and versions
xet
Community
Use this model
Uploaded model
Uploaded model
Developed by:
uavleeva
License:
apache-2.0
Finetuned from model :
unsloth/qwen2.5-coder-7b-instruct-bnb-4bit
This qwen2 model was trained 2x faster with
Unsloth
Downloads last month
7
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Collection including
uavleeva/grpo_merged_math_sql_code_ties_001
Multitask RLVR using GRPO (HSE Project)
Collection
15 items
•
Updated
7 days ago