RLVF pipeline that uses parser oracles to align language models for Icelandic and Danish. GPT-SW3 and Viking-13B are trained with Delta-DPO.
Fakhar
Hodfa71
Recent Activity
updated a model 16 minutes ago: Hodfa71/normistral-11b-nb-delta-dpo-nosft
published a model 40 minutes ago: Hodfa71/llama-8b-nb-delta-dpo
published a model about 3 hours ago: Hodfa71/normistral-11b-nb-delta-dpo-nosft
Organizations
models 30
Hodfa71/normistral-11b-nb-delta-dpo-nosft
11B • Updated
Hodfa71/llama-8b-nb-delta-dpo
Updated
Hodfa71/normistral-11b-nb-saga-kl-sft-delta-dpo
Text Generation • Updated • 12
Hodfa71/normistral-11b-nb-saga-nosft-delta-dpo
Text Generation • Updated • 11
Hodfa71/llama-3.2-1b-da-saga-delta-dpo
Updated • 14
Hodfa71/llama-3.2-1b-is-saga-delta-dpo
Updated • 11
Hodfa71/viking-13b-nb-saga-kl-sft-delta-dpo
Updated • 7
Hodfa71/viking-13b-nb-saga-nosft-delta-dpo
Updated • 13
Hodfa71/gpt-sw3-1b3-nb-saga-kl-sft-delta-dpo
Text Generation • Updated • 9
Hodfa71/gpt-sw3-1b3-nb-saga-nosft-delta-dpo
Text Generation • Updated • 11
datasets 11
Hodfa71/normistral-11b-nb-saga-kl-sft-delta-dpo-pairs
Viewer • Updated • 8.87k • 11
Hodfa71/normistral-11b-nb-saga-nosft-delta-dpo-pairs
Viewer • Updated • 3.12k • 10
Hodfa71/gpt-sw3-1b3-nb-saga-delta-dpo-pairs
Viewer • Updated • 7.08k • 20
Hodfa71/normistral-7b-nb-saga-delta-dpo-pairs
Viewer • Updated • 9.13k • 19
Hodfa71/OmniAgentBench
Viewer • Updated • 30 • 11
Hodfa71/OmniAgentBench-Audio
Viewer • Updated • 30 • 53
Hodfa71/saga-da-delta-dpo-r1
Viewer • Updated • 7.41k • 22
Hodfa71/saga-da-delta-dpo-r2
Viewer • Updated • 7.31k • 26
Hodfa71/pstu-synthetic-secrets
Viewer • Updated • 175 • 31
Hodfa71/NER-German
Preview • Updated • 17