Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
LAION eV
non-profit
AI & ML interests
open multi-modal foundation models and datasets for their creation; scaling laws, model evaluation; fully local, sovereign model deployment, personalized assistants and open local agentic systems
Recent Activity
View all activity
Organization Card
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 8 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 6 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 5 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 4
Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 8 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 6 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 5 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 4
models 708
laion/tis_smoke_0p6b
Updated
laion/Qwen-1.7B-Base-MixtureVitae-v1.5-finephrasePermissive-100BT
2B • Updated • 16
laion/Qwen-1.7B-Base-finephrasePermissive-100BT
2B • Updated • 10
laion/a3-rl-laion_exp_rpt_stack-bash-v3-70-8B
8B • Updated • 20
laion/a3-rl-laion_exp_rpt_codenet-python-v2
8B • Updated • 50
laion/a3-rl-laion_exp_rpt_methods2test-large-v2
8B • Updated • 26
laion/universal-audio-annotation-pipeline
Updated • 9.04k • 2
laion/a3-rl-laion_exp_rpt_stack-bash-v3
8B • Updated • 14
laion/Qwen-1.7B-Base-MixtureVitae-v1-100BT
2B • Updated • 1
laion/Qwen-1.7B-Base-MixtureVitae-v1.5-100BT
2B • Updated • 27
datasets 435
laion/dramabox-voice-acting-data-annotated
Viewer • Updated • 345k • 1.38k
laion/small-overlapping-speech-bench
Viewer • Updated • 100 • 46
laion/nemotron-gym-qa-abstention-v2
Viewer • Updated • 3.15k • 22
laion/nemotron-gym-math-v4
Viewer • Updated • 6.53k • 25
laion/nemotron-gym-multichallenge-vanilla-v2
Viewer • Updated • 1.05k • 23
laion/nemotron-gym-multichallenge-advanced-v2
Viewer • Updated • 1.07k • 24
laion/nemotron-gym-instruction-following-multiturnchat-v2
Viewer • Updated • 2.01k • 23
laion/nemotron-gym-cfbench-v2
Viewer • Updated • 1.11k • 22
laion/nemotron-gym-sysbench-v2
Viewer • Updated • 1.01k • 28
laion/nemotron-gym-inverse-ifeval-v2
Viewer • Updated • 1k • 23