arxiv:2601.07767
Deqing Fu PRO
deqing
AI & ML interests
None yet
Recent Activity
updated
a model about 4 hours ago
deqing/llama-300M-v3-muon-original published
a model about 4 hours ago
deqing/llama-300M-v3-muon-original updated
a model about 6 hours ago
deqing/llama-300M-v3-original