arxiv:2502.07780
Tang
Shengkun
AI & ML interests
None yet
Recent Activity
published a dataset 1 day ago
Shengkun/chemistry_dataset
upvoted a paper 10 days ago
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training
submitted a paper 10 days ago
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training
Organizations
None yet