Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
12
46
58
Tong Zhu
Spico
Follow
zzzhr97's profile picture
g-ronimo's profile picture
ych133's profile picture
25 followers
·
74 following
https://Spico197.github.io
TongZhu197
Spico197
AI & ML interests
Information Extraction, Mixture-of-Experts, LLM
Recent Activity
upvoted
an
article
14 days ago
Your MoE Model Does Not Have to Select Fixed Number of Experts
published
an
article
14 days ago
Your MoE Model Does Not Have to Select Fixed Number of Experts
upvoted
an
article
29 days ago
Transformers v5: Simple model definitions powering the AI ecosystem
View all activity
Organizations
Spico
's models
7
Sort:Â Recently updated
Spico/LLaMA-MoE-v1-2_8-UniformSFT
Text Generation
•
7B
•
Updated
Feb 28, 2024
•
3
Spico/LLaMA-MoE-v1-2_8-DynamicSFT
Text Generation
•
7B
•
Updated
Feb 28, 2024
Spico/sheared-llama-2.7b-deita-6k-sft
Text Generation
•
3B
•
Updated
Feb 25, 2024
•
1
Spico/internlm2-7b-hf-llama
Text Generation
•
Updated
Feb 23, 2024
•
1
Spico/mirror-chinese-mrcqa-alpha
Updated
Dec 4, 2023
Spico/Humback-Myx
Text Generation
•
Updated
Aug 19, 2023
•
8
•
3
Spico/Humback-M0
Text Generation
•
Updated
Aug 18, 2023
•
4
•
3