
Zhicheng Wang

Dicer

https://blog.dicer.fun

Dicer-Zz

AI & ML interests

NLP

Recent Activity

liked a model 15 days ago

deepseek-ai/DeepSeek-V4-Pro

upvoted an article about 2 months ago

Introducing smolagents: simple agents that write actions in code.

liked a model about 2 months ago

Qwen/Qwen3-VL-Embedding-8B


Organizations

liked a model 15 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 3 days ago • 1.17M • 3.77k

upvoted an article about 2 months ago

Introducing smolagents: simple agents that write actions in code.

Article • Dec 31, 2024 • 1.19k

liked 2 models about 2 months ago

Qwen/Qwen3-VL-Embedding-8B

Qwen/Qwen3-VL-Embedding-2B

liked a dataset 2 months ago

cais/hle

Benchmark • Updated Jan 20 • 2.5k • 48.3k • 795

liked a model 3 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26, 2025 • 18.6M • 1.23k

liked a model 4 months ago

Langboat/mengzi-t5-base

0.2B • Updated May 8, 2023 • 7.24k • 60

liked a model 9 months ago

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated 19 days ago • 5.82M • 1.01k

liked a model 11 months ago

thenlper/gte-large-zh

updated a model about 1 year ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated Feb 25, 2025 • 20

published a model about 1 year ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated Feb 25, 2025 • 20

updated a model about 1 year ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 25, 2025

published a model about 1 year ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 25, 2025

upvoted 2 articles about 1 year ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 291

Vision Language Models Explained

Article • Apr 11, 2024 • 531

liked 5 datasets over 1 year ago
