Geyang's picture

Geyang

geyang627

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Safe and Scalable Web Agent Learning via Recreated Websites

upvoted an article 2 months ago

Deriving the PPO Loss from First Principles

upvoted an article 2 months ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

View all activity

Organizations

Collections 1

Papers 1

arxiv:2311.04072

models 12

geyang627/care-arabic-qwen2.5-7b

8B • Updated Jun 30, 2025 • 8 • 2

geyang627/care-arabic-gemma2-9b

9B • Updated Jun 30, 2025 • 2 • 2

geyang627/care-arabic-llama3.1-8b

8B • Updated Jun 30, 2025 • 5 • 1

geyang627/care-arabic-mistral-7b

7B • Updated Jun 30, 2025 • 3 • 1

geyang627/care-japanese-qwen2.5-7b

8B • Updated Jun 28, 2025 • 1

geyang627/care-japanese-mistral-7b

7B • Updated Jun 28, 2025 • 2 • 1

geyang627/care-japanese-llama3.1-8b

8B • Updated Jun 28, 2025 • 2 • 1

geyang627/care-japanese-gemma2-9b

9B • Updated Jun 28, 2025 • 4 • 1

geyang627/care-chinese-mistral-7b

7B • Updated Apr 8, 2025 • 2 • 1

geyang627/care-chinese-llama3.1-8b

8B • Updated Apr 8, 2025 • 6 • 1

datasets 3

geyang627/CARE

Viewer • Updated Jun 28, 2025 • 28.1k • 51 • 3

geyang627/CARE-eval

Viewer • Updated Jun 28, 2025 • 450 • 9 • 2

geyang627/Erya

Updated Jul 21, 2023 • 6