Resources for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
Xin Lai
xinlai
AI & ML interests
Multimodal LLM, LLM Reasoning, Point Cloud Segmentation, Image Segmentation
Recent Activity
upvoted a paper 2 days ago
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation
upvoted a paper about 1 month ago
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
upvoted a paper 2 months ago
Efficient Reasoning with Balanced Thinking
Organizations
None yet