NovaSky

NovaSkyAI

upvoted a paper 3 months ago

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published Jan 20 • 23

upvoted a collection about 1 year ago

SkyRL-Agent-v0

6 items • Updated May 7, 2025 • 5

upvoted 3 papers about 1 year ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20, 2025 • 63

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12, 2025 • 59

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11, 2025 • 40