Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Amarjyoti's picture

6 2

Amarjyoti

amar-bach

·

AI & ML interests

None yet

Organizations

amar-bach 's collections 5

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 105

The Art of Efficient Reasoning: Data, Reward, and Optimization

Paper • 2602.20945 • Published Feb 24 • 7
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 53
Efficient Reinforcement Learning with Semantic and Token Entropy for LLM Reasoning

Paper • 2512.04359 • Published Dec 4, 2025
How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

VLANeXt: Recipes for Building Strong VLA Models

Paper • 2602.18532 • Published Feb 20 • 52

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1, 2025 • 35

PyVision-RL: Forging Open Agentic Vision Models via RL

Paper • 2602.20739 • Published Feb 24 • 31

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 105

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1, 2025 • 35

The Art of Efficient Reasoning: Data, Reward, and Optimization

Paper • 2602.20945 • Published Feb 24 • 7
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 53
Efficient Reinforcement Learning with Semantic and Token Entropy for LLM Reasoning

Paper • 2512.04359 • Published Dec 4, 2025
How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

PyVision-RL: Forging Open Agentic Vision Models via RL

Paper • 2602.20739 • Published Feb 24 • 31

VLANeXt: Recipes for Building Strong VLA Models

Paper • 2602.18532 • Published Feb 20 • 52

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs