deepseek-ai/DeepSeek-V4-Flash Text Generation • 158B • Updated 2 days ago • 2.83M • • 1.46k
Sapiens Collection Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens • 63 items • Updated Mar 2 • 62
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training Paper • 2509.23661 • Published Sep 28, 2025 • 51
Running on Zero MCP Featured 1.61k Wan2.1 Fast 🎥 1.61k Animate a still image into a short video using a prompt
Running Featured 601 Image Arena Leaderboard 📊 601 Image Generation and Image Editing Arena & Leaderboard
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published Feb 14, 2025 • 57