-
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
Paper • 2407.15841 • Published • 39 -
Stable Audio Open
Paper • 2407.14358 • Published • 26 -
PlacidDreamer: Advancing Harmony in Text-to-3D Generation
Paper • 2407.13976 • Published • 5 -
Efficient Audio Captioning with Encoder-Level Knowledge Distillation
Paper • 2407.14329 • Published • 5
Joe
pushkin05
·
AI & ML interests
None yet
Recent Activity
liked
a dataset about 12 hours ago
Voxel51/gaussian_splatting published
a model 2 months ago
pushkin05/trm liked
a model 5 months ago
arcprize/trm_arc_prize_verification