ma1664
AI & ML interests
None yet
Recent Activity
- updated a collection about 2 months ago: Papers
- updated a collection 2 months ago: Models
- updated a collection 2 months ago: Models
Organizations
None yet
Spaces
- FastVLM WebGPU
  Running • Featured • 🍎 446
  Real-time video captioning powered by FastVLM
- Qwen Image Edit Camera Control
  Running on Zero • MCP • Featured • 🎬 2.21k
  Fast 4 step inference with Qwen Image Edit 2509
- Depth Anything 3
  Running on Zero • Agents • Featured • 🏢 420
  Generate depth maps from your photos
Papers
- PersonaLive! Expressive Portrait Image Animation for Live Streaming
  Paper • 2512.11253 • Published • 41
- NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
  Paper • 2601.00393 • Published • 133
- Agent READMEs: An Empirical Study of Context Files for Agentic Coding
  Paper • 2511.12884 • Published • 28
- SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
  Paper • 2503.11576 • Published • 157
models 0
None public yet
datasets 0
None public yet