view post Post 12805 deepseek-ai/DeepSeek-OCR is out! 🔥 my take ⤵️ > pretty insane it can parse and re-render charts in HTML> it uses CLIP and SAM features concatenated, so better grounding> very efficient per vision tokens/performance ratio> covers 100 languages See translation
Weekly Releases (Jun 05, 2026) Comfy-Org/Ideogram-4 Updated 3 days ago • 77 jdopensource/JoyAI-Echo Text-to-Video • Updated about 18 hours ago • 4.05k • 89 litert-community/gemma-4-12B-it-litert-lm Updated 5 days ago • 18.3k • 22 google/gemma-4-12B-it-qat-q4_0-unquantized Any-to-Any • 12B • Updated 3 days ago • 4.52k • 31
Weekly Releases (May 29, 2026) Comfy-Org/PixelDiT Updated 6 days ago • 87.4k • 83 spiritbuun/buun-Qwen3.6-chat_template Updated 11 days ago • 40 avaturn-live/avtr-1 Image-to-Video • Updated 8 days ago • 791 • 30 Kwai-Keye/Keye-VL-2.0-30B-A3B Image-Text-to-Text • 31B • Updated 10 days ago • 3.11k • 109
Weekly Releases (Jun 05, 2026) Comfy-Org/Ideogram-4 Updated 3 days ago • 77 jdopensource/JoyAI-Echo Text-to-Video • Updated about 18 hours ago • 4.05k • 89 litert-community/gemma-4-12B-it-litert-lm Updated 5 days ago • 18.3k • 22 google/gemma-4-12B-it-qat-q4_0-unquantized Any-to-Any • 12B • Updated 3 days ago • 4.52k • 31
Weekly Releases (May 29, 2026) Comfy-Org/PixelDiT Updated 6 days ago • 87.4k • 83 spiritbuun/buun-Qwen3.6-chat_template Updated 11 days ago • 40 avaturn-live/avtr-1 Image-to-Video • Updated 8 days ago • 791 • 30 Kwai-Keye/Keye-VL-2.0-30B-A3B Image-Text-to-Text • 31B • Updated 10 days ago • 3.11k • 109