Checkpoints for the paper "Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space".
AI & ML interests
We focus on Natural Language Processing and Multimodal Learning, exploring generative AI across different modalities.
Models (12)
ModalityDance/IVTLR_QWEN_SQA
ModalityDance/IVTLR_CHAMELEON_M3COT
ModalityDance/IVTLR_QWEN_M3COT
ModalityDance/IVTLR_CHAMELEON_SQA
ModalityDance/AR-Omni-Pretrain-v0.1 • Any-to-Any
ModalityDance/AR-Omni-Chat-v0.1 • Any-to-Any
ModalityDance/Omni-R1-Zero • Any-to-Any • 7B • 27
ModalityDance/Omni-R1 • Any-to-Any • 7B • 1.5k
ModalityDance/latent-tts-coconut • Text Generation • 0.1B • 20
ModalityDance/latent-tts-colar • Text Generation • 1B • 21