-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 81 -
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Paper • 2408.02657 • Published • 35 -
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
Paper • 2508.10711 • Published • 145 -
Qwen3-Omni Technical Report
Paper • 2509.17765 • Published • 147
Charles Cai
charlescai2016
AI & ML interests
None yet
Recent Activity
liked
a model
about 15 hours ago
nvidia/Cosmos-1.0-Tokenizer-CV8x8x8
liked
a model
4 days ago
Qwen/Qwen3-VL-30B-A3B-Thinking
liked
a model
12 days ago
meituan-longcat/LongCat-Flash-Lite