-
Visual Representation Alignment for Multimodal Large Language Models
Paper • 2509.07979 • Published • 84 -
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Paper • 2509.22638 • Published • 70 -
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Paper • 2510.05034 • Published • 51 -
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
Paper • 2601.10332 • Published • 32
Jeffrey Van de zande
Sexhuis
·
AI & ML interests
None yet
Recent Activity
updated a collection 12 days ago
X1 updated a collection 4 months ago
X1 new activity 7 months ago
openai/gpt-oss-safeguard-20b:streamingOrganizations
None yet