Vision-Language-Action models for end-to-end robotic control. SmolVLA, RDT2-FM action generation.
AI & ML interests
None defined yet.
Recent Activity
View all activity
INT4 vision-language models for robotic scene understanding. Qwen2.5-VL for visual QA and grounding.
INT8 quantized vision models for real-time robotic perception. SAM2, DINOv2, CLIP, SigLIP, Depth Anything.
-
robotflowlabs/clip-vit-large-patch14-int8
Zero-Shot Image Classification • Updated • 5 -
robotflowlabs/sam2.1-hiera-large-int8
Image Segmentation • 0.2B • Updated • 5 -
robotflowlabs/sam2.1-hiera-small-int8
Image Segmentation • 38.5M • Updated • 4 -
robotflowlabs/sam2.1-hiera-tiny-int8
Image Segmentation • 31.4M • Updated • 14
Vision-Language-Action models for end-to-end robotic control. SmolVLA, RDT2-FM action generation.
INT4 vision-language models for robotic scene understanding. Qwen2.5-VL for visual QA and grounding.
INT4 quantized language models for robotic reasoning. Qwen2.5, SmolLM2 optimized for edge deployment.
INT8 quantized vision models for real-time robotic perception. SAM2, DINOv2, CLIP, SigLIP, Depth Anything.
-
robotflowlabs/clip-vit-large-patch14-int8
Zero-Shot Image Classification • Updated • 5 -
robotflowlabs/sam2.1-hiera-large-int8
Image Segmentation • 0.2B • Updated • 5 -
robotflowlabs/sam2.1-hiera-small-int8
Image Segmentation • 38.5M • Updated • 4 -
robotflowlabs/sam2.1-hiera-tiny-int8
Image Segmentation • 31.4M • Updated • 14