Om AI Lab

company

https://github.com/om-ai-lab

AI & ML interests

Multimodal AI, VLM, VLA, VAM, etc

Recent Activity

Heting updated a model about 11 hours ago

omlab/OmTrackVLA-0.6B

P3ngLiu updated a collection about 11 hours ago

OmDet-Turbo-Models

P3ngLiu updated a model about 13 hours ago

omlab/VLM-FO1-3B-v01

View all activity

Papers

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

View all Papers

Articles

Trials, Errors, and Breakthroughs: Our Rocky Road to OVD SOTA with Reinforcement Learning

Improving Object Detection through Reinforcement Learning with VLM-R1

omlab 's models 9

omlab/OmTrackVLA-0.6B

Other • 0.6B • Updated about 11 hours ago • 129 • 4

omlab/VLM-FO1-3B-v01

Object Detection • 4B • Updated about 13 hours ago • 263 • 16

omlab/VLM-R1-Qwen2.5VL-3B-OVD-0321

Image-Text-to-Text • 4B • Updated Jul 18, 2025 • 133 • 24

omlab/ImageRAG

Updated Jul 10, 2025 • 1

omlab/VLM-R1-Qwen2.5VL-3B-Math-0305

Visual Question Answering • 4B • Updated Apr 14, 2025 • 29 • 8

omlab/Qwen2.5VL-3B-VLM-R1-REC-500steps

Zero-Shot Object Detection • 4B • Updated Apr 14, 2025 • 56 • 25

omlab/omdet-turbo-swin-tiny-hf

Zero-Shot Object Detection • 0.1B • Updated Dec 18, 2024 • 11.5k • 41

omlab/omchat-v2.0-13B-single-beta_hf

13B • Updated Sep 19, 2024 • 8 • 5

omlab/OmDet-Turbo_tiny_SWIN_T

Zero-Shot Object Detection • Updated Jun 13, 2024 • 8