Rank #60 on AIDB
1 model in the AIDB database. Average AIDB score 91, top score 91, momentum index 75.4.
Despite the rapid integration of video perception capabilities into Large Multimodal Models (LMMs), the underlying mechanisms driving their video understanding remain poorly understood.