Video Understanding and Grounding with Qwen 2.5