Object Detection and Visual Grounding with Qwen 2.5