The Rise of Multimodal LLMs and Efficient Serving with vLLM