Fine Tuning SmolVLM for Human Alignment Using Direct Preference Optimization