Chapter 14: Multimodal Learning (Vision–Language Models)