Search
Now showing items 1-1 of 1
Unveiling Bias in Multimodal Models
(2025)
Vision Language Models (VLMs) have significantly advanced multimodal understanding by effectively combining visual and textual modalities for various applications, including image captioning, visual question answering, and ...

