Gerolamo
How Well Do Vision-Language Models Understand Sequential Driving Scenes? A Sensitivity Study | Gerolamo