Gerolamo
VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments