Gerolamo
HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models | Gerolamo