Gerolamo
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges | Gerolamo