Reward hacking
From RB Wiki
Reward hacking occurs when an algorithm achieves its goal in an unexpected (and often undesired) manner.
Examples
List examples.
Reward hacking occurs when an algorithm achieves its goal in an unexpected (and often undesired) manner.
List examples.