Reward hacking

From RB Wiki
Revision as of 23:33, 20 January 2020 by Lê Nguyên Hoang (talk | contribs) (Created page with "Reward hacking occurs when an algorithm achieves its goal in an unexpected (and often undesired) manner. == Examples == List examples.")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Reward hacking occurs when an algorithm achieves its goal in an unexpected (and often undesired) manner.

Examples

List examples.