Reward hacking describes a (unwanted) strategy or (unwanted) behaviour of AI algorithms for achieving goals that lie outside the rules of a system. For example, an AI for the game TETRIS finds out that it could simply interrupt the game forever, so that it can never lose. Practical examples (that made it into the media) are two AI financial systems that predicted a rapid decline in stock market values and tried to close markets autonomously for an indefinite period of time.

The (highly entertaining) book "The Fear-Index" (year of publication: 2011) by the bestselling author Robert Harris ultimately also revolves around a scenario of Reward Hacking.

Author

Sebastian Zang has cultivated a distinguished career in the IT industry, leading a wide range of software initiatives with a strong emphasis on automation and corporate growth. In his current role as Vice President Partners & Alliances at Beta Systems Software AG, he draws on his extensive expertise to spearhead global technological innovation. A graduate of Universität Passau, Sebastian brings a wealth of international experience, having worked across diverse markets and industries. In addition to his technical acumen, he is widely recognized for his thought leadership in areas such as automation, artificial intelligence, and business strategy.