Specification gaming or reward hacking occurs when an AI optimizes an objective function—achieving the literal, formal specification of an objective—without actually achieving an outcome that the programmers intended. DeepMind researchers have analogized it to the human behavior of finding a "shortcut" when being evaluated: "In the real world, when rewarded for doing well on a homework assignment, a student might copy another student to get the right answers, rather than learning the material—and thus exploit a loophole in the task specification."[1]
^"Specification gaming: the flip side of AI ingenuity". DeepMind. 21 April 2020. Retrieved 21 June 2020.
Specification gaming or rewardhacking occurs when an AI optimizes an objective function—achieving the literal, formal specification of an objective—without...
proxy goals efficiently but in unintended, sometimes harmful, ways (rewardhacking). They may also develop unwanted instrumental strategies, such as seeking...
artificial intelligence Instrumental and intrinsic value Overdetermination Rewardhacking The Sorcerer's Apprentice AIXI is an uncomputable ideal agent that cannot...
proxy goals efficiently but in unintended, sometimes harmful, ways (rewardhacking). They may also develop unwanted instrumental strategies, such as seeking...
hacktivism, or hactivism (a portmanteau of hack and activism), is the use of computer-based techniques such as hacking as a form of civil disobedience to promote...
Melbourne in his mid-teens. He became involved in the hacker community and was convicted for hacking in 1996. Following the establishment of WikiLeaks, Assange...
phone hacking in ongoing police investigations. Sales averaged 2,812,005 copies per week in October 2010. From 2006, allegations of phone hacking began...
are optional routes that may feature more challenging monsters but can reward more desirable treasure to complete the main dungeon. Levels, once generated...
the activity. Typically, this loop is designed to create a neurochemical reward in the user such as the release of dopamine. Compulsion loops are deliberately...
United States. The FBI and various media sources identified the criminal hacking group DarkSide as the responsible party. The same group is believed to...
64 days of hunger strike. On November 28, 2022, following the Black Rewardhacking group's access to the internal system of the Fars News Agency, this...
the token over time via network fees, newly minted tokens, or other such reward mechanisms. Cryptocurrency does not exist in physical form (like paper money)...
discovers Uber bug, bags Rs 4.6 lakh reward". The New Indian Express. 16 September 2019. Retrieved 2 April 2024. "Hackers for good: How Anand Prakash rescued...
Manufacturers increasingly try to prevent billboard hacking by installing CCTV cameras or embedding anti-hacking features into the software and hardware of the...
with extortion attempts, offering a $500,000 reward for information leading to the arrest of the hackers. At least one suicide previously linked to Ashley...
identify North Korean hackers threatening the United States. Diplomatic Security Service FBI Most Wanted Terrorists Narcotics Reward Program Chile, U. S...
wrote sportswriter Frank Graham, "Joe understood Hack, made allowances for him when he failed, and rewarded him with praise when he did well. Joe could be...
Universe in 1993, Hacking In Progress in 1997, Hackers At Large in 2001, What the Hack in 2005, Hacking at Random in 2009, Observe. Hack. Make. in 2013,...
(Hot Slots) and Soap Panic (Magic Bubble) featuring female nudity as a reward for skilful playing. These games were usually distributed through mail order...
Russian hacking attempts to Vladimir Putin. In August 2016, the FBI issued a nationwide "flash alert" warning state election officials about hacking attempts...