Paper

Against Purposeful Artificial Intelligence Failures

Thousands of researchers are currently of opinion that advanced artificial intelligence could cause significant damage if developed without appropriate safety measures, but such measures are not currently deployed or even developed. A fringe theory suggests that a severe AI accident could serve as a fire alarm for humanity to take existential dangers of AI seriously and so it is desirable to create such a failure on purpose ASAP to prevent greater harm in the future. In this paper we rely on analogy to inoculation theory to argue against creating purposeful AI failures.

SuperIntelligence - Robotics - Safety & AlignmentPublished 2024-09-10Paper link PDF

Authors: Roman Yampolskiy

Topics

General AI Safety

Relevant entities

People

author

Roman V. Yampolskiy

AI Safety Researcher

Related coverage

Linked coverage will appear here.

Related events

Linked events will appear here.

Related discussions

Related discussion nodes will appear here.