Paper

Unpredictability of AI: On the Impossibility of Accurately Predicting All Actions of a Smarter Agent

The young field of AI Safety is still in the process of identifying its challenges and limitations. In this paper, we formally describe one such impossibility result, namely Unpredictability of AI. We prove that it is impossible to precisely and consistently predict what specific actions a smarter-than-human intelligent system will take to achieve its objectives, even if we know the terminal goals of the system. In conclusion, the impact of Unpredictability on AI Safety is discussed.

Journal of Artificial Intelligence and Consciousness

Published 2020-03-01

Authors: Roman V. Yampolskiy

Topics

Agents General AI Safety

Relevant entities

People

author

Roman V. Yampolskiy

AI Safety Researcher
