“Towards more cooperative AI safety strategies” by richard_ngo
EA Forum Podcast (All audio) - A podcast by EA Forum Team
Categories:
This post is written in a spirit of constructive criticism. It's phrased fairly abstractly, in part because it's a sensitive topic, but I welcome critiques and comments below. The post is structured in terms of three claims about the strategic dynamics of AI safety efforts; my main intention is to raise awareness of these dynamics, rather than advocate for any particular response to them. Claim 1: The AI safety community is structurally power-seeking. By “structurally power-seeking” I mean: tends to take actions which significantly increase its power. This does not imply that people in the AI safety community are selfish or power-hungry; or even that these strategies are misguided. Taking the right actions for the right reasons often involves accumulating some amount of power. However, from the perspective of an external observer, it's difficult to know how much to trust stated motivations, especially when they often lead to the [...] --- First published: July 16th, 2024 Source: https://forum.effectivealtruism.org/posts/wcKD8dJgC55cLijN3/towards-more-cooperative-ai-safety-strategies --- Narrated by TYPE III AUDIO.