The Easy Goal Inference Problem Is Still Hard

AI Safety Fundamentals: Alignment - A podcast by BlueDot Impact

Podcast artwork

One approach to the AI control problem goes like this:Observe what the user of the system says and does.Infer the user’s preferences.Try to make the world better according to the user’s preference, perhaps while working alongside the user and asking clarifying questions.This approach has the major advantage that we can begin empirical work today — we can actually build systems which observe user behavior, try to figure out what the user wants, and then help with that. There are many applicati...