Navigated to Introduction to AI Control

Introduction to AI Control

September 18
10 mins

Episode Description

By Sarah Hastings-Woodhouse

AI Control is a research agenda that aims to prevent misaligned AI systems from causing harm. It is different from AI alignment, which aims to ensure that systems act in the best interests of their users. Put simply, aligned AIs do not want to harm humans, whereas controlled AIs can't harm humans, even if they want to.

Source:

https://bluedot.org/blog/ai-control

A podcast by BlueDot Impact.

Learn more on the AI Safety Fundamentals website.

See all episodes

Never lose your place, on any device

Create a free account to sync, back up, and get personal recommendations.