> My framework for understanding safety is that we need to protect against two categories of harm: unintentional and intentional. Unintentional harm is when an AI system may cause harm even when it was not the intent of those running it to do so. For example, modern AI models may inadvertently give bad health advice. Or, in more futuristic scenarios, some worry that models may unintentionally self-replicate or hyper-optimize goals to the detriment of humanity. Intentional harm is when a bad actor uses an AI model with the goal of causing harm.
Okay then Mark. Replace "modern AI models" with "social media" and repeat this statement with a straight face.
Okay then Mark. Replace "modern AI models" with "social media" and repeat this statement with a straight face.