AI Alignment Basics

Understand the alignment problem, reinforcement learning from human feedback (RLHF), and what safe AI development means.

After this skill you can…

  • Explain the alignment problem
  • Describe RLHF and Constitutional AI
  • Identify common failure modes in deployed LLMs

Learn this skill (2 videos)

We Were Right! Real Inner Misalignment
Robert Miles AI Safety · advanced
Are We Building Superintelligence Backwards?
ML Street Talk · intermediate