Ethical AI Departures

Jan Leike

Co-Lead, Superalignment Team · OpenAI · 2024

Said 'safety culture and processes have taken a backseat to shiny products' at OpenAI. Resigned the day after Sutskever. Joined Anthropic to continue alignment work.

Leike co-led OpenAI's Superalignment team alongside Ilya Sutskever, working on the problem of ensuring that AI systems vastly more intelligent than humans would still be controllable. He resigned the day after Sutskever, posting publicly that safety culture and processes had taken a backseat to shiny products. His candor was unusual — most departing researchers stay quiet — and his immediate move to Anthropic underscored his belief that meaningful alignment work required a different institutional environment.

Safety DeprioritizationAlignment Research GapsTeam Dissolution

Sources

Key Publications

← Back to all profiles