Back to Philosophical Concept
Philosophical Concept

AI Alignment

Problem of ensuring AI systems pursue intended goals and values. Orthogonality thesis: intelligence and goals are independent. Instrumental convergence: AIs may pursue harmful subgoals. Control problem: how to maintain control over superior intelligence. Value learning: AIs should learn human values. Corrigibility: AI should accept shutdown and correction. Existential risk if misaligned superintelligence emerges. Technical and philosophical challenge. Urgency debated.

Your Reaction

🧠 1
❤️ 0
🔥 0
🧩 0
🕳️ 0

Explore Related Domains

Discover connections across different types of entities

Person

Eliezer Yudkowsky

American AI researcher and writer who founded rationalist community and works on AI alignment. Yudkowsky's …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Organization

Anthropic

AI safety and research company founded by former OpenAI leadership including Dario and Daniela Amodei. …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Organization

DeepMind

British AI research lab acquired by Google. Founded by Demis Hassabis, Shane Legg, and Mustafa …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Organization

OpenAI

AI research laboratory that created GPT models and ChatGPT. Founded by Sam Altman, Elon Musk, …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Film

Ex Machina

Garland's AI chamber piece follows a programmer testing an android's consciousness. Through the Turing test, …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Game

Factorio

Wube's factory automation game has player crash on alien planet and build industrial complex. Optimizing …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Game

The Talos Principle

Croteam's puzzle game places AI in philosophical trials by mysterious administrator. Portal-like puzzles interweave with …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Person

Demis Hassabis

British neuroscientist and AI researcher who co-founded DeepMind, advancing AI through games and protein folding. …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Book

The Singularity Is Near

Kurzweil's futurist manifesto predicts that exponential technological growth will lead to the Singularity—a point where …

🧠 0
❤️ 0
🔥 0
🧩 0
🕳️ 0
Book

Foundation

Asimov's science fiction masterpiece follows Hari Seldon's use of psychohistory to predict and shape the …

🧠 1
❤️ 0
🔥 0
🧩 0
🕳️ 0