The Hard Problem of AI Alignment: Value Forks in Moral Judgment
Markus Kneer & Juri Viehoff (2025)
One-Sentence Thesis
The type of agent, human or AI, influences moral judgments in complex trade-off situations, with participants preferring AI agents to prioritize fairness over utility maximization.
Argument Outline
- 1. Introduction to the Hard Problem of AI Alignment
- 2. Background on value alignment, moral dilemmas, and public reflective equilibrium
- 3. Discussion of empirical findings on agent-type value forks in moral judgment
- 4. Analysis of the implications of these findings for AI value alignment
Key Distinctions
Human vs. AI agents in moral decision-making
Top-down vs. bottom-up approaches to value alignment
Consequentialist vs. non-consequentialist moral theories
Key Terms
Value alignment
Agent-type value forks
Public reflective equilibrium
Related Questions
Which of the following do Markus Kneer & Juri Viehoff draw a contrast with in "The Hard Problem of AI Alignment: Value Forks in Moral Judgment"?
In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", Joshua Gert defines which of the following?
In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", value forks depend on which of the following?
In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", Consequentialism criticizes which of the following?
In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", experimental psychologists and philosophers studied which of the following?
What approach to value alignment involves explicitly programming AI systems to follow pre-determined norms and values?