C-044Value Alignment and AI EthicsConfidence: Medium

The Hard Problem of AI Alignment: Value Forks in Moral Judgment

Markus Kneer & Juri Viehoff (2025)

One-Sentence Thesis

The type of agent, human or AI, influences moral judgments in complex trade-off situations, with participants preferring AI agents to prioritize fairness over utility maximization.

Argument Outline

  1. 1Introduction to the Hard Problem of AI Alignment
  2. 2Background on value alignment, moral dilemmas, and public reflective equilibrium
  3. 3Discussion of empirical findings on agent-type value forks in moral judgment
  4. 4Analysis of the implications of these findings for AI value alignment

Key Distinctions

Human vs. AI agents in moral decision-making
Top-down vs. bottom-up approaches to value alignment
Consequentialist vs. non-consequentialist moral theories

Key Terms

Value alignment
The process of ensuring that AI systems' actions and decisions align with human values
Agent-type value forks
Differences in moral judgments based on the type of agent, human or AI, making the decision
Public reflective equilibrium
A method of value alignment that combines top-down and bottom-up approaches to achieve a more secure justificatory foundation

Flashcards

17 cards

Related Questions

4

Which of the following does Markus Kneer & Juri Viehoff contrasts with in "The Hard Problem of AI Alignment: Value Forks in Moral Judgment"?

4

In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", Joshua Gert defines which of the following?

3

In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", Value Forks depends on which of the following?

3

In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", Consequentialism criticizes which of the following?

3

In Markus Kneer & Juri Viehoff's "The Hard Problem of AI Alignment: Value Forks in Moral Judgment", experimental psychologists and philosophers studied which of the following?

4

What approach to value alignment involves explicitly programming AI systems to follow pre-determined norms and values?