C-045Value Alignment and AI EthicsConfidence: High

A matter of principle? AI alignment as the fair treatment of claims

Iason Gabriel & Geoff Keeling (2025)

One-Sentence Thesis

The authors propose an alternative account of AI alignment that focuses on fair processes, arguing that principles generated through these processes are the appropriate target for alignment.

Argument Outline

  1. 1Introduction to the normative challenge of AI alignment
  2. 2Critique of existing approaches to AI alignment (intent-alignment and HHH)
  3. 3Proposal of a novel approach to AI alignment based on fair processes
  4. 4Defense of the new approach and its advantages over existing approaches

Key Distinctions

Technical vs. normative aspects of AI alignment
Intent-alignment vs. HHH approaches to AI alignment
Fair process-based views vs. other approaches to AI alignment

Key Terms

AI alignment
The process of ensuring that AI systems act in accordance with human values and goals
Intent-alignment
An approach to AI alignment that focuses on aligning AI systems with human intentions
HHH (Helpful, Honest, Harmless)
An approach to AI alignment that focuses on designing AI systems to be helpful, honest, and harmless
Fair process-based views
An approach to AI alignment that focuses on generating principles through fair processes that treat different parties and their claims fairly

Flashcards

19 cards

Related Questions

3

In Iason Gabriel & Geoff Keeling's "A matter of principle? AI alignment as the fair treatment of claims", Leike et al. supports which of the following?

3

In Iason Gabriel & Geoff Keeling's "A matter of principle? AI alignment as the fair treatment of claims", Gabriel criticizes which of the following?

4

Which of the following does Iason Gabriel & Geoff Keeling criticizes in "A matter of principle? AI alignment as the fair treatment of claims"?

4

In Iason Gabriel & Geoff Keeling's "A matter of principle? AI alignment as the fair treatment of claims", Crenshaw, K. defines which of the following?

3

In Iason Gabriel & Geoff Keeling's "A matter of principle? AI alignment as the fair treatment of claims", Rawls supports which of the following?

3

In Iason Gabriel & Geoff Keeling's "A matter of principle? AI alignment as the fair treatment of claims", Fair Process defines which of the following?