C-045Value Alignment and AI EthicsConfidence: High

A matter of principle? AI alignment as the fair treatment of claims

Iason Gabriel & Geoff Keeling (2025)

Source link ↗Drill this reading Discuss with AI

One-Sentence Thesis

The authors propose an alternative account of AI alignment that focuses on fair processes, arguing that principles generated through these processes are the appropriate target for alignment.

Argument Outline

1Introduction to the normative challenge of AI alignment
2Critique of existing approaches to AI alignment (intent-alignment and HHH)
3Proposal of a novel approach to AI alignment based on fair processes
4Defense of the new approach and its advantages over existing approaches

Key Distinctions

Technical vs. normative aspects of AI alignment

Intent-alignment vs. HHH approaches to AI alignment

Fair process-based views vs. other approaches to AI alignment

Key Terms

AI alignment

The process of ensuring that AI systems act in accordance with human values and goals

Intent-alignment

An approach to AI alignment that focuses on aligning AI systems with human intentions

HHH (Helpful, Honest, Harmless)

An approach to AI alignment that focuses on designing AI systems to be helpful, honest, and harmless

Fair process-based views

An approach to AI alignment that focuses on generating principles through fair processes that treat different parties and their claims fairly

Flashcards

19 cards