PeterMcCluskey comments on Community Polls on Alignment Controversies

PeterMcCluskey 18 Jun 2026 18:32 UTC
1 point
0 ∶ 0
Alignment to specific values is underrated in research relative to control
I’m unsure how broadly to interpret “specific values”. If it’s values such as democracy or equality, then both values and control are overrated.
- Miles Tidmarsh 19 Jun 2026 21:41 UTC
  1 point
  0 ∶ 0
  By specific values we mean any particular goal we want AIs to pursue besides deference to humans. So democracy and equality would both count, as would goals like harm reduction or utilitarianism.