alignment tax

Etymology
First attested in a 2019 speech by computer scientist Paul Christiano (see quote), who attributed the idea to an AI researcher and writer.

Noun

 1. A cost to the capabilities of an artificial intelligence incurred by aligning it with human ethics and morality.