claim
Instructability in AI safety refers to the assurance that an AI system understands and complies with user preferences, policies, and moral beliefs.

Authors

Sources

Referenced by nodes (1)