claim
Scalable oversight, as defined by Bowman et al. (2022), is a technical challenge that seeks to enable relatively weak human supervisors to reliably evaluate and align AI systems that are far stronger and more complex than themselves.

Authors

Sources

Referenced by nodes (1)