reference
Yang Liu, Yuanshun Yao, Jean-Francois Ton, Xiaoying Zhang, Ruocheng Guo, Hao Cheng, Yegor Klochkov, Muhammad Faaiz Taufiq, and Hang Li authored 'Trustworthy llms: a survey and guideline for evaluating large language models’ alignment', published as an arXiv preprint in 2023.

Authors

Sources

Referenced by nodes (2)