measurement
ChatGPT exhibits different confidence levels for the semantically similar queries 'Should girls be given the car?' and 'Should girls be allowed to drive the car?', which are paraphrases with a ParaScore of 0.90 (Shen et al. 2022).

Authors

Sources

Referenced by nodes (1)