procedure
Top-p (nucleus) sampling in large language models selects the next token by sampling from the smallest set of highest-probability vocabulary tokens whose cumulative probability reaches or exceeds a threshold p, with the probabilities of that set renormalized to sum to 1.
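
The procedure can be illustrated with a minimal sketch in plain Python. It assumes the model's output has already been converted to a list of next-token probabilities; the function names `top_p_filter` and `sample_top_p` are hypothetical and chosen only for this example, and the cutoff uses "reaches or exceeds p", which is one common convention rather than the only possible one.

```python
import random


def top_p_filter(probs, p):
    """Return {token_index: renormalized probability} for the nucleus.

    `probs` is assumed to be a list of next-token probabilities that sum to
    roughly 1; `p` is the cumulative-probability threshold in (0, 1].
    """
    if not 0.0 < p <= 1.0:
        raise ValueError("p must be in (0, 1]")

    # Rank token indices from most to least probable.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Add tokens until the cumulative probability reaches the threshold.
    nucleus, cumulative = [], 0.0
    for i in order:
        nucleus.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break

    # Renormalize so the kept probabilities sum to 1.
    total = sum(probs[i] for i in nucleus)
    return {i: probs[i] / total for i in nucleus}


def sample_top_p(probs, p, rng=random):
    """Draw one token index from the renormalized nucleus."""
    nucleus = top_p_filter(probs, p)
    tokens = list(nucleus)
    weights = [nucleus[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]
```

For example, with `probs = [0.5, 0.3, 0.15, 0.05]` and `p = 0.75`, the nucleus contains only tokens 0 and 1, so `sample_top_p(probs, 0.75)` can never return token 2 or 3. A smaller p narrows the candidate set and makes output more predictable; p = 1 keeps the whole vocabulary.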
