formula
The probability of an output sequence y given an input prompt x in a Language Model is defined by the product of conditional probabilities of each token y_i given the input x and all preceding tokens y_<i, expressed as P(y|x) = product of P(y_i | x, y_<i).

Authors

Sources

Referenced by nodes (1)