reference
The paper 'Alphazero-like tree-search can guide large language model decoding and training' proposes applying AlphaZero-style tree search, guided by a learned value function, to improve both the decoding and the training of large language models.
