claim
Kang et al. (2024) incorporated a module into Decision Transformers that enables Large Language Models to retain and process short-term information, drawing on the working memory theory proposed by Baddeley & Hitch (1974b).

Authors

Sources

Referenced by nodes (1)