reference
The paper 'Jamba: a hybrid transformer-mamba language model' is available as arXiv preprint arXiv:2403.19887.
Authors
Sources
- A Survey on the Theory and Mechanism of Large Language Models arxiv.org via serper
Referenced by nodes (2)
- Language Model concept
- Transformer concept