reference
The paper 'Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models' by Junnan Li, Dongxu Li, Silvio Savarese, and Steven Hoi, published as an arXiv preprint in 2023, presents a method for bootstrapping language-image pre-training using frozen image encoders and large language models.
Authors
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (1)
- Large Language Models concept