reference
The paper 'Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models' by Junnan Li, Dongxu Li, Silvio Savarese, and Steven Hoi, published as an arXiv preprint in 2023, presents a method for bootstrapping language-image pre-training using frozen image encoders and large language models.

Authors

Sources

Referenced by nodes (1)