reference
The paper 'Visual Instruction Tuning' by Haotian Liu, Chunyuan Li, and colleagues, published as an arXiv preprint in 2023, introduces visual instruction tuning for large vision-language models.
