reference
The paper "Visual Instruction Tuning" by Haotian Liu, Chunyuan Li, and colleagues, published as an arXiv preprint in 2023, introduces visual instruction tuning for large vision-language models.
Sources
- Detecting and Evaluating Medical Hallucinations in Large Vision ... arxiv.org via serper
Referenced by nodes (1)
- Large Vision-Language Models concept