
[11] Visual Instruction Tuning (LLaVA: Large Language and Vision Assistant)
Paper Review/etc
[paper] https://arxiv.org/pdf/2304.08485
[Github] https://github.com/haotian-liu/LLaVA ([NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.)
Abstract
Problem with existing LLMs: they cannot take images as input, so processing vision information..