LLaVA Visual Instruct 150K Dataset Card

Dataset details

Dataset type: LLaVA Visual Instruct 150K is a set of GPT-generated multimodal instruction-following data. It is constructed for visual instruction tuning and for building large multimodal towards GPT-4 vision/language capability.

Dataset date: LLaVA Visual Instruct 150K was collected in April 2023, by prompting GPT-4-0314 API.

Paper or resources for more information: https://llava-vl.github.io/

License: Attribution-NonCommercial 4.0 International It should abide by the policy of OpenAI: https://openai.com/policies/terms-of-use

Where to send questions or comments about the model: https://github.com/haotian-liu/LLaVA/issues

Intended use

Primary intended uses: The primary use of LLaVA is research on large multimodal models and chatbots.

Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.

作者:

liuhaotian

数据集大小:

558.74 MB