This research applies multimodal large language models (MLLMs) to art appreciation education. The study introduces LLaVA-Docent, an MLLM-based model that aims to broaden access to and engagement with art education. The model's design was informed by an extensive literature review and expert consultation, and it was trained on a virtual dialogue dataset generated with GPT-4. The effectiveness of LLaVA-Docent was assessed through quantitative and qualitative evaluations, which revealed both its strengths and its weaknesses. The study concludes that LLaVA-Docent can make a meaningful contribution to the field of art education.
Publication date: 12 Feb 2024
Project Page: N/A
Paper: https://arxiv.org/pdf/2402.06264