SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
The article presents SPHINX, a versatile multi-modal large language model (MLLM) that incorporates a joint mixing strategy of model weights, tuning tasks, and visual embeddings. The model is designed to…
Continue reading