ELIXR ("Embeddings for Language/Image-aligned X-Rays") is an approach to medical AI designed to address limitations of earlier methods. It uses a multimodal model that combines a vision encoder with a large language model (LLM), so that it can learn from both medical images and their associated text reports. This improves generalization across a range of tasks, including zero-shot and data-efficient classification, semantic search, visual question answering (VQA), and radiology report quality assurance (QA). ELIXR also offers better interpretability and richer interaction between humans and AI, which could enable a new generation of medical AI applications.
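
To make the zero-shot classification idea concrete, here is a minimal sketch of how language-aligned image embeddings are commonly used for it: an image embedding is compared against the embeddings of a positive and a negative text prompt, and a softmax over the two similarities gives a score. This is a generic pattern, not ELIXR's actual implementation. The function names, the `temperature` value, and the toy vectors are all assumptions made for illustration; in practice the embeddings would come from the trained vision and text encoders.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def zero_shot_score(image_emb, positive_emb, negative_emb, temperature=0.1):
    """Return a score in (0, 1) that the image matches the positive prompt.

    Hypothetical helper: applies a two-way softmax to the temperature-scaled
    similarities between the image and each prompt embedding.
    """
    pos = cosine_similarity(image_emb, positive_emb) / temperature
    neg = cosine_similarity(image_emb, negative_emb) / temperature
    m = max(pos, neg)  # subtract the max for numerical stability
    e_pos = math.exp(pos - m)
    e_neg = math.exp(neg - m)
    return e_pos / (e_pos + e_neg)


# Toy 3-dimensional embeddings (made up; real embeddings are much larger).
image_emb = [0.9, 0.1, 0.2]
positive_emb = [1.0, 0.0, 0.1]  # stands in for a prompt such as "pleural effusion"
negative_emb = [0.0, 1.0, 0.1]  # stands in for "no pleural effusion"

score = zero_shot_score(image_emb, positive_emb, negative_emb)
print(f"zero-shot score for the positive prompt: {score:.3f}")
```

No labeled training images are needed for this step: changing the finding only requires changing the text prompts, which is what makes the approach zero-shot.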

Publication date: Aug 2, 2023
Project Page: N/A
Paper: https://arxiv.org/ftp/arxiv/papers/2308/2308.01317.pdf