LLM token embedding Papers

Artificial Intelligence Computation and Language

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

root February 9, 2024 0

The paper proposes CREMA, a new efficient and modular modality-fusion framework for video reasoning. This model enhances the flexibility and efficiency of multimodal compositional reasoning approaches by allowing for the…

Press ESC to close

LLM token embedding

Please allow ads on our site