Parameter-efficient modules

Artificial Intelligence Computation and Language

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

root February 11, 2024 0

This paper introduces CREMA, a new and efficient modality-fusion framework designed to improve video reasoning. By leveraging existing pre-trained models, it incorporates multiple informative modalities from videos, such as optical…

Artificial Intelligence Computation and Language

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

root February 9, 2024 0

The paper proposes CREMA, a new efficient and modular modality-fusion framework for video reasoning. This model enhances the flexibility and efficiency of multimodal compositional reasoning approaches by allowing for the…

Page 1 of 1

Press ESC to close

Parameter-efficient modules

Please allow ads on our site