In-context Autoencoder for Context Compression in a Large Language Model

The team from Microsoft introduces In-context Autoencoder (ICAE), a novel method aimed at tackling the long context issue in large language models (LLMs). This approach seeks to compress lengthy contexts into a compact form, reducing computational requirements and memory overheads while maintaining high performance levels. Using a combination of pretraining and fine-tuning, the ICAE proves effective in creating compressed context representations that the LLM can leverage for a variety of purposes, demonstrating significant potential for more efficient LLM inference in real-world applications.

Publication date: 13 Jul 2023
Project Page: Not provided
Paper: https://arxiv.org/pdf/2307.06945.pdf

Post Views: 461

Press ESC to close

Share Article:

root

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition

NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis

Please allow ads on our site