CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks
The article presents CompactifAI, a novel compression method for Large Language Models (LLMs) like ChatGPT and LlaMA. LLMs are advancing rapidly in AI, but their large size leads to significant…
Continue reading