L4Q: Parameter Efficient Quantization-Aware Training on Large Language Models via LoRA-wise LSQ
This article introduces L4Q, a novel algorithm for parameter-efficient quantization-aware training on Large Language Models (LLMs). L4Q aims to improve the generality of these models using low-rank adaptation (LoRA)…
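To make the combination concrete: LSQ (Learned Step Size Quantization) rounds each weight to a signed integer grid whose step size is a learnable parameter, and a LoRA-aware scheme quantizes the weight after merging in the low-rank update. The sketch below is a minimal pure-Python illustration of that forward pass only, not the paper's implementation; the shapes, initialization, and step-size heuristic are all assumptions for demonstration.

```python
import random

def lsq_quantize(w, step, n_bits=4):
    """LSQ-style forward pass for one weight: scale by the learnable
    step size, round, clamp to the signed integer range, rescale."""
    qn, qp = -(2 ** (n_bits - 1)), 2 ** (n_bits - 1) - 1
    return max(qn, min(qp, round(w / step))) * step

# Illustrative 1-D "layer": frozen weights plus a rank-1 LoRA update.
# All names and numbers here are assumptions, not the paper's setup.
random.seed(0)
d = 8
W = [random.uniform(-0.5, 0.5) for _ in range(d)]    # frozen pretrained weights
lora_a = [random.uniform(-0.05, 0.05) for _ in range(d)]
lora_b = 0.0                                         # zero-init: no update yet

# Merge the LoRA update into the weight, then quantize the merged value --
# the order a LoRA-aware QAT scheme needs for a fused quantized deployment.
w_eff = [w + lora_b * a for w, a in zip(W, lora_a)]
step = 2 * max(abs(w) for w in w_eff) / 2 ** 4       # simple max-based init
W_q = [lsq_quantize(w, step) for w in w_eff]
```

In training, the step size (and the LoRA factors) would receive gradients through a straight-through estimator; only the frozen base weights stay untouched, which is what makes the approach parameter-efficient.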