The paper introduces AttnLRP, a method that extends Layer-wise Relevance Propagation (LRP) to the attention layers of transformer models. It aims to provide a better understanding of the reasoning process of these models, which are prone to biased predictions. Unlike other methods, AttnLRP attributes relevance not only to the input but also to the latent representations of the transformer, while keeping the computational cost comparable to a single backward pass. The paper demonstrates that AttnLRP surpasses alternative methods in faithfulness. An open-source implementation is available on GitHub.
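
For orientation, here is a minimal sketch of the single-backward-pass workflow described above, written against the open-source package from the project page. The import path `lxt.models.llama`, the `attnlrp.register` call, and the choice of checkpoint are assumptions based on the repository's usage pattern and may differ between versions; the rest is standard PyTorch/transformers code.

```python
import torch
from transformers import AutoTokenizer

# Assumed import path from the project repository
# (rachtibat/LRP-for-Transformers); verify against the installed version.
from lxt.models.llama import LlamaForCausalLM, attnlrp

model_id = "meta-llama/Llama-2-7b-hf"  # hypothetical checkpoint choice
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = LlamaForCausalLM.from_pretrained(model_id)
model.eval()

# Assumed API: replace attention and other layers with LRP-compatible rules,
# so that gradients computed in the backward pass become relevances.
attnlrp.register(model)

prompt = "Paris is the capital of"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids

# Detach the embeddings to make them a leaf tensor whose .grad is retained.
inputs_embeds = model.get_input_embeddings()(input_ids).detach().requires_grad_()

# One forward pass, then a single backward pass from the top logit;
# with the LRP rules registered, the input gradients are the relevances.
logits = model(inputs_embeds=inputs_embeds).logits
top_logit = logits[0, -1].max()
top_logit.backward()

# Token-level relevance: sum over the embedding dimension.
relevance = inputs_embeds.grad.float().sum(-1)
for token, r in zip(tokenizer.convert_ids_to_tokens(input_ids[0]), relevance[0]):
    print(f"{token:>12}  {r.item():+.4f}")
```

Summing over the embedding dimension yields one signed relevance score per input token; in the usual LRP reading, positive scores support the predicted token and negative scores speak against it.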

Publication date: 8 Feb 2024
Project Page: https://github.com/rachtibat/LRP-for-Transformers
Paper: https://arxiv.org/pdf/2402.05602