ReAGent: Towards A Model-agnostic Feature Attribution Method for Generative Language Models
The paper introduces a new feature attribution (FA) method for generative language models called the Recursive Attribution Generator (ReAGent). Unlike existing FAs, which are mostly developed for encoder-only language models…
Continue reading