This article examines model editing in Large Language Models (LLMs). The authors seek to understand its strengths and limitations in order to support robust, realistic applications of communicative AI. The research tackles three key questions: Can edited LLMs behave consistently in realistic situations? To what extent does rephrasing prompts lead LLMs to deviate from the edited knowledge? Which knowledge features correlate with the performance and robustness of editing? The findings reveal a substantial gap between existing editing methods and the practical application of LLMs, and show that more popular knowledge is easier to recall but harder to edit effectively.
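The second question, robustness to rephrasing, can be made concrete with a simple check: query the edited model with several paraphrases of the edit prompt and measure how often the edited answer survives. The sketch below is illustrative only and is not the paper's code; `query_model` is a hypothetical stub standing in for an actual edited LLM, and the example fact is invented.

```python
# Minimal sketch of a paraphrase-robustness check for an edited model.
# `query_model` is a hypothetical stand-in for a real edited LLM call;
# it is stubbed here so only the evaluation logic is shown.

def query_model(prompt: str) -> str:
    # Stub: this toy "model" retains the edit only for the exact
    # phrasing used during editing, mimicking a brittle edit.
    edited_answers = {"Who is the CEO of Acme?": "Jane Doe"}
    return edited_answers.get(prompt, "John Smith")  # pre-edit answer otherwise

def paraphrase_robustness(paraphrases: list[str], target: str) -> float:
    # Fraction of paraphrased prompts on which the edit still holds.
    hits = sum(query_model(p) == target for p in paraphrases)
    return hits / len(paraphrases)

prompts = [
    "Who is the CEO of Acme?",            # original edit prompt
    "Who leads Acme as chief executive?",  # paraphrase 1
    "Name the current CEO of Acme.",       # paraphrase 2
]
score = paraphrase_robustness(prompts, "Jane Doe")
```

A score below 1.0 indicates the edit does not generalize beyond the original phrasing, which is the kind of disparity between editing benchmarks and realistic use that the paper investigates.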


Publication date: 9 Feb 2024
Project Page: https://github.com/xbmxb/edit_analysis
Paper: https://arxiv.org/pdf/2402.05827