Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
This paper examines the novel human activity of attacking large language models (LLMs) to provoke abnormal outputs, a practice known as ‘red teaming’. Interviews with practitioners from various…