Does your LLM truly unlearn? An embarrassingly simple approach to recover unlearned knowledge
View PDF
HTML (experimental)
Abstract:Large language models (LLMs) have shown remarkable proficiency in generating text, benefiting from extensive training on vast textual corpora. However, LLMs may also acquire unwanted behaviors from the diverse and sensitive nature of their training data, which can include copyrighted and private content. Machine unlearning has been introduced as a viable solution to remove the influence of such problematic content without the need for costly and time-consumi...
Read more at arxiv.org