Researchers Expose Flaw in AI Unlearning: Quantization Recovers 83% of 'Forgotten' Data, Revealing Need for Robust Strategies

Does your LLM truly unlearn? An embarrassingly simple approach to recover unlearned knowledge

Abstract: Large language models (LLMs) have shown remarkable proficiency in generating text, benefiting from extensive training on vast textual corpora. However, LLMs may also acquire unwanted behaviors from the diverse and sensitive nature of their training data, which can include copyrighted and private content. Machine unlearning has been introduced as a viable solution to remove the influence of such problematic content without the need for costly and time-consumi...