Researchers Use AI to Explain Human Decision-Making: LLMs Trained with Reinforcement Learning Offer Predictive and Interpretable Cognitive Models

Using Reinforcement Learning to Train Large Language Models to Explain Human Decisions

View PDF HTML (experimental) Abstract:A central goal of cognitive modeling is to develop models that not only predict human behavior but also provide insight into the underlying cognitive mechanisms. While neural network models trained on large-scale behavioral data often achieve strong predictive performance, they typically fall short in offering interpretable explanations of the cognitive processes they capture. In this work, we explore the potential of pretrained large language models (LLMs) ...