Awesome LLM-based Human Simulation

A curated list of research papers and resources on LLM-based Human Simulation — the use of Large Language Models to simulate human behavior, cognition, and social interactions. This rapidly growing field spans psychology, economics, education, political science, and AI safety.

Contributions are welcome! If you would like to add a paper, please open a pull request.

1. Foundations & Surveys

From Persona to Personalization: A Survey on Role-Playing Language Agents (arXiv, 2024.04) [Paper]
Can Large Language Models Transform Computational Social Science? (Computational Linguistics, 2024.03) [Paper]
Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review (arXiv, 2024.01) [Paper]
Towards a Psychological Generalist AI: A Survey of Current Applications of Large Language Models and Future Prospects (arXiv, 2023.12) [Paper]
Using Large Language Models in Psychology (Nature Reviews Psychology, 2023.10) [Paper]
Emergent Abilities of Large Language Models (arXiv, 2022.06) [Paper]

2. LLM for Human Behavior Simulation

Lost in Simulation: LLM-Simulated Users are Unreliable Proxies for Human Users in Agentic Evaluations (arXiv, 2026.01) [Paper]
Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning (NeurIPS, 2025) [Paper] [Code]
A Mega-Study of Digital Twins Reveals Strengths, Weaknesses and Opportunities for Further Improvement (arXiv, 2025.09) [Paper]
How Many Human Survey Respondents is a Large Language Model Worth? An Uncertainty Quantification Perspective (arXiv, 2025.02) [Paper] [Code]
Implicit Behavioral Alignment of Language Agents in High-Stakes Crowd Simulations (EMNLP, 2025) [Paper] [Code]
Generative Agent Simulations of 1,000 People (arXiv, 2024.11) [Paper]
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers (arXiv, 2024.09) [Paper] [Code]
Language Models Show Stable Value Orientations Across Diverse Role-Plays (arXiv, 2024.08) [Paper]
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents (ACL, 2024) [Paper]
Limited Ability of LLMs to Simulate Human Psychological Behaviours: A Psychometric Analysis (arXiv, 2024.05) [Paper]
Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality (arXiv, 2024.02) [Paper]
LLM Agents for Psychology: A Study on Gamified Assessments (arXiv, 2024.02) [Paper]
Quantifying the Persona Effect in LLM Simulations (arXiv, 2024.02) [Paper] [Code]
"Kelly Is a Warm Person, Joseph Is a Role Model": Gender Biases in LLM-Generated Reference Letters (arXiv, 2023.10) [Paper] [Code]
Personality Traits in Large Language Models (arXiv, 2023.07) [Paper] [Code]
Role Play with Large Language Models (Nature, 2023.11) [Paper]
The Challenge of Using LLMs to Simulate Human Behavior: A Causal Inference Perspective (arXiv, 2023.12) [Paper]
Meet Your Favorite Character: Open-Domain Chatbot Mimicking Fictional Characters with Only a Few Utterances (arXiv, 2022.04) [Paper]

3. LLM Agent

An LLM-based Simulation Framework for Embodied Conversational Agents in Psychological Counseling (arXiv, 2024.10) [Paper] [Code]
MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems (arXiv, 2024.08) [Paper]
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue (arXiv, 2024.06) [Paper] [Code]
Towards Lifelong Learning of Large Language Models: A Survey (arXiv, 2024.06) [Paper] [Code]
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents (arXiv, 2024.05) [Paper]
MetaAgents: Simulating Interactions of Human Behaviors for LLM-based Task-Oriented Coordination via Collaborative Generative Agents (arXiv, 2023.10) [Paper]
Agents: An Open-Source Framework for Autonomous Language Agents (arXiv, 2023.09) [Paper] [Code]
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework (arXiv, 2023.08) [Paper] [Code]
MetaGPT: Meta Programming for Multi-Agent Collaborative Framework (arXiv, 2023.08) [Paper] [Code]
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents (arXiv, 2023) [Paper] [Code]

4. LLM Bias & Value

Gender Bias of LLM in Economics: An Existentialism Perspective (arXiv, 2024.10) [Paper]
Measuring Human and AI Values based on Generative Psychometrics with Large Language Models (arXiv, 2024.09) [Paper] [Code]
United in Diversity? Contextual Biases in LLM-Based Predictions of the 2024 European Parliament Elections (arXiv, 2024.08) [Paper]
Representation Bias in Political Sample Simulations with Large Language Models (arXiv, 2024.07) [Paper]
New Job, New Gender? Measuring the Social Bias in Image Generation Models (MM, 2024) [Paper]
Whose Opinions Do Language Models Reflect? (ICML, 2023) [Paper]
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models (arXiv, 2023.10) [Paper]
Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation (arXiv, 2023.11) [Paper]
Evaluating the Moral Beliefs Encoded in LLMs (NeurIPS, 2023) [Paper] [Code]
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment (NeurIPS, 2022) [Paper] [Code]

5. LLM Simulation Applications

5.1 Economics & Finance

AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions (arXiv, 2026.02) [Paper] [Code]
Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus? (ACM EC, 2024) [Paper]
Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method (Findings of ACL, 2024) [Paper]
CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading (arXiv, 2024.06) [Paper] [Code]
Simulating Financial Market via Large Language Model Based Agents (arXiv, 2024.06) [Paper]
EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities (ACL, 2024) [Paper] [Code]
Designing Heterogeneous LLM Agents for Financial Sentiment Analysis (ACM TMIS, 2024.08) [Paper]

5.2 Politics & Society

Auditing Political Exposure Bias: Algorithmic Amplification on Twitter/X Approaching the 2024 US Presidential Election (arXiv, 2024.11) [Paper]
GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy (arXiv, 2024.07) [Paper]
Simulating The U.S. Senate: An LLM-Driven Agent Approach to Modeling Legislative Behavior and Bipartisanship (arXiv, 2024.06) [Paper]
Trump, Twitter, and Truth Social: How Trump Used Both Mainstream and Alt-Tech Social Media to Drive News Media Attention (Journal of Information Technology & Politics, 2024.03) [Paper]
War and Peace (WarAgent): Large Language Model-Based Multi-Agent Simulation of World Wars (arXiv, 2023.11) [Paper] [Code]

5.3 Education

Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course (arXiv, 2024.07) [Paper]
Simulating Classroom Education with LLM-Empowered Agents (arXiv, 2024.06) [Paper]
Generative Students: Using LLM-Simulated Student Profiles to Support Question Item Evaluation (arXiv, 2024.05) [Paper]
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education (arXiv, 2024.04) [Paper]
PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations (arXiv, 2024.03) [Paper]

5.4 Recommendation System & User Simulation

PUB: An LLM-Enhanced Personality-Driven User Behaviour Simulator for Recommender System Evaluation (SIGIR, 2025) [Paper]
LLM as User Simulator: Towards Training News Recommender without Real User Interactions (SIGIR, 2025) [Paper]
Agentic Feedback Loop Modeling Improves Recommendation and User Simulation (SIGIR, 2025) [Paper] [Code]
SimUSER: Simulating User Behavior with Large Language Models for Recommender System Evaluation (ACL Industry, 2025) [Paper]
A LLM-based Controllable, Scalable, Human-Involved User Simulator Framework for Conversational Recommender Systems (WWW, 2025) [Paper]
RecUserSim: A Realistic and Diverse User Simulator for Evaluating Conversational Recommender Systems (WWW Companion, 2025) [Paper]
LLM-Powered User Simulator for Recommender System (AAAI, 2025) [Paper]
RecAgent: User Behavior Simulation with Large Language Model-based Agents (ACM TOIS, 2024) [Paper] [Code]
BASES: Large-scale Web Search User Simulation with Large Language Model based Agents (Findings of EMNLP, 2024) [Paper]
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems (SCI-CHAT Workshop, 2024) [Paper]
A Survey on Large Language Models for Recommendation (arXiv, 2024.08) [Paper] [Code]
LLM-Rec: Personalized Recommendation via Prompting Large Language Models (arXiv, 2024.07) [Paper]
Generating Personalized Recommendations via Large Language Models (LLMs) (Technical Disclosure Commons, 2022.12) [Paper]

5.5 Customer & Consumer Simulation

ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants (arXiv, 2026.01) [Paper]
Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping (arXiv, 2025.10) [Paper]
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents (NeurIPS, 2025) [Paper]
Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning (NeurIPS SEA Workshop, 2025) [Paper]
LLMs Reproduce Human Purchase Intent via Semantic Similarity Elicitation of Likert Ratings (arXiv, 2025.10) [Paper] [Code]
LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants? (arXiv, 2025.09) [Paper]
What Is Your AI Agent Buying? Evaluation, Biases, Model Dependence, & Emerging Implications for Agentic E-Commerce (arXiv, 2025.08) [Paper]
ShoppingBench: A Real-World Intent-Grounded Shopping Benchmark for LLM-based Agents (arXiv, 2025.08) [Paper]
LLM-Based Multi-Agent System for Simulating and Analyzing Marketing and Consumer Behavior (IEEE ICEBE, 2025) [Paper]
Predicting Behaviors with Large Language Model (LLM)-Powered Digital Twins of Consumers (MSI Working Paper, 2025) [Paper]
AI-Human Hybrids for Marketing Research: Leveraging Large Language Models (LLMs) as Collaborators (Journal of Marketing, 2025) [Paper]
Large Language Models for Market Research: A Data-augmentation Approach (arXiv, 2024.12) [Paper]
Can Large Language Models Capture Human Preferences? (Marketing Science, 2024) [Paper]
Can LLM Agents Simulate Multi-Turn Human Behavior? Evidence from Real Online Customer Behavior Data (arXiv, 2025.03) [Paper]
Using LLMs for Market Research (Harvard Business School Working Paper, 2025) [Paper]
Challenges and Opportunities of LLM-Based Synthetic Personae and Data in HCI (CHI EA, 2024) [Paper]

5.6 Others

Improve Temporal Awareness of LLMs for Sequential Recommendation (arXiv, 2024.05) [Paper]
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments (arXiv, 2024.03) [Paper] [Code]
Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf (arXiv, 2023.09) [Paper]
The SocialAI School: Insights from Developmental Psychology towards Artificial Socio-Cultural Agents (arXiv, 2023.07) [Paper] [Code]

6. LLM Evaluation

Benchmarking LLMs' Judgments with No Gold Standard (arXiv, 2024.11) [Paper]
Cognitive Overload Attack: Prompt Injection for Long Context (arXiv, 2024.10) [Paper]
Moral Alignment for LLM Agents (arXiv, 2024.10) [Paper]
Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing (arXiv, 2024.09) [Paper] [Code]
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (ACL, 2024) [Paper] [Code]
Regurgitative Training: The Value of Real Data in Training Large Language Models (arXiv, 2024.07) [Paper]
Evaluating the Performance of Large Language Models via Debates (arXiv, 2024.06) [Paper]
Auto-Arena: Automating LLM Evaluations with Agent Peer Battles and Committee Discussions (arXiv, 2024.05) [Paper] [Code]
AgentClinic: A Multimodal Agent Benchmark to Evaluate AI in Simulated Clinical Environments (arXiv, 2024.05) [Paper] [Code]
How Reliable is Your Simulator? Analysis on the Limitations of Current LLM-based User Simulators for Conversational Recommendation (arXiv, 2024.03) [Paper] [Code]
Humans or LLMs as the Judge? A Study on Judgement Bias (arXiv, 2024.02) [Paper] [Code]
LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Games (CoRR, 2023.09) [Paper]
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback (NeurIPS, 2023) [Paper] [Code]
AgentSims: An Open-Source Sandbox for Large Language Model Evaluation (arXiv, 2023.08) [Paper] [Code]

7. Cognition & Psychology

Grounded Cognition (Annual Review of Psychology, 2008) [Paper]
Language and Simulation in Conceptual Processing (Symbols, Embodiment, and Meaning, 2008) [Paper]
Dual Coding Theory: Retrospect and Current Status (Canadian Journal of Psychology, 1991) [Paper]
How and Why Thoughts Change: Foundations of Cognitive Psychotherapy (Oxford University Press, 2015) [Paper]
Emotion and Social Theory: Corporeal Reflections on the (Ir)rational (Sage, 2000) [Paper]
Neural Dynamics of Decision Making Under Risk: Affective Balance and Cognitive-Emotional Interactions (Psychological Review, 1987) [Paper]
A Cognition-Based View of Decision Processes in Complex Social-Ecological Systems (Ecology and Society, 2007.06) [Paper]
Information, Incentives, and Proenvironmental Consumer Behavior (Journal of Consumer Policy, 1999) [Paper]
Human Incentives (The Greek Economy and the Crisis, 2012) [Paper]

8. Social Simulation

AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society (arXiv, 2025.02) [Paper] [Code]
OASIS: Open Agent Social Interaction Simulations with One Million Agents (arXiv, 2024.11) [Paper] [Code]
Multi-Agents Are Social Groups: Investigating Social Influence of Multiple Agents in Human-Agent Interactions (CSCW, 2025) [Paper]
Unpacking a Black Box: A Conceptual Anatomy Framework for Agent-Based Social Simulation Models (JASSS, 2023) [Paper]
Simulation: A Tool for System Design and Analysis (GPH-IJSSHR, 2023) [Paper]
Analysing the Combined Health, Social and Economic Impacts of the Coronavirus Pandemic Using Agent-Based Social Simulation (Minds and Machines, 2020.06) [Paper]
The Termination Risks of Simulation Science (Erkenntnis, 2020) [Paper]
Simulating Societies: The Computer Simulation of Social Phenomena (2018) [Paper]
Computer Simulations of Space Societies (2018) [Paper]
Can Robots Be Lawyers? Computers, Lawyers, and the Practice of Law (Georgetown Journal of Legal Ethics, 2017) [Paper]
Ethics in Planning (2017) [Paper]
Social Self-Organization: Agent-Based Simulations and Experiments to Study Emergent Social Behavior (2012) [Paper]
Analyzing and Modeling Real-World Phenomena with Complex Networks: A Survey of Applications (Advances in Physics, 2011.03) [Paper]
A Simulation System of Social Economic (Computer and Information Science, 2011.07) [Paper]
A Methodology for Complex Social Simulations (JASSS, 2010) [Paper]
Agent-Based Modeling: A New Approach for Theory Building in Social Psychology (Personality and Social Psychology Review, 2007) [Paper]
Simulated Experiments: Methodology for a Virtual World (Philosophy of Science, 2003.01) [Paper]
Understanding Climate Policy Using Participatory Agent-Based Social Simulation (MABS, 2000) [Paper]
System Dynamics: Simulation for Policy Analysis from a Feedback Perspective (Qualitative Simulation Modeling and Analysis, 1991) [Paper]
Policy Exploration through Microanalytic Simulation (1976) [Paper]

9. Conference

The Multi-hub Academic Conference: Global, Inclusive, Culturally Diverse, Creative, Sustainable (Frontiers in Research Metrics and Analytics, 2021.07) [Paper]
Ten Simple Rules to Host an Inclusive Conference (PLOS Computational Biology, 2022.07) [Paper]
Transitioning to Sustainable Academic Conferences Needs More Experimentation and Reflection (Global Sustainability, 2023.09) [Paper]
Creative Destruction in Academia: A Time to Reimagine Practices in Alignment with Sustainability Values (Sustainability Science, 2023.07) [Paper]

10. Others

Can Generative AI Improve Social Science? (PNAS, 2024.03) [Paper]
Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design (TACL, 2024) [Paper] [Code]
Employing Large Language Models in Survey Research (NLP Journal, 2023.09) [Paper]
Sequential Modeling Enables Scalable Learning for Large Vision Models (CVPR, 2024) [Paper] [Code]
"You Are Not the Expert Here": How Large Language Models Impact Help-Seeking in Online Communities (CHI, 2024) [Paper]

Acknowledgement

This repository is initially built and maintained by Qian Wang (persdre@gmail.com).

Citation

If you find this repository useful, please consider citing our papers:

@inproceedings{wang2025canllmsimulations,
  author    = {Wang, Qian and Tang, Zhenheng and He, Bingsheng},
  title     = {Can LLM Simulations Truly Reflect Humanity? A Deep Dive},
  booktitle = {ICLR Blogposts 2025},
  year      = {2025},
  url       = {https://iclr-blogposts.github.io/2025/blog/rethinking-llm-simulation/}
}

@misc{wang2025llmbasedhumansimulationsreliable,
  title         = {LLM-based Human Simulations Have Not Yet Been Reliable},
  author        = {Qian Wang and Jiaying Wu and Zichen Jiang and Zhenheng Tang and Bingqiao Luo and Nuo Chen and Wei Chen and Bingsheng He},
  year          = {2025},
  eprint        = {2501.08579},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CL},
  url           = {https://arxiv.org/abs/2501.08579}
}

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome LLM-based Human Simulation

Table of Contents

1. Foundations & Surveys

2. LLM for Human Behavior Simulation

3. LLM Agent

4. LLM Bias & Value

5. LLM Simulation Applications

5.1 Economics & Finance

5.2 Politics & Society

5.3 Education

5.4 Recommendation System & User Simulation

5.5 Customer & Consumer Simulation

5.6 Others

6. LLM Evaluation

7. Cognition & Psychology

8. Social Simulation

9. Conference

10. Others

Acknowledgement

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Folders and files

Latest commit

History

Repository files navigation

Awesome LLM-based Human Simulation

Table of Contents

1. Foundations & Surveys

2. LLM for Human Behavior Simulation

3. LLM Agent

4. LLM Bias & Value

5. LLM Simulation Applications

5.1 Economics & Finance

5.2 Politics & Society

5.3 Education

5.4 Recommendation System & User Simulation

5.5 Customer & Consumer Simulation

5.6 Others

6. LLM Evaluation

7. Cognition & Psychology

8. Social Simulation

9. Conference

10. Others

Acknowledgement

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Packages