Why Prolific’s AI user experience leaderboard matters

Prolific’s AI user experience leaderboard combines behavioral science with established public opinion survey methods. Using stratified sampling, we recruit a representative sample of participants across key demographics such as age, gender, ethnicity, education, and geography.
In each study, participants complete a set of standardized real-world tasks, such as email drafting, meal planning, trip organisation, and creative problem-solving. Each participant works with a randomly assigned AI model that is presented anonymously to reduce bias.
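As a rough illustration of blinded random assignment, the sketch below shuffles participants and cycles through the models so that group sizes stay balanced, then hides the model name behind a neutral label. The function name, the model names, and the "Assistant" label are hypothetical, and this is not Prolific's actual assignment procedure.

```python
import random

def assign_models_blind(participant_ids, model_names, seed=None):
    """Randomly assign one model per participant, balanced across models.

    Returns a dict mapping participant id -> (hidden_model_name, display_label).
    All names here are illustrative assumptions.
    """
    rng = random.Random(seed)
    shuffled = list(participant_ids)
    rng.shuffle(shuffled)  # random order removes any link to sign-up order

    assignments = {}
    for i, pid in enumerate(shuffled):
        model = model_names[i % len(model_names)]  # cycle to keep groups balanced
        # Participants only ever see the neutral label, never the model name.
        assignments[pid] = (model, "Assistant")
    return assignments

if __name__ == "__main__":
    result = assign_models_blind(range(6), ["model_a", "model_b", "model_c"], seed=42)
    for pid, (model, label) in sorted(result.items()):
        print(pid, label, "(hidden:", model + ")")
```

Passing a seed makes the assignment reproducible for auditing, while the shuffle keeps it unpredictable to participants.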
Models are evaluated on seven core dimensions: helpfulness, communication clarity, adaptiveness, understanding, trustworthiness, personality, and cultural alignment. Results are adjusted using multilevel regression with poststratification (MRP) to produce estimates that reflect the experiences of the broader population.
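To make the poststratification idea concrete, here is a minimal sketch under simplifying assumptions. A real MRP analysis fits a multilevel regression to estimate each demographic cell; this sketch substitutes a simple shrinkage of each cell mean toward the overall mean as a stand-in for that partial pooling. The function names, the `prior_strength` parameter, and the example numbers are all hypothetical.

```python
def shrunken_cell_means(ratings_by_cell, prior_strength=5.0):
    """Stand-in for the multilevel regression step (an assumption, not real MRP).

    Each cell mean is pulled toward the grand mean; cells with few
    ratings are pulled harder than cells with many.
    """
    all_ratings = [r for rs in ratings_by_cell.values() for r in rs]
    grand_mean = sum(all_ratings) / len(all_ratings)
    return {
        cell: (sum(rs) + prior_strength * grand_mean) / (len(rs) + prior_strength)
        for cell, rs in ratings_by_cell.items()
    }

def poststratify(cell_estimates, population_share):
    """Average the cell estimates, weighted by each cell's population share."""
    total = sum(population_share[c] for c in cell_estimates)
    return sum(cell_estimates[c] * population_share[c] for c in cell_estimates) / total

if __name__ == "__main__":
    # Hypothetical helpfulness ratings (1 to 5) for one model, by age group.
    ratings = {"18-34": [5, 4, 4, 5], "35-54": [3, 4], "55+": [2]}
    # Hypothetical population shares for the same cells.
    shares = {"18-34": 0.30, "35-54": 0.35, "55+": 0.35}

    cells = shrunken_cell_means(ratings)
    print(poststratify(cells, shares))
```

The point of the second step is that a group overrepresented in the sample (here, the youngest) does not dominate the final score; each cell counts in proportion to its share of the population.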