Bringing light to the GPT-4o vs. GPT-5 personality controversy
Blog post from Surge AI
In a detailed evaluation of GPT-4o and GPT-5, a study involving 490 human evaluators compared the conversational abilities of these AI models across 850 interactions, focusing on aspects such as emotional intelligence and tone. The results revealed that GPT-4o was generally preferred for its friendly, sycophantic style, while GPT-5 was seen as more professional and balanced, offering constructive advice with follow-up suggestions. Despite a slight preference for GPT-4o, both models were criticized for occasionally adopting a tone akin to a "self-help pamphlet" rather than engaging in natural human conversation. The study underscored that while GPT-5 improved in providing logical, structured advice, it sometimes felt too detached, leading to a call for enhanced personalization in AI models to better meet diverse user needs.