I ran the same complex prompts on ChatGPT, Claude, and Gemini — the results surprised me
A tech writer conducted a comparison of AI models ChatGPT, Claude, and Gemini using complex prompts. The results revealed significant differences in reasoning and formatting among the models. This hands-on approach highlighted the limitations of traditional AI benchmarks in real-world applications.
- ▪The writer used paid versions of ChatGPT, Claude, and Gemini for a fair comparison.
- ▪Complex, multi-layered prompts were employed to test the models' capabilities.
- ▪The variations in performance were surprising and showcased each model's unique strengths.
Opening excerpt (first ~120 words) tap to expand
{ "@context": "https://schema.org", "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": "1", "name": "Home", "item": "https://www.xda-developers.com/" }, { "@type": "ListItem", "position":"2", "name": "AI tools", "item": "https://www.xda-developers.com/ai-tools/" }, { "@type": "ListItem", "position":"3", "name": "I ran the same complex prompts on ChatGPT, Claude, and Gemini \u2014 the results surprised me", "item": "https://www.xda-developers.com/ran-same-complex-prompts-on-chatgpt-claude-gemini-and-local-llm/" } ] } I ran the same complex prompts on ChatGPT, Claude, and Gemini — the results surprised me By Parth Shah Published May 19, 2026, 8:30 AM EDT Parth, a seasoned tech writer, wields the keyboard (or pen) with finesse to unravel the intricacies of…
Excerpt limited to ~120 words for fair-use compliance. The full article is at XDA Developers.