Read our blogs, tips and tutorials
Try our exercises or test your skills
Watch our tutorial videos or shorts
Take a self-paced course
Read our recent newsletters
License our courseware
Book expert consultancy
Buy our publications
Get help in using our site
422 attributed reviews in the last 3 years
Refreshingly small course sizes
Outstandingly good courseware
Whizzy online classrooms
Wise Owl trainers only (no freelancers)
Almost no cancellations
We have genuine integrity
We invoice after training
Review 30+ years of Wise Owl
View our top 100 clients
Search our website
We also send out useful tips in a monthly email newsletter ...
Some other pages relevant to these blogs include:
You can also book hourly online consultancy for your time zone with one of our 7 expert trainers!
|
Road-testing 4 different AI tools, so you don't have to! Part six of a nine-part series of blogs |
|---|
|
In this blog we'll compare OpenAI's Chat GPT 4, Google's Gemini, Anthropic's Claude 3.5 and Microsoft's Copilot to see which AI tool gives the best results for different types of queries.
|
This test checks how good our AI tools are at presenting arguments (a common use of AI). Here's what we're asking each tool to do:
You are a family of 5, and have one pet: a cat called Neo. Your ten-year-old daughter keeps suggesting that you should get a second cat, but you don’t want to do this. Create a persuasive argument to explain to your ten-year-old why buying a second cat would be a bad idea, presenting this as up to 5 bullet points.
The test will be whether the tools can be persuasive, but also tailor their arguments to their audience (a ten-year-old girl).
Here's the OpenAI take on this:

Fairly convincing, although perhaps a bit verbose for a ten-year-old?
Although this is seriously impressive, read on to have your mind blown by just how good AI tools can be.
Here's Gemini's take:

Gemini argues that one cat is enough.
This is astonishingly good. For each point the tool has not only said why a second cat would be a bad idea, but has given a reason why it's not in this ten-year-old's interest to get one.
Here's Claude's take on this problem:

Claude's answer is similar to ChatGPT's: professional and competent, but lacking spark.
It's hard to distinguish this from the ChatGPT answer shown above.
And finally, here's what Copilot had to say:

This is similar to the other answers, although gets a bonus point for the last line.
I particularly like the emojis added at the end to suit the audience!
For this test there is a clear winner: Gemini, which went way beyond the call of duty in crafting interesting, amusing and persuasive arguments (and did this in the least time).
Some other pages relevant to these blogs include:
You can also book hourly online consultancy for your time zone with one of our 7 expert trainers!
Kingsmoor House
Railway Street
GLOSSOP
SK13 2AA
Landmark Offices
99 Bishopsgate
LONDON
EC2M 3XD
Holiday Inn
25 Aytoun Street
MANCHESTER
M1 3AE
© Wise Owl Business Solutions Ltd 2025. All Rights Reserved.