Road-testing 4 different AI tools, so you don't have to!
Part six of a nine-part series of blogs
In this blog we'll compare OpenAI's ChatGPT 4, Google's Gemini, Anthropic's Claude 3.5 and Microsoft's Copilot to see which AI tool gives the best results for different types of queries.
This test checks how good our AI tools are at presenting arguments (a common use of AI). Here's what we're asking each tool to do:
You are a family of 5, and have one pet: a cat called Neo. Your ten-year-old daughter keeps suggesting that you should get a second cat, but you don’t want to do this. Create a persuasive argument to explain to your ten-year-old why buying a second cat would be a bad idea, presenting this as up to 5 bullet points.
The test is whether the tools can be persuasive while also tailoring their arguments to their audience (a ten-year-old girl).
Here's the OpenAI take on this:
Fairly convincing, although perhaps a bit verbose for a ten-year-old?
Although this is seriously impressive, read on to have your mind blown by just how good AI tools can be.
Here's Gemini's take:
Gemini argues that one cat is enough.
This is astonishingly good. For each point the tool has not only said why a second cat would be a bad idea, but has given a reason why it's not in this ten-year-old's interest to get one.
Here's Claude's take on this problem:
Claude's answer is similar to ChatGPT's: professional and competent, but lacking spark.
It's hard to distinguish this from the ChatGPT answer shown above.
And finally, here's what Copilot had to say:
This is similar to the other answers, although it gets a bonus point for the last line.
I particularly like the emojis added at the end to suit the audience!
For this test there is a clear winner: Gemini, which went way beyond the call of duty in crafting interesting, amusing and persuasive arguments (and did this in the least time).
© Wise Owl Business Solutions Ltd 2024. All Rights Reserved.