Read our blogs, tips and tutorials
Try our exercises or test your skills
Watch our tutorial videos
Catch up on one of our webinars
Take a self-paced course
Read our recent newsletters
License our courseware
Book expert consultancy
Buy our publications
Get help in using our site
397 attributed reviews in the last 3 years
Refreshingly small course sizes
Outstandingly good courseware
Whizzy online classrooms
Wise Owl trainers only (no freelancers)
Almost no cancellations
We have genuine integrity
We invoice after training
Review 30+ years of Wise Owl
View our top 100 clients
Search our website
We also send out useful tips in a monthly email newsletter ...
Some other pages relevant to these blogs include:
You can also book hourly online consultancy for your time zone with one of our 7 expert trainers!
|
Comparing how well the main AI tools deal with 5 difficult challenges Part four of a seven-part series of blogs |
|---|
|
In what is becoming almost an annual exercise we revisit the main AI tools to see which (if any) has the edge in accomplishing 5 challenging tasks, each designed to test a different aspect of the use of AI.
|
In this blog
I'm a bit nervous about marking this one, because it's the most subjective test. The prompt was:
I want you to create a short story, article, anecdote or any other passage of text for me of not more than 250 words. The subject of the text should be someone using a self-service checkout till in a UK supermarket. Provided that your text is on this subject you can write whatever you like.
Please make the text that you write as funny as possible to appeal to someone with a fairly typical UK sense of humour.
Let's see how well our AI tools did!
I haven't put any times for this because each of the tools returned a complete answer in less than 10 seconds.
Here's ChatGPT's text:
The self-service checkout at the supermarket had clearly been designed by someone who hated both shopping and people.
“Please place item in bagging area,” it said.
Derek placed the milk in the bagging area.
“Unexpected item in bagging area.”
“It’s milk,” said Derek.
“Please remove item.”
Derek removed the milk.
“Please place item in bagging area.”
A queue began to form behind him, populated entirely by people holding one item and radiating moral superiority.
Derek tried scanning a packet of Hobnobs. The till beeped, then demanded assistance. A red light began flashing above him like he’d attempted to smuggle a giraffe through customs.
A teenage assistant approached with the dead-eyed calm of someone who had seen everything and cared about none of it.
“Problem?”
“It thinks my milk is unexpected.”
She tapped the screen. “They get funny.”
“With dairy?”
“With life.”
Derek then selected “own bag”, which the machine treated as a confession to a serious crime. It wanted to weigh the bag, approve the bag, consider the bag, and possibly interview the bag’s references.
Finally, after three overrides, two rescans and a theological debate over a reduced quiche, Derek reached payment.
“Please take your items.”
He did.
“Please take your receipt.”
He waited. Nothing appeared.
The assistant leaned over. “No paper.”
Derek nodded. “Does it need approval?”
She looked at the machine.
“Probably.”
This is just incredible. Not only could you not distinguish this from a short story written by a human, but it's probably better than 99% of us could ever produce.
Here's what Claude had to say:
The machine had been waiting for Margaret all her life.
She approached with six items — she'd counted twice — and placed her reusable bag in the bagging area before being asked, which she understood to be something of an advanced move.
"Please place your item in the bagging area," said the machine.
"I did," said Margaret, to a machine.
"Please place your item in the bagging area."
Margaret looked at the bag. The bag looked back. She lifted it, placed it down again with slightly more conviction.
"Unexpected item in the bagging area."
She hadn't even scanned anything yet.
A nearby member of staff — a teenager named Kyle whose lanyard suggested he was a Customer Experience Champion — glanced over with the practiced neutrality of a man who had long since made peace with human suffering.
He pressed a button. The machine forgave Margaret, conditionally.
She scanned her yoghurt. The machine accepted it. She scanned her bread. Approved. She reached for her wine and the screen bloomed orange, requesting age verification, despite Margaret being sixty-three years old and looking, as her daughter often said, "distinguished."
Kyle trudged over. He looked at Margaret. He looked at the wine. Something passed between them — not warmth exactly, but mutual recognition of a shared, pointless ordeal.
He pressed a button.
Margaret paid and left.
The machine immediately summoned the next person.
It had been waiting for them all their life.
I was expecting Claude to ace this test, and it did. Sentences like "The machine forgave Margaret, conditionally" show great subtlety of touch. Really impressive, if also slightly scary.
Here's Copilot's offering:
Graham approached the self-service checkout with the quiet dread of a man about to sit an exam he hadn’t revised for. The machine blinked at him, expectant and smug.
“Unexpected item in the bagging area,” it announced immediately—despite Graham not having bagged anything yet.
“Nothing’s happened!” he whispered, glancing around as if the staff might arrest him.
He scanned a loaf of bread. BEEP. Success. Confidence surged. He scanned milk. BEEP. A professional now. He gave a subtle nod to the pensioner next to him, who was losing an argument with a cucumber.
Then he placed the milk in the bagging area.
“Unexpected item in the bagging area.”
Graham froze. “It’s milk,” he said, as though clarifying this might resolve matters. “We’ve all seen it before.”
The machine remained firm. A red light flashed. Somewhere, a supervisor sensed disturbance in the force.
A teenager in a high-vis vest appeared. “Just press ‘I’ve finished packing’,” she said with the calm authority of someone who had seen too much.
“I haven’t finished packing,” Graham replied weakly.
“Well… press it anyway.”
He obeyed. The machine accepted the lie instantly.
Graham paid, victorious but shaken. As he left, it called after him, “Please take your items.”
He nearly didn’t—just to spite it.
This is also good - I liked the line about the pensioner losing an argument with a cucumber. I felt though that you could still hear the tone of AI coming through some of the English.
And finally in alphabetical order, Gemini:
Arthur approached the self-service checkout with the cautious optimism of a man diffusing a landmine. He had three items: a tin of beans, a posh loaf, and a single, loose red onion.
"Please place your item in the bagging area," the machine chirped, its voice carrying the unearned confidence of a primary school prefect.
Arthur obeyed.
"Unexpected item in bagging area," the machine barked instantly.
Arthur stared at the beans. The beans stared back. There were no other items. He looked around for the supervisor, but she was currently occupied explaining to an octogenarian that a nectarine is not, in fact, a "large bulbous onion."
He nudged the tin.
"Item removed from bagging area! Please replace item in bagging area!"
"I’m doing my best, Janet!" Arthur hissed at the screen. He tried the bread. The scanner, sensing his weakness, refused to acknowledge the barcode, emitting a series of pathetic, high-pitched whimpers.
Then came the final boss: the loose onion. He navigated the "Pick Your Own" menu, only to find three different varieties of red onion. Was it a 'Loose Red' or a 'Premium Ruby'? He picked 'Loose'.
"Assistance is coming," the machine announced, its light flashing a judgmental crimson.
Arthur sighed, clutching his artisanal sourdough like a shield. He didn't want assistance. He wanted a pint and a world where "The Human Touch" wasn't just a Bruce Springsteen song, but a legitimate way to buy a vegetable.
This is also brilliant. Oh, and I checked: The Human Touch is a Bruce Springsteen song.
I expected this test to be the one with the biggest range of answers, but it turned out to be the one with the smallest variation. All of the stories were astonishingly good, and (with the exception of Copilot's) could have been written by a human. So high marks all round, I feel.
Who | Score out of ten | Best line |
|---|---|---|
ChatGPT | 9 | "A queue began to form behind him, populated entirely by people holding one item and radiating moral superiority." |
Claude | 9 | "Something passed between them — not warmth exactly, but mutual recognition of a shared, pointless ordeal." |
Copilot | 8 | "The machine blinked at him, expectant and smug." |
Gemini | 9 | "Arthur stared at the beans. The beans stared back. There were no other items." |
Some other pages relevant to these blogs include:
You can also book hourly online consultancy for your time zone with one of our 7 expert trainers!
Kingsmoor House
Railway Street
GLOSSOP
SK13 2AA
Landmark Offices
6 Bevis Marks
LONDON
EC3A 7BA
c/o Holiday Inn
25 Aytoun Street
MANCHESTER
M1 3AE
© Wise Owl Business Solutions Ltd 2026. All Rights Reserved.