Comparing the 5 leading AI tools for image generation from text Part five of an eight-part series of blogs |
---|
Having recently compared the 4 main AI tools for text prompts, I thought I'd do the same for image generation tools. In this blog series we compare Dall-E (via ChatGPT and separately via Copilot), Firefly, Midjourney and Stable Diffusion for 3 pre-defined tests to see which ones score most highly for cost, ease-of-use, speed, editing ability and above all for quality of image.
|
Here's a reminder of the second picture I want to create:
Create a line drawing of a treasure island with a curvy coastline. The background of the island should be white apart from child-like drawings of the following features:
Hell’s Kitchen (a volcano)
Hangman’s lookout (a gibbet with a skeleton hanging from it)
Dead man’s desert
Sulphur Springs
Magwitch marshes
Tombs Town
Lone Palm
Treachery Hills
These features should be roughly equally spaced around the island. Do not show any text on the picture – I will add that later.
The coastline of the island should be clearly delineated, and child-like drawings appear in the sea of the following:
2 or 3 child-like drawings of whales
A line-drawing of a large bird like an albatross
A pirate ship in a bay
Keep everything simple. Avoid having much fill or background colouring. When in doubt, use simple outline drawings of things. Use pastel colours throughout.
Here's the sort of thing I'm trying to produce:
I'm fervently hoping my AI tools can do far better than this!
Here's what the 5 tools produced:
AI tool | Image produced |
---|---|
ChatGPT | |
Copilot | |
Firefly | |
Midjourney | |
Stable Diffusion | |
After reviewing the images, I decided there was no point trying to edit them, as they were all so far away from what I had tried to achieve. This is probably my fault, not that of the AI tools!
For me there's a clear winner and a clear loser:
Position | AI tool | Reasoning |
---|---|---|
Winner | Copilot | Copilot has given me 4 good choices, with the bottom left image showing exactly the effect I was trying to achieve. |
Loser | Stable Diffusion | If you're wondering why this picture doesn't seem to contain many of the things I asked for, here's why: Stable Diffusion not only has a 500-character prompt limit but also seemed to object to some of my punctuation. So in the end here's what this image is showing: |
Other runners | ChatGPT, Firefly, Midjourney | These aren't bad attempts: Firefly tried hard to follow my instructions, and ChatGPT came up with an image having exactly the vibe I was looking for. I feel that if I were a better consumer of AI tools I could have given a better steer and got better results. |
I'm nervous now for Stable Diffusion: it's come bottom in both of the tests set so far. There's one more test for you to redeem yourself, Stable Diffusion!
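As an aside, one way to avoid being caught out by a character limit like the one mentioned above is to check the length of a prompt before submitting it. Here's a minimal Python sketch of that idea. The 500-character figure is simply the limit I ran into, and the function names are hypothetical: they aren't part of any Stable Diffusion API.

```python
# Hypothetical helpers: not part of any Stable Diffusion API.
PROMPT_LIMIT = 500  # the character limit reported in this blog (an assumption here)


def fits_limit(prompt: str, limit: int = PROMPT_LIMIT) -> bool:
    """Return True if the prompt is within the character limit."""
    return len(prompt) <= limit


def trim_to_limit(prompt: str, limit: int = PROMPT_LIMIT) -> str:
    """Cut the prompt at the last whole sentence that fits within the limit."""
    if len(prompt) <= limit:
        return prompt
    cut = prompt[:limit]
    last_stop = cut.rfind(".")
    # Fall back to a hard cut if there is no full stop to break on
    return cut[: last_stop + 1] if last_stop != -1 else cut
```

Trimming at a sentence boundary at least keeps each remaining instruction intact, although anything after the cut is still lost, so shortening the prompt by hand is probably the better option.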
© Wise Owl Business Solutions Ltd 2024. All Rights Reserved.