Creating Whimsical, Improbable, and Impossible Images with AI
Keeping It Surreal with ChatGPT 4 and Dall-E
You might have noticed, based on the sudden appearance of AI-generated images here at the Freedom Scale, that I recently got off the waiting list for ChatGPT 4 and Dall-E.
As the managing editor of an online news magazine for ten years, I became very sensitive to the issue of image copyrights. You might get away with using a copyrighted image, but you might not. And it’s definitely not worth the hassle when you run afoul of an image owner.
Unfortunately, finding good images that you are sure are copyright-free—and that match the topic of an article—is very challenging. Now, Dall-E is making my life so much easier! The only limitation is my imagination, and whatever limitations Dall-E might have.
Does Dall-E have limitations? I suppose it does. It has difficulty making requested changes to an image it has already given you. (I just creates a new base image most of the time.) And by now, many of you will have heard (or noticed) that it sometimes gives people an extra finger or two hands sticking out of a single sleeve. But other than that, its abilities are simply stunning.
Just ask it to draw a picture—any picture—and it will make a picture based on (its interpretation of) your description. It’s a ton of fun, and easy to lose track of time— asking it to draw image after image just to see what it comes up with.
Last night, I went down that rabbit hole for a while, and I want to share the results with you. These will be in order, with the exact wording I used for each prompt, followed by the image and text Dall-E provided in reply.
At first, my theme was “things that probably didn’t happen, but it’s fun to pretend that they did.”
I had just asked GPT 4 to tell me about pre-ice age North American megafauna, so, that was in my head when I made my first prompt:
Please draw a caveman riding an Elasmotherium.
Here's the image of a caveman riding an Elasmotherium, set in a primitive landscape.
As an aside, I said “please” because I wanted to see if politeness changed GPT 4’s demeanor. I then said, “Ha, that's a great picture!” to which GPT replied, “I'm glad you liked it! If you have any more requests or need another image, feel free to ask. Enjoy your prehistoric adventure!” Very interesting.
Next, I tried another improbable prompt:
Marc Antony, Caesar, and Cleopatra surfing a big wave in Hawaii
Here's the image of Marc Antony, Julius Caesar, and Cleopatra surfing a big wave in Hawaii, depicted in an eclectic mix of traditional attire and contemporary surfing gear.
I am not sure why it added a manservant (or whoever that is) for Cleopatra. I didn’t ask for that. But that is one of the many questions about Dall-E—why does it do what it does? (As you can see, he has at least six fingers on that left hand. So weird.) Weirder still are the creatures on the surfboards in the background. Dali would have been pleased (or deeply horrified) by his artificial-intelligence namesake.
Next, I became curious how it would handle more bizarre requests, so I wrote…
A tarsier laughing at a funny cloud
Here's the image of a tarsier laughing at a funny-shaped cloud in the sky. The tarsier is depicted with a joyful and amused expression, set in a natural green environment.
Okay, that was cute. And weird—did you notice that the reflection in the cloud’s eyes looks more like a farm field than the tropical paradise where the tarsier sits?
I decided to ramp up the surreality of my requests. From here, I did not give myself time to think—I just spat out surreal thoughts as soon as they came to me.
A greater bushbaby drinking a cloud
Here is the whimsical image of a greater bushbaby drinking from a cloud. The scene captures the bushbaby in a playful and curious pose, set against a dreamy and serene natural background.
Heh. Strange and cute. I had given it a weird assignment, and it handled it comparatively well. Next…
A 13th century sea captain discovering a sense of smell for the first time
Here's the image of a 13th-century sea captain discovering a sense of smell for the first time. The captain, in traditional seafaring attire, is on the deck of his ship, expressing wonder and amazement at this new sensory experience.
Not as surreal as I expected, but that’s part of the fun. And look at how happy that sea captain is!
An underground crystalline structure communicating with a distant star
Here's the image of an underground crystalline structure communicating with a distant star. The scene depicts a cavern filled with luminous, multi-colored crystals, creating a magical and otherworldly atmosphere.
I had no image in my head when I gave it that prompt—all I had were the words. Dall-E brought the idea to life brilliantly.
I know it is drawing on the accumulation of human culture, knowledge, and expression, but there seems to be more of that—some sort of synergism that makes Dall-E a player in the equation…with a “mind” of its own. We’re in uncharted waters here—waters that, prior to this point, have been imagined by sci-fi writers, but never actually sailed.
Next, I take things up a notch. How will Dall-E handle even greater levels of abstraction? To wit, my next prompt was simply…
Rigid conformity to an impossible ethos
Here's an abstract and symbolic representation of rigid conformity to an impossible ethos. The image features a series of identical figures striving to reach an abstract, floating symbol, representing an unattainable ideal. The tone is surreal and somewhat dystopian, highlighting the struggle and futility of adhering to an unrealistic standard.
Okay, that was impressive.
Again, I had no image in my head—just words that seemed very abstract to me. Dall-E made them visual.
The next one made me laugh aloud.
A marshmallow becomes self-aware
Here's a whimsical image of a marshmallow becoming self-aware. The marshmallow is anthropomorphized, expressing surprise and curiosity, as if it has just awakened to its own existence. The setting is playful and light-hearted.
So cute!
Next, I decide to see how Dall-E handles two classic examples of ‘impossible’ images: “a horse riding astride its own back” and “a square circle.”
A horse riding astride its own back
Here's a surreal and whimsical image of a horse riding astride its own back. The horse is depicted in a playful, impossible pose, creating a fantastical and humorous scene.
Not so impossible after all, apparently!
A square circle
Here's a surreal and abstract representation of a square circle, merging the features of a square and a circle in an impossible way. This geometric anomaly challenges conventional perceptions of space and form.
Very interesting. Say what you will about the result, but at least it tried. It didn’t refuse. Smoke didn’t start belching from my computer the way it does when Captain Kirk poses an unsolvable problem. Fascinating.
All of this left me thinking about the infinite depths that Dall-E appears to be plumbing. That made me think of fractals, which led me to one final prompt:
A fractal cat begging for his supper
Here's a whimsical and surreal image of a fractal cat begging for its supper. The cat is composed of intricate fractal patterns, capturing the blend of mathematical art with the charm of a cat.
I am absolutely blown away by this technology, to the point where I am at a bit of a loss for words.
And as you know, that doesn’t happen often!
Hahaha. I have one in the works too, a post with many AI images. I was trying to take it EASY with them because I have so many and they are so beautiful. I have learned things like Moroccan tile with Rococo details. That makes some beautiful stuff. I will add curves of porcelain, curves of mist, curves of bamboo, curved of polished redwood; stuff like that too.
Your little bush baby drinking a cloud is SO CUTE! The pirate smelling air for the first time made me laugh, just that description. Haha. Also, the abstract of "Rigid Conformity in an Impossible Ethos" was extremely well done. Both the idea and the image are interesting. The square circle? I would have never thought of that one, for sure. It's interesting to engage with, isn't it? I also like the words "luminescent" and "ethereal" at the end. If you say celestial, you will probably get wings somewhere. And if I want a woman with clothes on, I tend to say in a high collared, lace dress. She has been covered in mycelium and falling through the cloudy sky with a portal nearby made of Moroccan tile with rococo details. (e.g. details are in green and gold). It's a lot of fun. Falling is more fun than flying, because of how AI perceives it. That's how I got one of the cathedral windows in the clouds that I made. I said the window was falling through a cloudy sky. Fun stuff!
I see things in between places, lights and shapes and so forth. (in my normal life, in the air - no I am not on shrooms) I feel like that has gotten stronger since I started using AI image creator. It could just be a coincidental timing as well. I am in a serious growth pattern. It could also be the VERY, VERY-careful-and-I-do-not-recommend-it-at-all-sungazing that I have been doing. (That was one word, I hope I didn't hurt your editor's heart). :)
I feel the sun and the earth giving and taking every day when I sit outside. I believe it's even more inspired than AI. I highly recommend earthing. I feel the difference between doing it and not. So, it could be a mix of these things. Anyway, The horse riding itself is HILARIOUS. I have some INCREDIBLE horse images by the way :) I will need a new external hard drive very soon. *note to self.
I have to make myself not use it and do my homework or my work.
It's a new draw, for sure. Plus, I am relearning Illustrator right now. With those two things in hand, whoa, Nelly.
This was fun :)
Nooo, why did it give Cleopatra a tail?!
Interestingly, the fractal spiral cat looks like a modern version of Louis Wain's work.