Jan 28, 2025

Covered Parking

Figure 1. Whisk-generated. Sam's General Store.

In my previous blog post, I showed some Whisk-generated images similar to the one that is shown to the right in Figure 1. In the scene depicted in Figure 1, people have to wear spacesuits in order to walk from their parked spaceships to the store. What if there was a dome covering both the store and the parking lot?

I wanted Whisk to create an image in which the entire store and the surrounding parking lots were covered by a transparent glass dome. In another blog post, Whisk had been able to make glass domes such as the one shown in Figure 2, below.

Figure 2. Whisk-generated. A glass dome.
SSS asteroid
However, when I provided Whisk with the "subject", "scene" and "style" images shown to the right, Whisk generated the image shown in Figure 3. Is she walking on the surface of an asteroid?

Figure 3. Whisk-generated. Shopping on an asteroid?
 Suspect AI. I suspected that maybe Whisk could not really "see" the "subject" image and appreciate the large parking area around the store. Here is how Gemini described the "subject" image: "The image depicts a large, sprawling general store named "Sam's General Store" situated on a celestial body, possibly a planet or moon. The store is surrounded by a vast parking lot filled with various vehicles, including cars, trucks, and even flying vehicles. The scene is filled with celestial bodies like planets and asteroids, suggesting an interplanetary or interstellar setting. The overall style of the image is reminiscent of retro-futuristic artwork with a touch of humor."

A colder asteroid?
Generated by Mr. Wombo.
Here is the description that was generated by Whisk: "A long, rectangular building sits on a flat, light-grey surface, surrounded by numerous vehicles.  The building has a grey roof and dark windows. Two tall, thin towers rise from either side of the building. The vehicles are predominantly white or light-grey, with a few yellow ones visible. They appear to be futuristic in design, with rounded shapes and sleek lines. The surface the building and vehicles rest on is surrounded by a dark, seemingly rocky edge, which is itself surrounded by a dark void speckled with stars. Several large, dark asteroids are visible in the void, along with a few smaller, lighter-colored spacecraft.  The words "SAM'S GENERAL STORE" are written in large, light-blue capital letters at the top of the image.  A faint, light-colored band stretches across the bottom of the image, curving around the dark void." When I asked Whisk to cover the store with a glass dome, Whisk generated Figure 4.

Figure 4. Whisk-generated general store under a glass dome.

Figure 5. Whisk-generated general store on an asteroid.

Figure 6. Whisk-generated general store on an asteroid with some covered parking.
 
Figure 7. Whisk-generated general store on an asteroid with a covered store.
SSS dome "subject"

Whisk really likes to get up close to the store and shrink the size of the parking area (see Figure 5). I was only able to get Whisk to cover part of the store (see Figure 7, above) or part of the parking lot (see Figure 6). I asked for spaceships to be parked near the store, but Whisk often found it difficult not to generate vehicles that look like cars (Figure 5).

I changed the "subject" and "scene" images to those that are shown to the left. Whisk now generated images such as the one in Figure 8, below. It does not look like Whisk was concerned to provide spacesuits to any of the shoppers in the parking lot.

Figure 8. Whisk-generated. Who needs air?
Ask Gemini. I had a discussion with Gemini in which I tried to gain insight as to why it is so hard for Whisk to put a dome over both the store and the parking spaces. I showed Gemini Figure 1 from this blog page. Gemini generated the text description of the image that is shown above on this page.

Figure 9. Generated by Gemini, who claimed that in this image: "A large translucent glass dome covers the entire store and the adjacent parking areas."
 Me: "Gemini, imagine that Sam's General Store is located on an asteroid in deep space. The parking area around the store is full of futuristic spacecraft. Gemini, please generate a similar image in which a large translucent glass dome covers the entire store and the adjacent parking areas."

Gemini generated the image shown in Figure 9.

 Me: "Gemini, in that image that you generated, the glass dome does not cover the entire store and the adjacent parking areas. In fact it looks like the dome might be in the distance behind the store. I want an image in which the arching structure of a transparent glass dome covers the entire general store building and also the nearby parking area." {I uploaded the image that is shown in Figure 2, above.} "Gemini, can you see the glass dome in the image that I just uploaded? Please describe the image that I just uploaded."

Generated by Wombo Dream.
 Gemini: "Sure, here is an image of a large, sprawling general store named "Sam's General Store" situated on a celestial body, possibly a planet or moon. The store is surrounded by a vast parking lot filled with various vehicles, including cars, trucks, and even flying vehicles. The scene is filled with celestial bodies like planets and asteroids, suggesting an interplanetary or interstellar setting. The overall style of the image is reminiscent of retro-futuristic artwork with a touch of humor. A large translucent glass dome covers the entire store and the adjacent parking areas:  The image you uploaded shows a large, sprawling general store named "Sam's General Store" situated on a celestial body, possibly a planet or moon. The store is surrounded by a vast parking lot filled with various vehicles, including cars, trucks, and even flying vehicles. The scene is filled with celestial bodies like planets and asteroids, suggesting an interplanetary or interstellar setting. The overall style of the image is reminiscent of retro-futuristic artwork with a touch of humor. A large translucent glass dome covers the entire store and the adjacent parking areas. However, the dome appears to be in the distance behind the store, rather than covering the entire store and parking area."

Generated by Mr. Wombo.

 Me: "Gemini, do you realize that I have uploaded two different images during this current conversation?"

 Gemini: "Yes, I do.

You first uploaded an image of a retro-futuristic illustration of a general store on a celestial body, surrounded by spaceships.

Then, you uploaded a different image of a futuristic city with a large glass dome covering it.

I have responded to both images and tried to fulfill your request for an image with a dome covering the store and parking lot.

Is there anything else I can help you with regarding these images or anything else?"

Figure 10. Generated by Gemini.
 Me: "The image that you generated during this conversation does NOT show a dome covering the entire store and the surrounding parking spaces. I wonder why Imagen 3 has so much trouble generating an image in which the dome covers the entire store and also the parking spaces."

 Gemini: "Sure, here you go!" {At this point, Gemini generated the image that is shown in Figure 10.} 

 Me: "The edge of the dome should extend almost to the bottom of the rendered image. The parked space craft and the store should be visible through the transparent glass of the dome. In the image that you just generated, the dome fails to cover most of the parked spaceships."

Figure 11. Generated by Gemini.
 Gemini: "Sure, here it is!" {At this point, Gemini generated the image that is shown in Figure 11.}

It was not clear in Figure 10 that the dome extended past the front of the store. In Figure 11 it is clear that the dome covers the entire store and spaceships that are parked to the sides of the store and in front of the store.

I then used the image from Figure 11 as a reference image for Whisk, which allowed Whisk to generate the image shown in Figure 12. Whisk was still struggling to not include cars and instead have spaceships parked near the store.

Figure 12. Whisk-generated glass dome over the store and part of the parking area.
Figure 13. A larger dome?

When I complained that I wanted a larger dome covering a larger store and larger parking area, Gemini generated the image shown in Figure 13.

SSS red Sam's
Going through another iteration, I took the Whisk-generated image from Figure 12 and used it as a new "subject" image (see the image to the left). With these new inputs, Whisk generated images such as the one shown in Figure 14.
Figure 14. Whisk-generated small store. 18 parked spaceships.

 Exits. I wanted there to be an obvious way for spaceships to go into and out of the dome, so I included in the image description, "There are two openings in the glass dome that allow spaceships to enter and exit the dome." Whisk then generated images like the one shown in Figure 15. I wanted the store and the parking area to be larger, so I specified, "The dome is ten kilometers across and covers nine hundred parked spaceships." Whisk then made a slightly larger parking area as shown in Figure 16, below. Still not satisfied with the size of the glass dome, I tried specifying, "The dome is twenty kilometers across and covers nine thousand parked spaceships," but Whisk seemed unwilling to make a significantly larger parking area (see Figure 17).

Figure 15. Lower right; an opening for spaceships to enter the dome. 19 parked spaceships.

Figure 16. A larger dome with 50 parked spaceships.

Figure 17. Sixty parked spaceships.
Here is my final text prompt: "A photorealistic rendering of science fiction setting featuring "Sam's General Store" located on an large asteroid in deep space. An extremely large translucent glass dome covers the entire general store and the all of adjacent vast parking areas. The store is depicted as a very large, low-slung building with a dark roof and several antenna or communication towers protruding from it. The wide parking areas that surround the store are filled with many futuristic spacecraft, including a few yellow and red vehicles that stand out among the predominantly white and silver ones, all rendered in great detail and in a hyperrealistic fashion. The glass dome is twenty kilometers across and covers the entire general store and a hundred thousand parked spaceships that are in the vast parking areas around the store. The parked spacecraft can be seen through the clear glass of the covering dome.

 The scene is set against a dark space background filled with stars, and there are multiple asteroids visible in the background. Some small spacecraft can be seen flying in the space around the store. The entire image has a distinctive sci-fi aesthetic that reflects a vision of a future with advanced technologies, with a futuristic general store in an outer space setting. The seemingly endless rows of parked spaceships surrounding the store are rendered in great detail and white photorealistic quality. All of the parked spaceships fit in the protected space under the glass dome. The edge of the glass dome extends to the bottom of the rendered image. There are two sophisticated air-lock openings that are shaped like arches located in the sides of the glass dome that look like short tunnels and allow spaceships to enter and exit the dome without allowing the air inside the dome to escape into outer space. A flying spaceship can be seen passing through the air-lock opening in the wall of the dome." Whisk finally generated the image that is shown in Figure 18. I did some manual modifications of the image in Figure 18 so as to enhance the appearance of the "tunnels" that penetrate the dome.
Figure 18. Too many spaceships to count.
Generated by Leonardo.

I tried using Leonardo and got images such as the one shown in Figure 19 (below) using their "legacy mode". I started with their "new" image generating mode and the first Leonardo-generated image that I got is shown in Figure 20. Figure 20 was using their default setting for how much weight was given to my reference image. For Figure 21, I increased the weight given to my reference image. As seen in Figure 21, Leonardo made all of the same mistakes as did Whisk, initially failing to make a glass dome that would cover all of the parking area.

As shown in the image to the right, Leonardo sometimes seemed to ignore the idea that it should generate images on the surface of an asteroid. Also, Leonardo seems to prefer circular buildings rather than rectangular buildings.

Figure 19. Generated by "Classic" Leonardo.

Figure 20. Generated by Leonardo "Flux". Less weight to my reference image.

Figure 21. Generated by Leonardo "Flux". More weight to my reference image.


Jan 26, 2025

Sam's General Store

Figure 1. Image originally generated by Gemini, but then enhanced by Mr. Wombo.

 In my first blog post of 2025, I made some comments about a May 1954 short story called "Collision Orbit" that was published by Katherine MacLean in the magazine Science Fiction Adventures. In her story, a guy named Sam ran a store located in the asteroid belt.

At that time (early January), I had Gemini generate an image depicting a retail store located on an asteroid. Today I updated that Gemini-generated image with some help from WOMBO Dream so as to add more detail to the spaceships parked at Sam's General Store (see Figure 1). The text "Sam's General Store" was added by me.

Figure 2. Jax's spaceship.
I told Gemini: "Invent the plot outline for a science fiction story with the title 'Sam's General Store' that could accompany the image that I just uploaded. What types of technological marvels might be on sale in the year 3025?" {I uploaded to Gemini a copy of the image that is shown in Figure 1.}

Gemini: "Here's a possible plot outline for a science fiction story titled 'Sam's General Store,' inspired by the image you provided:

 Image Description: A large, retro-style general store sits on a flat, desolate moon. The store is surrounded by a ring of asteroids and space debris, with various futuristic vehicles parked around it.
 Logline: In the year 3025, a lone traveler stumbles upon Sam's General Store, a relic of a bygone era, where he discovers not just strange merchandise, but also a forgotten past.
 Plot Outline:

Figure 2a. Another spaceship created by Gemini.
 1) The Arrival. The story opens with a lone traveler, Jax, navigating a treacherous asteroid field in his sleek, personal spacecraft. His navigation system fails, and he's forced to make an emergency landing on a nearby moon.
 2) Discovery. As Jax surveys the barren landscape, he spots a peculiar sight: a large, retro-style general store, complete with a weathered sign reading "Sam's General Store." Intrigued, he approaches the store.
 3) Step Inside. Inside, Jax finds a bizarre assortment of goods. There are antique gadgets alongside futuristic tech, vintage clothing next to nanotech enhancements, and even a dusty old record player playing space-age jazz.
 4) The Storekeeper. Jax encounters Sam, the storekeeper, a seemingly ageless man with a twinkle in his eye. Sam explains that the store is a time capsule, a remnant of a lost era when people valued craftsmanship and personal connection over instant gratification.
 5) A Hidden Past. As Jax explores the store, he discovers hidden compartments and secret rooms, revealing a history of intergalactic travelers and their encounters with Sam's unique establishment.
 6) A Lesson Learned. Through his interactions with Sam and the store's strange inventory, Jax learns about the importance of preserving history, appreciating craftsmanship, and valuing human connection in a hyper-technological future.
 7) The Departure. Jax's ship is repaired, and he prepares to leave. Sam gifts him a small, antique item – a pocket watch that tells time in multiple dimensions. Jax promises to spread the word about Sam's General Store, ensuring its legacy lives on.

Sam
 Technological Marvels in 3025 at Sam's General Store:
 Self-repairing Nanotech Clothing: Clothes that adapt to any environment and heal minor damage.
 Holographic Entertainment Units: Portable devices that project immersive 3D experiences.
 Bio-engineered Food Replicators: Machines that create personalized meals from basic nutrients.
 Sentient AI Companions: Small, intelligent robots that provide companionship and assistance.
  Anti-gravity Boots that allow the wearer to defy gravity and walk on ceilings.
  Time-Travel Devices: Experimental prototypes that allow limited time travel.
 Mind-Reading Helmets: Devices that allow users to read and influence the thoughts of others.

 Themes:
  i) Nostalgia and the past
  ii) The value of human connection in a technological age
  iii) The importance of preserving history and craftsmanship
  iv) The dangers of unchecked technological advancement
  v) The search for meaning and purpose in a vast universe
Figure 3. Whisk-generated storyboard #1 for Jax's spaceship at Sam's.

 This plot outline is just a starting point, and there are many ways to expand upon it. You can add twists, subplots, and deeper character development to create a unique and engaging story. Let me know if you'd like me to elaborate on any specific aspect of the plot or the technological marvels."

Figure 4. Whisk-generated storyboard #2 for Jax's spaceship at Sam's.

SSS Sam's ship
 Me: "Gemini, generate an image depicting an exterior view of the personal spaceship owned by Jax."

 Gemini: "Sure, here is an image depicting an exterior view of the personal spaceship owned by Jax:" {Gemini generated the image that is shown in Figure 2.}

I used the spaceship from Figure 2 as a Whisk subject image and the image that is shown in Figure 1 as the Whisk "scene" (the Whisk subject, scene and style are shown in the image to the right). Using those reference images, Whisk generated the "storyboard" shown in Figure 3.

I then edited the Whisk-generated text description of the image in Figure 3 and then Whisk generated the more realistic "storyboard" shown in Figure 4.

Me: "Gemini, provide a technical description of exactly how the navigation system of Jax's spaceship failed."

 Gemini: "Sure, here is a technical description of the navigation system failure of Jax's spaceship, as depicted in the image:

{At this point Gemini generated the spaceship shown in Figure 2a, above. http://googleusercontent.com/image_generation_content/3}

 
Jax's Spaceship Navigation System Failure
In the year 3025, Jax's spaceship, a sleek, retro-futuristic craft with a bubble cockpit, was equipped with a state-of-the-art navigation system. However, during his journey through a treacherous asteroid field, the system experienced a catastrophic failure.

Jax
 Technical Analysis:

Upon closer inspection, it was revealed that the navigation system's failure was due to a combination of factors:

1. Asteroid Impact. A stray asteroid, undetected by the system's sensors, struck the ship's hull, damaging the navigation array. This impact caused a cascade of errors, rendering the system inoperable.

2.  Software Glitch. A rare software glitch, triggered by the impact, further compromised the system's functionality. The glitch caused the system to misinterpret data, leading to inaccurate readings and erroneous calculations.

3.  Power Surge. The impact also caused a power surge, overloading the system's components. This surge damaged critical circuits, preventing the system from rebooting or recalibrating.


SSS Sam & Jax
 
Consequences:
As a result of the navigation system failure, Jax was left stranded in the middle of the asteroid field. Without the system's guidance, he was unable to determine his location or plot a course back to safety. He was forced to rely on his piloting skills and visual cues to navigate through the debris field, increasing the risk of further damage to his ship. Ultimately, Jax's resourcefulness and quick thinking allowed him to overcome the navigation system failure and reach Sam's General Store, where he sought refuge and assistance."

 Me: "Gemini, let us continue to develop the story with these two postulates in mind: postulate 1) that the setting of "Sam's General Store" is a distant star system and postulate 2) assume that Sam and Jax are aliens from distant exoplanets. Sam and Jax are humanoid aliens that have the same basic mammalian body plan as do humans from Earth, but they have different skin colors and differently shaped eyes and ears compared to humans and are "hexapodal", with six limbs. Sam is a male alien creature who has green skin and very large yellow glowing eyes. Sam has four legs and two arms and can be depicted like a mythical centaur. Jax is a female alien creature who has purple skin, long pointed ears and vestigial wings on her back in addition to two legs and two arms. Jax can be depicted as a bipedal bat with two small arms. I wonder how a humanoid bat-like creature would communicate. Gemini, suggest details for the spoken language of Jax." {At this point, I had ImageFX generate two images: 1) an image of a humanoid robot and 2) an image depicting Sam and Jax. Using those reference images (shown to the right), Whisk made Figure 5.}

Figure 5. Whisk-generated storyboard Sam & Jax #1.

The alien android.
Apparently, Whisk had not noticed that in the reference image for the Sam "subject", Sam was depicted as having six limbs and four legs. I modified the storyboard description to include the idea that Sam be depicted as a "green, Centaur-like alien with yellow eyes". Whisk then generated the new version of this storyboard that is shown in Figure 6. I then used Mr. Wombo to alter the appearance of Sam and Jax to create the edited version of the storyboard shown in Figure 7.

Having seen how easily Whisk depicted a transparent "holographic" display, I decided to try to get Whisk to make the white android transparent, but I could not find the "magic words" needed to make Whisk depict her as being a transparent element in this storyboard image. The image shown to the right was generated by WOMBO Dream using a Whisk-generated image as a reference.

Figure 6. Whisk-generated storyboard Sam & Jax #2.

Figure 7. Whisk-generated storyboard Sam & Jax #2a. Edited by Mr. Wombo.
Figure 8. Transparent robot (PhotoShop).

SSS translucent
It was highly frustrating to see Whisk so easily generate the transparent green "holographic" display while always refusing to generate a transparent robot. I gave up trying to get Whisk to make the robot transparent while using a "subject" image that showed the robot as opaque. Instead, I used PhotoShop to make a 50% transparent android as shown in the "scene" image in Figure 8.
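
For anyone curious what "50% transparent" means at the pixel level, here is a minimal sketch, assuming plain alpha compositing of a foreground layer over a background layer. This is only an illustration of the idea and not a description of what PhotoShop does internally; the function names and the sample colors are hypothetical.

```python
def blend_pixel(fg, bg, opacity=0.5):
    """Composite one RGB foreground pixel over one RGB background pixel.

    opacity=1.0 keeps only the foreground, opacity=0.0 keeps only the
    background, and opacity=0.5 mixes the two equally.
    """
    return tuple(round(opacity * f + (1 - opacity) * b) for f, b in zip(fg, bg))


def blend_image(fg_rows, bg_rows, opacity=0.5):
    """Apply blend_pixel to two same-sized images stored as lists of rows."""
    return [
        [blend_pixel(f, b, opacity) for f, b in zip(fg_row, bg_row)]
        for fg_row, bg_row in zip(fg_rows, bg_rows)
    ]


# Hypothetical colors: a white android pixel over a bluish shop background.
white_android = (255, 255, 255)
shop_background = (55, 95, 135)
print(blend_pixel(white_android, shop_background))
```

With an opacity of 0.5, each color channel of the result is simply the average of the android's channel and the background's channel, which is why the shop remains visible "through" the android.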

Along with that PhotoShop-generated transparent robot "scene" image, I provided Whisk with another "subject", a kind of human-alien hybrid (see the top panel in the image to the left). With those reference images, Whisk generated the image shown in Figure 9. Once again, Whisk failed to recognize that Sam has four legs and failed to make the robot transparent. I then provided Whisk with a storyboard description that explicitly requested the robot be depicted as transparent and then Whisk made Figure 10.

Figure 9. Transparent robot storyboard #1. A human with long pointed ears.

Figure 10. Transparent robot storyboard #2. Jax with green wings.
Generated by Mr. Wombo.
 There are two ways to provide text prompts to Whisk: 1) in the text description of the storyboard and 2) by using another text entry field just below the storyboards that is labeled "add additional details". When I used the "add additional details" text field to also instruct Whisk that the robot should be depicted as being transparent, then I got Whisk-generated storyboard images such as the one in Figure 11. The Whisk software did finally generate a transparent figure, but it had a very odd mixture of features including pointed ears. At the same time, Whisk ignored other instructions such as those telling it to make the scene look like a futuristic electronics shop, give Jax wings and make Sam have four legs.

When I told Gemini that Whisk would not depict the robot as being transparent, Gemini suggested that I try using language such as "The android is rendered like a ghostly apparition", but doing so sometimes resulted in Whisk-generated storyboard images such as the one in Figure 12.

Figure 11. Transparent robot storyboard #3. Jax with horns and Sam with a tail.

Figure 12. Transparent robot storyboard #4. Two holographic displays.
SSS transparent human and transparent robot.
I then used part of one of these new Whisk-generated storyboard images similar to the one in Figure 11 as a new "subject" image in combination with the same "scene" image that had been made with PhotoShop (see the image to the left). With these new inputs to Whisk, Whisk now generated images such as the one shown in Figure 13, below.

As seen in Figure 13, Whisk was still quite poor at consistently making a transparent robot and could not follow instructions to keep the wings on Jax and provide Sam with four legs. It seems that Whisk is easily confused if there is too much going on in a requested "storyboard". 

Figure 13. Transparent robot storyboard #5. Sam with wings.

Generated by Mr. Wombo.
 One of the best of the transparent humanoid figures that Whisk generated is shown in Figure 14. However, in the "storyboard" image that is shown in Figure 14, Whisk seemed to only use the provided "subject" image and seemed to completely forget about the "scene" image. Figure 14 is one of the rare "storyboard" images that Whisk generated with a transparent humanoid figure in a "futuristic shop that is full of technologically advanced gadgets", as requested in my text prompt.

Back in 2023, one of the first experiments that I performed with Mr. Wombo was in finding ways to get an AI to make transparent objects. Last year, I made this image with three transparent human figures. For the image to the right, I simply asked Mr. Wombo to make "a transparent glass statue".

Figure 14. Transparent robot storyboard #6. Simplified; Sam and Jax are gone.

SSS final

A Whisk-generated robot.
When I simplified everything and provided Whisk with only a reference image like the one in Figure 14, then Whisk was able to make transparent robots like the one shown to the right.

I tried one more iterative cycle in which I provided two "subject" images, one using part of Figure 7 (see the image to the left). Using that new combination of images, Whisk generated the kind of "storyboard" image that is shown below in Figure 15.

Jax
The image in Figure 15 was altered by using Mr. Wombo to make adjustments to the Whisk-generated faces of Jax and the robot. In the original Whisk-generated "storyboard" image, Whisk had given Jax horns, small eyes and a black top to wear (see the image to the right).

robot
In the original Whisk-generated "storyboard" image, Whisk had made the robot's hair less transparent than the rest of the robot's body (see the image to the right).

Figure 15. Transparent robot storyboard #7. Sam and Jax return.

Next: putting a glass dome over Sam's store.
