Through the ages of AI

July 13, 2024

how Midjourney changed in the last 2 years

I started using Midjourney exactly 2 years ago, mainly driven by curiosity about the technology. The technical side of how visual generative AI works is absolutely amazing to me. That’s why I, like so many others, started experimenting early on to see what these tools can do.

First off, let's look at how it works:

How Visual Generative AI Works

At its core, visual generative AI leverages advanced machine learning techniques to create images.

Learning from Examples:

The AI is trained on a massive collection of images. These images cover various subjects, styles, and scenes, helping the AI learn what different things look like.

Neural Networks:

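A classic architecture for this is the Generative Adversarial Network (GAN), which consists of two parts. Midjourney has not published its architecture, and current text-to-image tools are generally understood to rely on diffusion models rather than GANs, but a GAN illustrates the idea of learning from feedback well: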

The Generator:

This part tries to create new images based on what it has learned.

The Discriminator:

This part checks the images created by the generator against real images from the training set and provides feedback.

Improvement through Feedback:

The generator and discriminator compete in a game-like process: the generator creates images, and the discriminator tries to tell them apart from real ones. Over time, the generator gets better at making realistic images based on the feedback it receives.
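To make that feedback loop a bit more concrete, here is a minimal sketch in plain Python. It is a toy under my own assumptions, not how Midjourney or any real tool is built: the "images" are single numbers, the real data is assumed to cluster around 4, and the names train_toy_gan, a, b, w and c are made up for illustration.

```python
import math
import random


def sigmoid(x):
    # Numerically safe logistic function: maps any score into (0, 1).
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def train_toy_gan(steps=200, batch=64, lr=0.05, seed=0):
    rng = random.Random(seed)
    a, b = 1.0, 0.0  # generator (assumed form): fake = a * noise + b
    w, c = 0.0, 0.0  # discriminator (assumed form): P(real) = sigmoid(w * x + c)

    for _ in range(steps):
        # Stand-in for the training set: numbers clustered around 4.
        real = [rng.gauss(4.0, 1.0) for _ in range(batch)]
        noise = [rng.gauss(0.0, 1.0) for _ in range(batch)]
        fake = [a * z + b for z in noise]

        # Discriminator step: raise P(real) on real samples, lower it on fakes.
        grad_w = grad_c = 0.0
        for x in real:
            p = sigmoid(w * x + c)
            grad_w += -(1.0 - p) * x
            grad_c += -(1.0 - p)
        for x in fake:
            p = sigmoid(w * x + c)
            grad_w += p * x
            grad_c += p
        w -= lr * grad_w / batch
        c -= lr * grad_c / batch

        # Generator step: follow the discriminator's feedback so that
        # the fakes score as "more real" next time.
        grad_a = grad_b = 0.0
        for z in noise:
            p = sigmoid(w * (a * z + b) + c)
            grad_a += -(1.0 - p) * w * z
            grad_b += -(1.0 - p) * w
        a -= lr * grad_a / batch
        b -= lr * grad_b / batch

    return {"a": a, "b": b, "w": w, "c": c}
```

Calling train_toy_gan(steps=20) returns the four learned numbers. In the early steps, b drifts upward from 0 toward the real data because the discriminator has learned that larger values look more real. Over longer runs, even a setup this small can keep chasing its own tail instead of settling, which is one reason GANs have a reputation for being tricky to train.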

Understanding Prompts:

When you give the AI a prompt (a description of what you want), it uses language processing techniques to understand your request.

Creating the Image:

The AI combines the information from your prompt with the patterns it learned during training to generate a new image that matches your description.
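As a rough illustration of how a prompt can steer the result, the sketch below turns text into a small list of numbers and hands it to the generator together with random noise. This is a deliberately crude stand-in: real tools use learned text encoders, and the functions embed_prompt and generator_input, as well as the vector size of 8, are my own assumptions.

```python
import hashlib
import math


def embed_prompt(prompt, dim=8):
    # Toy text encoder: hash each word into one of `dim` buckets and count it.
    # Word order is ignored here; learned encoders do far better.
    vec = [0.0] * dim
    for word in prompt.lower().split():
        digest = hashlib.sha256(word.encode("utf-8")).digest()
        vec[digest[0] % dim] += 1.0
    # Scale to unit length so long and short prompts are comparable.
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def generator_input(prompt, noise):
    # Conditioning: the generator receives random noise plus the prompt vector.
    return list(noise) + embed_prompt(prompt)
```

The same prompt always produces the same vector, while the noise changes from run to run. That matches the everyday experience of getting a different image each time from an identical prompt.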

My Personal Journey with It

In the beginning, I used it primarily for two things: 1) creating images for my DnD campaign and 2) testing what it’s capable of.

1. Creating Images for My DnD Campaign:

I’m not that great at drawing myself, and this is exactly the kind of low-stakes personal use AI is great for, in my opinion. I created images of my character and designs for projects within the world of the game, such as flyers for my god and an image for a beverage label (very subtly inspired by Mountain Dew).

The image of the bird is generated by AI

2. Testing AI’s Capabilities:

I wanted to challenge the AI to see what it can and can’t do. I picked a couple of prompts to repeat with each new version. This is by no means a scientific approach. For that, I would have had to run the tests on a fixed schedule and cover many more aspects. It was for my own curiosity. But I still found it interesting and fun, which is why I’m sharing it here.

Prompt one: A shark with a pearl necklace drinking tea

(This is the full prompt; I never specified the kind of teacup, the setting, or anything else.)
