Every Midjourney Feature Explained in 9 Minutes

Future Tech Pilot
16 May 202308:25

TLDRThis video script explains the features of Midjourney, an AI art generation tool. It covers the process of creating images in public channels, using various algorithms like versions 4, 5, and niji styles. Users can customize their creations with parameters like aspect ratio, chaos value, and quality. The script also introduces commands like 're-roll', 'upscale', and 'remix' for post-generation adjustments. Additionally, it highlights advanced features such as negative prompts, multi-prompts, and image blending, offering a comprehensive guide to unleashing one's creativity with Midjourney.

Takeaways

  • 📨 Reacting with an envelope emoji to a generated picture in public channels allows private generation via direct message.
  • 🖼️ Use '/imagine' in Discord to write your prompt and generate images, like 'cyborg dinosaur' for a demonstration.
  • 🖋️ Different algorithms are available: Version 1, 2, 3, and 4 (with sub-styles A, B, C, and Cursed), and Version 5, 5.1, and 5.1 Raw.
  • 🎨 Niji Journey offers anime-specific algorithms with versions and styles such as Expressive, Cute, and Scenic.
  • 🔄 Options include re-roll, upscale, and variations to generate different images based on one prompt.
  • 🔧 Remix allows small adjustments to images; older versions offer different upscalers and the remaster button.
  • 🔢 Parameters enhance prompts: aspect ratio (--AR), chaos (--chaos), stylized value (--stylize), quality (--q), and seed number.
  • ⏹️ Stop parameter controls generation percentage, and video prompts create seamless textures with '--tile'.
  • ✏️ Multi-prompt and negative prompts refine images further, using '--repeat' for multiple attempts and squiggly brackets for permutations.
  • 🖇️ Image prompts combine a picture URL with words; older versions include '--test' and '--HD' parameters for specific enhancements.
  • 📋 Use '/prefer' for prompt shortcuts, '/describe' for image-based prompts, and '/show' to recall previous generations.
  • 🌀 '/blend' merges up to five images; '/info' provides profile details including remaining GPU fast time.
  • ⏱️ Fast and relaxed generation modes available based on plan, with '/settings' for customization and '/help' for basic commands.

Q & A

  • What is Midjourney and how does it update its features?

    -Midjourney is a dynamic platform that is frequently updated. It offers various features for image generation, which can be accessed and utilized in different ways as explained in the script.

  • How can I generate a picture privately using Midjourney?

    -To generate a picture privately, you can react to a generated picture in a public channel with an envelope emoji, and Midjourney will then send you a direct message to generate in private.

  • What is the command to start generating an image on Discord?

    -The command to start generating an image on Discord is 'forward slash imagine', followed by the prompt you want to use for the image generation.

  • What does the 'cyborg dinosaur' prompt demonstrate about Midjourney?

    -The 'cyborg dinosaur' prompt is used to showcase the capabilities of Midjourney, especially how far it has come in generating high-quality and varied images of complex subjects.

  • How many versions of algorithms does Midjourney offer for image generation?

    -Midjourney offers multiple versions of algorithms for image generation, including version one up to version five, with additional styles within version four and different styles in niji Journey.

  • What is the 're-roll' option in Midjourney and what does it do?

    -The 're-roll' option in Midjourney allows you to run the same prompt again, generating a new set of images based on the original prompt.

  • What does the 'upscale' feature in Midjourney do?

    -The 'upscale' feature in Midjourney generates a bigger picture of one of the images from your generation, providing a higher resolution image.

  • How can I make small adjustments to one of my pictures in Midjourney?

    -You can make small adjustments to one of your pictures by typing 'forward slash remix' and hitting enter, which turns the variation button into the remix button.

  • What parameters can be added to a prompt in Midjourney to influence image generation?

    -Parameters that can be added to a prompt in Midjourney include the aspect ratio (-AR), chaos value (-chaos or -C), stylized value (-stylize), quality parameter (-q), and a seed number for generating images.

  • What is the 'stop' parameter used for in Midjourney?

    -The 'stop' parameter in Midjourney allows you to stop the generation process at a specified percentage, giving you control over the progress of the image generation.

  • How can I create a seamless texture using Midjourney?

    -You can create a seamless texture by including 'dash dash tile' in your prompt, which will allow the generated images to be repeated without any visible seams.

  • What is the 'multi-prompt' feature and how does it work?

    -The 'multi-prompt' feature allows you to include multiple parts in your prompt, separated by two colons. Midjourney will read each part separately, allowing for more complex and nuanced image generation.

  • How can I save time by using prompt shortcuts in Midjourney?

    -You can save time by creating prompt shortcuts using the 'forward slash prefer' and 'option set' commands. This allows you to define commonly used terms or parameters that can be quickly inserted into your prompts.

  • What does the 'forward slash blend' command do in Midjourney?

    -The 'forward slash blend' command in Midjourney allows you to blend up to five images together, creating a composite image from the selected ones.

  • How can I access additional help and documentation in Midjourney?

    -You can access additional help and documentation by using commands like 'forward slash FAQ' for quick access to threads with information, 'forward slash docs' to generate a link to topics covered in the user guide, and 'forward slash help' for a rundown of basic functionalities.

Outlines

00:00

🔄 Mid-Journey Updates and Basic Features

Mid-Journey frequently updates, making some information quickly outdated. However, here's a comprehensive overview of its features. When you generate a picture in a public channel, you can react with an envelope emoji to get a direct message for private generation. Use the command `/imagine` to input prompts. Demonstrating the evolution, 'cyborg dinosaur' prompts showcase the improvements across versions 1 to 5.1, including various styles like 4A, 4B, 4C, and the secret 'cursed' style. Additionally, Niji Journey offers anime-focused algorithms with expressive, cute, and scenic styles. Options after generation include re-rolling, upscaling, and creating variations. The remix feature allows slight adjustments, and different upscalers are available depending on the version.

05:01

🎨 Mid-Journey Parameters and Advanced Features

Mid-Journey offers various parameters to customize your prompts. The aspect ratio can be adjusted with `--AR` followed by ratios like 1:2, 2:3, etc. Chaos values (`--chaos`) range from 0 to 100, affecting image variety. Stylized values (`--stylize`) control creativity, with higher values yielding more artistic images. Quality (`--q`) affects rendering time and cost. Seeds (`--seed`) provide consistent starting points for images. The stop parameter (`--stop`) halts generation at specific completion stages. Older versions support video (`--video`) and tile (`--tile`) parameters for seamless textures. Multi-prompts and weights adjust prompt importance. The `--repeat` parameter runs multiple attempts, while squiggly brackets create permutations. Image prompts combine pictures with text, and negative prompts (`--no`) exclude unwanted elements.

Mindmap

Keywords

💡Mid-journey

Mid-journey refers to the ongoing development and updates of a system or process, in this case, an AI-driven image generation tool. The term is used to convey that while the tool is constantly evolving, the features explained in the video may not remain relevant for long due to these updates. In the script, it is mentioned that the mid-journey features are subject to change, emphasizing the dynamic nature of the technology.

💡Public Channels

Public channels are shared spaces within a platform, such as Discord, where users can interact and communicate openly. In the context of the video, public channels are used for generating images that can be reacted to with an envelope emoji, which triggers a private message from the AI tool for further interaction.

💡Prompt

A prompt, in the context of AI image generation, is a textual description or request that guides the AI to create a specific image. The script uses 'cyborg dinosaur' as an example prompt to demonstrate the capabilities of the mid-journey tool, showing how it can interpret and generate images based on the given description.

💡Algorithms

Algorithms are the set of rules or processes that the AI follows to generate images. The video script mentions different versions of algorithms (version one through five, and 5.1 style raw), indicating the progression and improvements in the AI's ability to create images. Each version represents a step towards better image generation capabilities.

💡Niji Journey

Niji Journey is a specific mode or algorithm within the mid-journey tool that is trained on anime, suggesting that it generates images with a style or aesthetic inspired by anime. The script mentions different versions of Niji Journey and additional styles within it, such as expressive, cute, and scenic, to illustrate the variety of image styles the tool can produce.

💡Re-roll

Re-roll is an option that allows users to run the same prompt again, potentially generating a different image each time. It is one of the post-generation options provided by the mid-journey tool, offering users the chance to explore variations based on the same initial request.

💡Upscale

Upscale is a feature that generates a larger version of an image. The script mentions that users can upscale images using different versions of mid-journey, which may provide access to different upscalers like the beta upscaler, light upscaler, and upscale to the max, indicating the tool's ability to enhance image resolution and detail.

💡Parameters

Parameters are additional options or settings that users can add to their prompts to influence the AI's image generation process. The script lists several parameters, such as aspect ratio, chaos value, stylize value, quality parameter, and seed number, which can be adjusted to customize the output of the AI tool.

💡Chaos Value

The chaos value is a parameter that determines the level of variety or randomness in the generated images. A higher chaos value (e.g., 50 or 100) results in more diverse images, while a lower value (e.g., 5 or 20) produces images that are more similar to each other. This parameter allows users to control the creative range of the AI's output.

💡Stylize Value

The stylize value is another parameter that affects the artistic style and creativity of the generated images. A high stylize value (e.g., S500 or S1000) creates more artistic and potentially abstract images, while a lower value (e.g., S0, the default) results in images that are more literal and closely follow the user's prompt.

💡Quality Parameter

The quality parameter, denoted by 'q', is a setting that determines the rendering time and quality of the generated images. A lower value (e.g., 0.25) results in faster and cheaper image generation, while a higher value (e.g., 1 or 2) makes the image generation more expensive and time-consuming, but potentially of higher quality.

💡Seed Number

A seed number is a starting point for the AI to create an image. The script mentions that each image generation is assigned a random seed number, but users can also specify a seed number to reproduce the same image. This feature allows for consistency and the ability to recreate specific images.

💡Stop Parameter

The stop parameter is used to control the completion percentage of the image generation process. By specifying a stop value (e.g., stop 10, stop 50, stop 80), users can see the image at various stages of completion, from barely started to almost finished. This parameter provides insight into the AI's generation process.

💡Multi-prompt

A multi-prompt is a feature that allows the AI to interpret and generate images based on multiple parts of a prompt separately. The script uses 'cyborg dinosaur' as an example, where the AI can assign different weights to each part of the prompt, emphasizing certain elements over others in the final image.

💡Permutations

Permutations refer to the different combinations or variations that the AI can generate based on the elements within brackets in a prompt. The script provides examples like 'stained glass', 'isometric', 'watercolor', 'tilt shift', and 'origami', which can be used to create diverse image styles and concepts.

💡Negative Prompt

A negative prompt is a directive to exclude certain elements from the generated image. The script illustrates this with the example 'a field of roses --no red', where the AI is instructed to create an image of roses without the color red, demonstrating the tool's ability to interpret and apply exclusions.

💡Fast Time

Fast Time refers to the rendering time allocated for image generation on the mid-journey platform. Users can have a certain amount of fast time per month, which can be used for quicker image generation. The script mentions options for 15 hours or 30 hours of fast time per month, with unlimited relaxed generations on the standard plan.

💡Stealth Mode

Stealth mode is a feature that allows users to keep their image generations private, so that they are not visible to others on the mid-journey website. By default, all creations are public, but users can opt for stealth mode to maintain privacy and control over their content.

💡Slash Commands

Slash commands are shortcuts used within the platform to access various features and functions. The script mentions several slash commands, such as 'forward slash prefer', 'forward slash describe', 'forward slash show', 'forward slash blend', 'forward slash info', 'forward slash relax', 'forward slash settings', 'forward slash FAQ', 'forward slash docs', and 'forward slash help', which provide quick access to functionalities like creating prompt shortcuts, image descriptions, viewing profiles, blending images, accessing FAQs, user guides, and getting help.

Highlights

Mid-journey updates frequently, making this feature overview potentially temporary.

Public channel picture generation allows private generation via an envelope Emoji reaction.

Using 'forward slash imagine' in Discord to write prompts, demonstrated with 'cyborg dinosaur'.

Different algorithms available for image generation: versions one to four.

Version 4 introduced four distinct styles including a 'cursed' secret style.

Version 5 and 5.1 style raw, along with niji, an algorithm trained on anime.

Niji offers additional styles: expressive, cute, and scenic.

Post-generation options include re-roll, upscale, variation, and remix.

Parameters can be added to prompts to change image aspect ratio using '--AR'.

Chaos value ('--chaos' or '--C') adds variety to grid images, with values 0 to 100.

Stylize value ('--stylize') influences creativity and artistic style of generation.

Quality parameter ('--q') determines rendering time, with values 0.25 to 2.

Seed numbers ensure unique image generation with each prompt.

Stop parameter allows control over the completion percentage of image generation.

Older versions include '--video' to create a video of image generation.

The '--tile' parameter creates seamless textures for pattern creation.

Multi-prompt feature allows weighting of different parts of the prompt.

The '--repeat' parameter enables multiple attempts of the same prompt.

Permutations can be created using squiggly brackets for varied prompt outcomes.

Image prompting combines an image with text for unique creations.

Negative prompting with '--no' excludes unwanted elements from images.

Parameters like '--test', '--test-e', '--test-p', and '--HD' modify image characteristics.

Slash commands like 'prefer', 'suffix', 'describe', and 'show' enhance user experience.

Slash commands 'blend', 'info', 'relax', 'settings', 'FAQ', 'docs', and 'help' provide additional functionality.