Midjourney v6.1 & Leonardo.AI Acquisition!

Theoretically Media
2 Aug 202411:02

TLDRThe video discusses AI image generation updates, focusing on Midjourney's v6.1 model release, which promises improved image quality and features. It also covers the acquisition of Leonardo.AI by Canva, hinting at potential integration with Canva's suite. Additionally, it touches on the open-source alternative 'flux' and the upcoming updates for Runway ML's Gen 3, including a faster, more affordable Turbo model.

Takeaways

  • 🚀 Midjourney has released its v6.1 model, which offers sharper image quality, more coherent outputs, improved text rendering, and an enhanced upscaler.
  • 🆕 The v6.1 model is set as the default for Midjourney users, with subtle but notable improvements over the previous version.
  • 🔍 Personalization of images can be achieved by adding '-d-p' to the prompt, which helps in refining the aesthetic output.
  • 🌟 The new 'Q mode' increases the texture of images, potentially at the cost of some coherence.
  • 📈 The upscalers in v6.1 are noted for their effectiveness, with 'subtle' being the recommended setting for most users.
  • 📖 Midjourney's text coherence has been improved, allowing for better rendering of words within quotation marks.
  • 🎨 The 'describe' feature is undergoing updates, which may currently be causing some issues with image references.
  • 🔮 Upcoming version 7 of Midjourney promises enhanced aesthetics, faster performance, and smarter prompt understanding, along with significant overall enhancements.
  • 🛠️ Flux, an open-source text-to-image model created by ex-Stability employees, has been released as a potential Midjourney competitor.
  • 💰 Canva has acquired Leonardo.AI, which will continue to operate independently, and may influence the future of AI imagery, particularly with Canva's Magic Media feature.
  • 🎉 Runway ML's Gen 3 pricing is set to become more accessible, with a new 'turbo' model for faster video generation and lower pricing for image-to-video conversion.

Q & A

  • What is the significance of Midjourney v6.1's release in AI image generation?

    -Midjourney v6.1 is significant as it introduces improvements such as sharper image quality, more coherent outputs, improved text rendering, and an enhanced upscaler, which collectively contribute to a more refined AI image generation experience.

  • What does the new personalization feature in Midjourney v6.1 entail?

    -The new personalization feature in Midjourney v6.1 allows users to add a personalization code to their prompts by adding '-d-p' to the end of their prompt, which helps in generating images that are more tailored to the user's aesthetic preferences.

  • How does the 'Q mode' in Midjourney v6.1 affect the texture of images?

    -The 'Q mode' in Midjourney v6.1, when activated with the command '--Q Space 2', increases the textures of the images, potentially at the cost of image coherence, providing a more detailed and textured visual output.

  • What is the recommended upscaler to use in Midjourney v6.1 according to the script?

    -The script suggests that the 'subtle upscale' is the recommended option in Midjourney v6.1, as the 'creative upscale' may be too heavy-handed and result in an overly airbrushed look.

  • How does Midjourney v6.1 handle in-image text coherence?

    -Midjourney v6.1 has improved in-image text coherence, ensuring that words enclosed in quotation marks are accurately rendered in the generated image, enhancing the overall quality and readability of the text.

  • What updates are expected in the upcoming Midjourney version 7?

    -Version 7 of Midjourney is expected to feature enhanced aesthetics, faster performance, smarter prompt understanding, increased knowledge-based improvements, improved word comprehension and rendering, and significant overall enhancements. It also has 3D and video capabilities on the roadmap.

  • What is the significance of Canva's acquisition of Leonardo.AI?

    -Canva's acquisition of Leonardo.AI is significant as it suggests an integration of Leonardo's advanced AI image generation capabilities into Canva's suite of design tools, potentially enhancing Canva's offerings and providing users with more sophisticated image editing features.

  • What is the relationship between Canva, Affinity, and Leonardo.AI post-acquisition?

    -After Canva's acquisition of Affinity and Leonardo.AI, it is speculated that Affinity, which is similar to Photoshop, might integrate Leonardo's creative AI capabilities, potentially offering a more powerful image editing tool within the Canva ecosystem.

  • What is the status of Gen 3 pricing according to the script?

    -The script mentions that there was a misconception about Gen 3's pricing doubling, but in reality, Runway ML is planning to roll out a turbo model for Gen 3 with significantly lower pricing, making it more accessible to users.

  • What is the expected impact of the new turbo model for Gen 3 on its pricing and availability?

    -The new turbo model for Gen 3 is expected to generate video much faster and will be available at a significantly lower price, with the possibility of being accessible to free users, addressing previous concerns about cost.

Outlines

00:00

🚀 Mid Journey v6.1 Model Release and Features

The script discusses the release of Mid Journey's v6.1 model, highlighting its improved image quality, coherence, text rendering, and upscaler. It compares the outputs of different versions of Mid Journey, noting the subtle but significant enhancements in v6.1. The script also covers personalization codes, the new Q mode for texture enhancement, and the recommended use of 'subtle' upscale for creative outputs. Additionally, it touches on the improved text coherence in images and the potential upcoming updates for Mid Journey.

05:01

🌐 Open Source AI Imagery and Canva Acquisitions

This paragraph delves into the open-source text-to-image model 'flux' by black forest Labs, an initiative by ex-Stability employees, and its potential as a Mid Journey competitor. The script also covers Canva's acquisition of Leonardo.da, discussing the implications for the future of AI imagery and Adobe. It speculates on the integration of Leonardo's Phoenix model into Canva's Magic Media feature and the potential impact on Affinity Photo, which Canva also owns, suggesting a possible shift in the creative software landscape.

10:02

📈 Runway ML Gen 3 Pricing and Updates

The final paragraph addresses the pricing concerns surrounding Runway ML's Gen 3 and announces a forthcoming 'turbo' model for faster video generation. It mentions the upcoming release of a lower-priced turbo option for image-to-video conversion and its availability to free users. The script clarifies that the previously mentioned $95 unlimited plan was not the new pricing structure and that an official announcement is pending. It concludes by commending Runway for addressing user feedback on pricing.

Mindmap

Keywords

💡Midjourney v6.1

Midjourney v6.1 refers to the latest version of the AI image generation model developed by the company Midjourney. It is highlighted in the video for its improved image quality, coherence, text rendering, and upscaler capabilities. The script mentions a comparison between the outputs of different versions, with v6.1 providing notably better results, such as a more accurate representation of a man in a blue business suit.

💡AI Image Generation

AI Image Generation is the process by which artificial intelligence algorithms create images based on textual descriptions or other input data. The script discusses the advancements in this technology, particularly with the release of Midjourney's v6.1 model, which demonstrates the evolving capabilities of AI to produce more coherent and higher-quality images.

💡Open Source

Open Source refers to a type of software or model whose source code is available to the public, allowing anyone to view, modify, and distribute the software without restrictions. The script introduces 'flux', an open-source text-to-image model created by Black Forest Labs, as a potential competitor to Midjourney, indicating a trend towards more accessible AI technologies.

💡Leonardo.AI Acquisition

The term 'Leonardo.AI Acquisition' refers to the event where the company Canva acquired Leonardo.AI, an AI-driven image generation platform. The acquisition is significant as it suggests a strategic move by Canva to integrate advanced AI capabilities into their suite of design tools, potentially influencing the future of AI imagery and design software.

💡Runway ML's Gen 3 Pricing

Runway ML's Gen 3 Pricing refers to the cost structure for using the third generation of Runway ML's AI image and video generation services. The script humorously addresses rumors of a price increase, then clarifies that the actual news is positive, with the introduction of a 'turbo' model that will generate videos faster and at a lower cost, making the technology more accessible.

💡Text Rendering

Text Rendering in the context of AI image generation is the process by which AI interprets and visually represents text within an image. The script notes that Midjourney v6.1 has improved text rendering, as demonstrated by the accurate depiction of a book title within a generated image.

💡Upscale

Upscale in the context of image processing refers to the enhancement of an image's resolution while maintaining or improving its quality. The script discusses the upscaler feature in Midjourney v6.1, which allows for the creation of higher-resolution images with subtle or creative adjustments.

💡Personalization Code

Personalization Code in AI image generation is a unique set of parameters that tailors the AI's output to the user's preferences or style. The script explains how to add a 'd-p' to the end of a prompt to personalize the AI's output, which can lead to more aesthetically pleasing images according to the user's taste.

💡Q Mode

Q Mode, activated with the command '--Q Space 2', is a feature in Midjourney v6.1 that increases the texture detail in generated images. The script mentions that while it adds texture, it might affect the coherence of the image, providing an example of an abstract image where the texture was enhanced without compromising coherence.

💡Image Coherence

Image Coherence refers to the consistency and logical arrangement of elements within an image. The script discusses how Midjourney v6.1 maintains or improves image coherence, especially when using features like Q Mode or when generating images with specific text elements.

💡Describe

In the context of AI image generation, 'Describe' is a feature that helps in generating images based on detailed textual descriptions. The script mentions that 'Describe' is one of the favorite features of the narrator and notes an issue with the tool being temporarily broken, suggesting an ongoing update.

Highlights

Midjourney v6.1 model has been launched with improved image quality, coherence, text rendering, and upscaler.

Midjourney's v6.1 model is set as the default, offering sharper and more coherent outputs.

Comparison between Midjourney v6.1 and previous versions shows noticeable improvements in image generation.

Personalization code or 'd-p' can be added to prompts in Midjourney to enhance image outputs.

New 'Q mode' in Midjourney increases image textures but may affect coherence.

Subtle upscale is recommended in Midjourney v6.1 for maintaining image quality.

Midjourney v6.1 enhances in-image text coherence, improving the rendering of words within quotation marks.

Describe feature in Midjourney is undergoing updates, potentially improving image reference capabilities.

Flux, an open-source text-to-image model by Black Forest Labs, positions itself as a competitor to Midjourney.

Canva's acquisition of Leonardo raises questions about potential integration with Affinity and impact on Adobe's Gen AI.

Leonardo's Phoenix model may be integrated into Canva's Magic Media feature, enhancing its capabilities.

Affinity Photo, owned by Canva, may benefit from Leonardo's AI technology, offering an alternative to Adobe's Photoshop.

Runway ML's Gen 3 pricing is not doubling; instead, they are introducing a faster and more affordable Turbo model.

Runway ML's Gen 3 Alpha Turbo promises faster video generation at a significantly lower cost.

Runway ML is expected to roll out the Turbo model for image-to-video with lower pricing for free users.

The final pricing for Runway ML's Gen 3 has not been officially announced, but it will be lower than the speculated $95 unlimited plan.