Was NOT Expecting This! Midjourney V6 Competes with DALL-E 3 | Comparison & Review

MattVidPro AI
21 Dec 202319:33

TLDRMidjourney V6, a significant update to the AI art platform, is now competing with DALL-E 3, as showcased in a detailed comparison and review. Despite being in its alpha version, V6 has impressed with its ability to generate realistic images and text, often outperforming DALL-E 3 in certain aspects. The video explores community reactions, prompt accuracy, and photorealism, highlighting Midjourney's strengths in text generation and its cinematic and realistic outputs. While DALL-E 3 remains a leader with superior text and world understanding, Midjourney V6's advancements have reignited interest and placed it back in the competitive AI art landscape.

Takeaways

  • 😮 Midjourney V6 has made significant advancements and is now competing with DALL-E 3 in the AI art landscape.
  • 🕒 The development time for Midjourney V6 was nearly twice as long as the previous longest development cycle, indicating a major update.
  • 🎉 Midjourney V6 is still in its Alpha version, suggesting that its capabilities will continue to improve.
  • 📸 The AI can generate photorealistic images with accurate text, as demonstrated by community examples.
  • 🔍 A comparison with DALL-E 3 shows that while both can produce high-quality images, Midjourney V6 has a more cinematic and realistic vibe.
  • 🤔 SDXL, another AI model, is more versatile and open-source but is not the focus of this comparison.
  • 🍌 Impressive examples from the community include a realistic standup pouch product photo and an anime movie poster with correct text.
  • 🍦 Midjourney V6 has improved in text accuracy and photorealism, leading the presenter to resubscribe after previously cancelling.
  • 📈 The presenter conducted their own tests, finding that Midjourney V6 can produce accurate text but may require specific prompting.
  • 📱 A prompt for a lemon character with the title 'Matt vidpro' was successfully generated with correct text, showcasing the AI's capabilities.
  • 💬 The presenter has a theory that Midjourney V6 might be synthetically trained to produce text, unlike DALL-E 3 which might be naturally trained.

Q & A

  • What is the significance of the development time for Midjourney V6 compared to previous versions?

    -The development time for Midjourney V6 was nearly twice as long as the previous longest development cycle, indicating a significant investment in innovation and improvement to compete with other AI art platforms like DALL-E 3.

  • How does the AI art landscape change with the introduction of DALL-E 3?

    -DALL-E 3 introduced an unprecedented level of coherence in prompt understanding and impressive scalability at a competitive price point, which has shifted the expectations and standards in the AI art landscape.

  • What are the current platforms where DALL-E 3 can be accessed for free?

    -DALL-E 3 can currently be accessed for free on Bing Image Creator and Microsoft Designer Image Creator, both of which utilize the DALL-E 3 API from OpenAI.

  • How does Midjourney V6 compare to DALL-E 3 in terms of text generation and realism in images?

    -Midjourney V6 has made significant strides in text generation and photorealism, with some users finding the text in Midjourney V6 images to be more aesthetically pleasing and the images more cinematic and realistic compared to DALL-E 3.

  • What are the subjective views of the community regarding the visual output of Midjourney V6?

    -The community has mixed views, with some members finding Midjourney V6's images to be more beautiful and realistic, while others note that DALL-E 3's images, despite occasional spelling errors, still appear more photorealistic.

  • What are some of the specialized use cases where SDXL might be preferred over Midjourney or DALL-E 3?

    -SDXL is a versatile, free, and open-source model that is better suited for specialized use cases that require customization and flexibility, whereas Midjourney and DALL-E 3 are more focused on general-purpose AI art generation.

  • How does Midjourney V6 handle the generation of text within images?

    -Midjourney V6 has improved text generation capabilities and can produce text that is more accurate and better integrated into the image. However, it may sometimes require multiple attempts to get the text right.

  • What is the current status of Midjourney V6 in terms of accessibility and subscription requirements?

    -Midjourney V6 is currently in the alpha version and requires a subscription to access. It is not the default model and users need to change the settings within Discord to use it.

  • What are the advantages of using Midjourney V6 over DALL-E 3 in terms of creative control?

    -Midjourney V6 offers more control with less censorship, a better understanding of pop culture characters, more aspect ratios to choose from, and different modes, including an in-painting feature that DALL-E 3 lacks.

  • What is the narrator's theory regarding the difference in text generation between Midjourney V6 and DALL-E 3?

    -The narrator theorizes that Midjourney V6 might be synthetically trained to produce text, which could explain why it sometimes generates text that looks more like it was created with a program like MS Paint or Photoshop, whereas DALL-E 3 might be naturally trained, resulting in more naturally integrated text.

  • How does the narrator view the competition between Midjourney V6 and DALL-E 3 in the context of the AI art market?

    -The narrator views Midjourney V6 as a strong contender that has managed to keep up with DALL-E 3 in many areas, particularly in photorealism. They believe that Midjourney V6's improvements put it almost on par with DALL-E 3, making it a significant player in the AI art market.

Outlines

00:00

🚀 Mid Journey V6: A Competitive AI Art Update

The video discusses the release of Mid Journey V6, an AI art generator that has been in development for an unusually long time compared to its previous versions. The new version is seen as a direct competitor to Dolly 3, another AI art platform. The script highlights community reactions, showcasing examples of generated images that are highly realistic and textually accurate. It compares Mid Journey V6 with Dolly 3 and sdxl, noting that while Dolly 3 is free on certain platforms, Mid Journey V6 offers a more cinematic and realistic output in its alpha version, with the promise of further improvements.

05:02

🎨 Comparing AI Art Generators: Mid Journey V6 vs. Dolly 3

This paragraph delves deeper into the comparison between Mid Journey V6 and Dolly 3, focusing on text generation capabilities and photorealism. It presents various examples from the community, including advertisements and movie posters, and discusses the accuracy and aesthetics of the text in the generated images. The script also mentions the need for specific prompting techniques to achieve the best results with AI generators. The video creator expresses surprise at Mid Journey V6's ability to compete with Dolly 3, especially in text generation, and shares personal testing results that further illustrate the strengths and weaknesses of each platform.

10:03

🤖 Testing AI Image Generators: Mid Journey V6's Photorealism and Pop Culture Accuracy

The script describes the video creator's personal experiments with Mid Journey V6, testing its capabilities in generating photorealistic images and handling prompts involving famous pop culture characters. It details the process and results of generating images of a shitsu puppy in a pirate outfit and a selfie of Walter White and Jesse Pinkman. The video also touches on the limitations and successes of Dolly 3 when faced with similar prompts, noting that while Dolly 3 has a strong grasp on character generation, Mid Journey V6 excels in creating realistic and believable images.

15:04

🌟 Mid Journey V6's Strengths and the Future of AI Art Generation

In the final paragraph, the video creator reflects on Mid Journey V6's performance, highlighting its strengths in photorealism and its competitive edge in text generation against Dolly 3. They discuss the potential reasons behind the differences in text quality between the two AI platforms, suggesting that Mid Journey V6 might be synthetically trained for text production. The script also addresses the pricing models of both AI generators and the creator's decision to resubscribe to Mid Journey due to the impressive advancements in V6. The video ends with a call to action for viewers to share their thoughts and a tease for further exploration of Mid Journey V6's capabilities.

Mindmap

Keywords

💡Midjourney V6

Midjourney V6 refers to the sixth iteration of the AI art generation platform, Midjourney. It is a significant update that has been in development nearly twice as long as the previous longest development cycle. This version is highlighted in the video for its improved capabilities and ability to compete with other AI art generators like DALL-E 3. The script mentions that Midjourney V6 has made impressive strides, especially in generating photorealistic images and handling text within prompts more effectively.

💡DALL-E 3

DALL-E 3 is an advanced AI art generator developed by OpenAI. It is known for its high level of coherence, prompt understanding, and ability to generate images at an impressive scale and price level. The video script compares Midjourney V6 with DALL-E 3, noting that while DALL-E 3 is currently leading in the market, Midjourney V6 has made significant improvements and is now a strong competitor.

💡AI Art Landscape

The AI art landscape refers to the current state and development of AI technologies that generate art. The script discusses how this landscape has completely changed with the introduction of competitors like Midjourney and DALL-E, which have set new standards for AI-generated imagery. The term is used to describe the competitive environment and the rapid advancements in AI art generation.

💡Photorealism

Photorealism in the context of AI art generation refers to the ability of an AI to create images that closely resemble real photographs. The script praises Midjourney V6 for its advancements in photorealism, noting that it has led in this direction since version 5 and has further improved in V6, making the generated images incredibly lifelike and difficult to distinguish from actual photographs.

💡Prompt Understanding

Prompt understanding is the AI's ability to interpret and generate images based on textual descriptions provided by users. The video script discusses how Midjourney V6 has improved in this area, being able to generate more accurate and relevant images in response to user prompts compared to previous versions and in comparison to DALL-E 3.

💡Text Generation

Text generation in AI art refers to the AI's capability to create readable and contextually appropriate text within generated images. The script highlights that Midjourney V6 has made strides in text generation, being able to integrate text more effectively into images, although it sometimes appears less natural compared to DALL-E 3.

💡Anime Movie Poster

An anime movie poster is an example given in the script to illustrate the AI's ability to generate culturally specific content. The video discusses how Midjourney V6 can create an anime-style poster with correct text, showcasing its versatility and improvement in handling different styles and text accuracy.

💡Coca-Cola Ad

The Coca-Cola ad mentioned in the script is an example used to demonstrate the AI's ability to generate branded content accurately. It highlights the AI's challenge in replicating well-known logos and patterns, which is an important aspect of creating realistic and believable advertising imagery.

💡In-Painting

In-painting is a feature that allows AI to fill in missing or selected parts of an image with new content that is consistent with the surrounding areas. The script notes that Midjourney V6 has this feature, which is a significant advantage over DALL-E 3, as it provides users with more control and creative possibilities when editing images.

💡Subscription Plan

A subscription plan in the context of AI art generators like Midjourney refers to the pricing model where users pay a monthly fee to access the AI's services. The script mentions that to access Midjourney V6, users need a subscription, which is a consideration for potential users comparing it with free alternatives like DALL-E 3.

💡Discord

Discord is a communication platform used by Midjourney for users to interact with their AI service. The script expresses frustration with the use of Discord for this purpose, suggesting that a web interface would be more convenient. This highlights the user experience aspect of AI art generation services.

Highlights

Midjourney V6 has made significant advancements, competing with DALL-E 3 in AI art generation.

The development time for Midjourney V6 was nearly twice as long as the previous longest development cycle.

Midjourney V6 is currently in Alpha and has already shown impressive capabilities.

Community reactions suggest that Midjourney V6 can generate more beautiful and realistic words compared to DALL-E 3.

Midjourney V6 has a more cinematic and realistic vibe in its generated images.

SDXL, while versatile and open-source, is not the focus of this comparison, with Midjourney V6 being the primary subject.

Midjourney V6 outperforms SDXL in terms of quality and realism.

The text in Midjourney V6's generated images wraps and conforms around objects in a realistic way.

DALL-E 3 sometimes struggles with spelling accuracy in its generated images.

Midjourney V6 has a stronger emphasis on photorealism compared to DALL-E 3.

Midjourney V6 requires specific prompting to achieve accurate text generation.

DALL-E 3 is available for free on certain platforms, whereas Midjourney V6 requires a subscription.

Midjourney V6 has a competitive edge in generating photorealistic images that resemble professional photography.

DALL-E 3 has a better understanding of complex prompts and pop culture references.

Midjourney V6 has less censorship and offers more control over the generated images.

Midjourney V6 includes an in-painting feature, which DALL-E 3 lacks.

The reviewer believes Midjourney V6 is just one step behind DALL-E 3 and has the potential to compete effectively.

Midjourney V6's text generation appears to be synthetically trained, resulting in a unique aesthetic.

The reviewer has resumed their Midjourney subscription due to the impressive capabilities of V6.