The Free & Uncensored Version of MidJourney! (FLUX.1)

Matt Wolfe
6 Aug 202418:24

TLDRThe video explores Flux.1, a new AI image generating tool by Black Forest Labs, which rivals MidJourney in quality. Developed by the team behind Stable Diffusion, Flux offers three models with varying capabilities and costs. Flux.1 Schnell is open-source, suitable for personal use and local development. The video tests Flux.1's performance in realism, text generation, and prompt adherence, comparing it to MidJourney and DALL-E 3. Flux.1 shows promise, particularly in text-related images and uncensored content generation, and is expected to evolve with community contributions. The video also hints at Flux.1's potential future as a text-to-video model, positioning it as a strong contender in the AI art world.

Takeaways

  • ๐Ÿš€ Black Forest Labs has released a new AI image generating tool called Flux.1, which is being compared to MidJourney.
  • ๐Ÿ› ๏ธ Flux.1 is developed by many team members who contributed to the creation of Stable Diffusion, including innovations like VQ-GAN and Latent Diffusion.
  • ๐Ÿ”‘ There are three models of Flux.1, each with varying power and cost: Flux.1 Schnell (fastest, open source, for local development), Flux.1 Dev (middle model for non-commercial use), and Flux.1 Pro (top model for enterprise solutions).
  • ๐Ÿ“œ Flux.1 Schnell is open source under the Apache 2.0 license, allowing for commercial and non-commercial use of generated images.
  • ๐ŸŒ Several websites have integrated Flux.1, allowing free use of the models, including Hugging Face and Glyph, an AI workflow builder.
  • ๐ŸŽจ Flux.1 is particularly strong in generating photorealistic images and handling text within images, but may not excel in creating certain artistic styles like hand-drawn illustrations or oil paintings.
  • ๐Ÿ” Flux.1's prompt adherence is good but not as advanced as DALL-E 3, which can capture more elements from complex prompts.
  • ๐Ÿ”“ Flux.1 is uncensored, allowing for more creative freedom compared to some other models, although it does have an NSFW filter.
  • ๐ŸŒ The open-source nature of Flux.1 Schnell means that it can be downloaded and run locally on personal computers in the future.
  • ๐Ÿ“น Flux.1 is set to be the foundation for an upcoming suite of text-to-video generative systems, offering an open-source alternative to tools like Lumen Runway and Gen-3 Sora.
  • ๐Ÿ”ฎ While Flux.1 shows great potential, it has not yet surpassed MidJourney in all aspects, but it is rapidly approaching and could become a strong competitor as it evolves and is fine-tuned by the community.

Q & A

  • What is the name of the new AI image generating tool discussed in the video?

    -The new AI image generating tool discussed in the video is called Flux.1, developed by Black Forest Labs.

  • Who are the team members behind Flux.1 and what is their experience?

    -The team behind Flux.1 includes many members who helped build Stable Diffusion. Their innovations include creating VQ-GAN and Latent Diffusion, models for image and AI video generation like Stable Diffusion XL, Stable Video Diffusion, and Rectified Flow Transformers.

  • How many models are there in Flux and what differentiates them?

    -There are three models in Flux: Flux One Schnell, which is the fastest and open source; Flux One Dev, which is more efficient and prompt adherent than Schnell; and Flux One Pro, the top-line model designed for enterprise solutions.

  • What is special about Flux One Schnell in terms of licensing?

    -Flux One Schnell is special because it is openly available under the Apache 2.0 license, making it open source. Any tools created using Flux One Schnell can be sold, and any images generated can be used both non-commercially and commercially.

  • What platforms have integrated the Flux models and how can they be used for free?

    -Several websites have integrated the Flux models. One of the simplest ways to use them for free is through Black Forest Labs on Hugging Face, where you can use the Schnell and Dev models within Hugging Face Spaces. Additionally, the platform Glyph allows you to build your own Flux workflows and generate images for free, even using the Pro model.

  • What type of image does Flux.1 struggle to generate according to the video?

    -Flux.1 struggles to generate illustrations that have a hand-drawn, oil painting, or watercolor style, as it may lack the fine details and specific characteristics of these art styles.

  • What is Flux.1 particularly good at generating?

    -Flux.1 is particularly good at generating realistic images and anything that involves text, such as logos or memes. It also excels in prompt adherence, capturing many elements from complex prompts.

  • What does the video suggest about Flux.1's uncensored nature and its implications?

    -The video suggests that Flux.1's uncensored nature allows for the generation of a wider range of images, including copyrighted material and potentially NSFW content in the future. However, it currently has an NSFW filter and is centered in good ways, providing optionality for creators.

  • How does the video compare Flux.1's prompt adherence to that of Mid Journey and Dolly 3?

    -The video suggests that Flux.1's prompt adherence is on par with Mid Journey, capturing many elements from the prompts but not all. In contrast, Dolly 3 is said to have superior prompt adherence, capturing all elements from complex prompts.

  • What future developments for Flux.1 does the video mention?

    -The video mentions that Flux.1 will serve as the foundation for an upcoming suite of competitive generative text-to-video systems, suggesting that it will soon be possible to generate video content using an open-source model similar to Flux.1.

Outlines

00:00

๐Ÿš€ Introduction to Flux One AI Image Generation Tool

The script introduces Flux One, a new AI image-generating tool from Black Forest Labs, which is being compared to the capabilities of mid-journey models. It highlights the team's background in developing AI models like VQ-GAN and latent diffusion, and offers an overview of three models: Flux One Schnell (open source, suitable for home use), Flux One Dev (non-commercial applications), and Flux One Pro (enterprise solutions). The video aims to explore these models, their capabilities, and how they can be accessed and used, such as through Hugging Face or the Glyph platform.

05:02

๐ŸŽจ Evaluating Flux One's Artistic Capabilities and Realism

This paragraph delves into the capabilities and limitations of Flux One in generating various art styles like illustrations, oil paintings, and watercolors, comparing its outputs to those of mid-Journey. It also discusses the tool's strengths in creating realistic images and handling text within images effectively. The video script includes a demonstration of generating images with prompts like 'a wolf howling at the moon' and explores the use of AI to optimize prompts for better results.

10:03

๐Ÿ” Analyzing Prompt Adherence and Flexibility of Flux One

The script examines Flux One's prompt adherence, comparing it with other models like Dolly 3 and mid-Journey. It discusses the tool's uncensored nature, which allows for more creative freedom, and its potential for generating copyrighted or celebrity images. The paragraph also explores strategies for improving prompts and the anticipation of Flux One's development into a text-to-video model, offering an open-source alternative to proprietary tools.

15:07

๐ŸŒ Flux One's Open Source Advantage and Future Prospects

The final paragraph discusses the open-source nature of Flux One Schnell and its implications for the future of AI image generation. It speculates on the potential for the community to improve and customize the model, possibly surpassing the capabilities of current leading platforms. The script also mentions the upcoming text-to-video functionality of Flux One and encourages viewers to stay updated with the latest AI tools and news through a newsletter, concluding with a call to action for likes and subscriptions.

Mindmap

Keywords

๐Ÿ’กAI Image Generation

AI Image Generation refers to the process by which artificial intelligence algorithms create visual content based on textual descriptions or other input data. In the context of the video, it is the core technology behind the new tool 'Flux.1' from Black Forest Labs, which is compared with 'MidJourney' for its capabilities in generating images that are claimed to be on par or even superior in some aspects.

๐Ÿ’กBlack Forest Labs

Black Forest Labs is the company behind the development of 'Flux.1', an AI image generation tool. The script mentions that this team has a strong background in creating AI models for image and video generation, including contributions to 'Stable Diffusion', indicating their expertise and the potential quality of 'Flux.1'.

๐Ÿ’กFlux.1 Models

The video script discusses three different models of 'Flux.1', each with varying levels of power and cost. Flux.1 Schnell is the fastest and open-source, Flux.1 Dev is more efficient and prompt-adherent, and Flux.1 Pro is the top-line model designed for enterprise solutions. These models represent the different tiers of service offered by Black Forest Labs.

๐Ÿ’กOpen Source

Open Source in the context of the video refers to the fact that Flux.1 Schnell is available under the Apache 2.0 license, meaning its source code is publicly accessible, and anyone can modify and use it for various purposes, including commercial ones, without significant restrictions.

๐Ÿ’กHugging Face

Hugging Face is mentioned as a platform that has integrated the 'Flux.1' models, allowing users to try them out for free. It is a significant aspect of accessibility, as it enables a wide range of users to experiment with AI image generation without financial barriers.

๐Ÿ’กGlyph

Glyph is described as an AI workflow builder in the script, which allows users to create their own workflows for image generation using 'Flux.1 Pro'. It represents a more advanced and customizable approach to using AI models, offering users the ability to optimize prompts and generate images with greater control.

๐Ÿ’กPrompt Adherence

Prompt Adherence is a measure of how well an AI model can incorporate all the elements from a given textual description into the generated image. The video compares 'Flux.1' with 'MidJourney' and 'Dolly 3' in this regard, noting that while 'Flux.1' is good, 'Dolly 3' excels in capturing all prompt details.

๐Ÿ’กRealism

Realism, in the context of AI image generation, refers to the ability of the model to create images that closely resemble real-world objects and scenes. The video suggests that 'Flux.1' is designed to excel in producing realistic images, which is one of the key selling points for users seeking high-quality visual outputs.

๐Ÿ’กUncensored

The term 'Uncensored' in the video script implies that 'Flux.1' does not impose restrictions on the type of content that can be generated, unlike some other models that may have filters against generating certain types of images. However, it is noted that while currently it cannot generate NSFW content, the open-source nature of the model may change this in the future.

๐Ÿ’กText-to-Video Model

The script mentions that 'Flux.1' will eventually serve as the foundation for a text-to-video model, expanding the capabilities of the tool beyond image generation to include video creation. This indicates the forward-thinking and potential evolution of 'Flux.1' in the generative AI space.

Highlights

Introduction of a new AI image generating tool, FLUX.1, by Black Forest Labs.

FLUX.1 was developed by team members who helped build Stable Diffusion, known for innovations like VQ-GAN and Latent Diffusion.

Three models of FLUX.1: Schnell for local development, Dev for non-commercial applications, and Pro for enterprise solutions.

FLUX.1 Schnell is open source under the Apache 2.0 license, allowing commercial and non-commercial use.

Integration of FLUX.1 with platforms like Hugging Face for free use.

Use of Glyph, an AI workflow builder, to create custom workflows and generate images with FLUX.1 Pro for free.

FLUX.1's capability to generate high-quality, realistic images compared to MidJourney.

FLUX.1's prompt adherence and its ability to incorporate multiple elements from a complex prompt.

FLUX.1's strength in text generation within images, making it suitable for creating logos and memes.

FLUX.1's uncensored nature, allowing the generation of copyrighted images and existing IPs.

Comparison of FLUX.1's realism with MidJourney and DALL-E 3, noting FLUX.1's potential to improve.

FLUX.1's potential to become an all-in-one solution, combining the strengths of MidJourney, DALL-E 3, and Stable Diffusion.

The upcoming text-to-video model based on FLUX.1, offering an open-source alternative to tools like Lumen Runway and Gen-3 Sora.

Tips for improving prompts with FLUX.1, including the use of AI methods and detailed descriptions.

FLUX.1's current limitations in generating certain art styles like illustrations compared to MidJourney.

The open-source nature of FLUX.1 Schnell, allowing developers to fine-tune and improve the model.

Anticipation for the future development of FLUX.1 and its potential to outperform existing AI art platforms.

Encouragement for users to experiment with FLUX.1 and stay updated with the latest AI tools and news.