Is FLUX better than Midjourney?
TLDRThe video discusses the new AI model FLUX by Black Forest Labs, which is being hailed as a potential competitor to Midjourney. The model impresses with its high-quality results and ease of use, unlike previous iterations. It comes in three variants: Pro, Dev, and Schnell, each with different capabilities and licensing, including commercial use under the Apache 2.0 license. Viewers are guided on how to use FLUX with Comfy UI, download models, and optimize settings for memory usage. The video showcases FLUX's impressive image generation capabilities and introduces a prompt enhancer tool by Angry Penguin to aid in creating detailed prompts. The host expresses excitement for future updates and the potential of FLUX in video generation.
Takeaways
- 🆕 A new model named FLUX has been released by Black Forest Labs, which is being considered a competitor to Midjourney.
- 🤖 The FLUX model was tested and showed impressive results with less effort compared to the initial testing phase of SD3.
- 📈 Three variants of the FLUX model have been released: FLUX point1 Pro, FLUX point1 Dev, and FLUX point1, with Pro being the top in creative capabilities.
- 📜 The legal use of the models is not entirely clear, but the 'Schnell' version seems to be usable for commercial purposes under the Apache 2.0 license.
- 🚫 The 'Dev' version's commercial use is less clear, and it may require payment or have restrictions.
- 🔗 The Pro version of FLUX is accessible through an API without the need for installation, using platforms like Replicate.
- 💾 To use FLUX in Comfy UI, specific models and files need to be downloaded and placed in the correct directories.
- 🔧 Tips are provided for users with low memory, suggesting the use of a lower diffusion model or changing settings to reduce memory usage.
- 🎨 The quality of images generated by FLUX is high, even at a zoomed-in level, which was a letdown in the initial SD3 testing.
- 📝 A 'flux prompt enhancer' tool created by Angry Penguin is recommended for generating detailed prompts quickly.
- 🖼️ Image-to-image functionality is available for FLUX, with a workflow provided by Curo that allows for adjustments in denoising and image scaling.
Q & A
What is the new model released by Black Forest Labs that is considered a competitor to Midjourney?
-The new model released by Black Forest Labs is called FLUX, and it has garnered attention for its impressive results compared to Midjourney.
How many variants of the FLUX model were released to the public?
-Three variants of the FLUX model were released to the public: FLUX point1 Pro, FLUX point1 Dev, and FLUX point1.
What does the chart mentioned in the transcript suggest about the creative capabilities of the different FLUX variants?
-According to the chart, FLUX Pro appears to have the best creative capabilities, followed by FLUX Dev, with FLUX being the lowest.
What is the Apache 2.0 license, and how does it relate to the FLUX model's commercial use?
-The Apache 2.0 license allows for free use of the software, including commercial use, without paying any fees or royalties. It permits modification and distribution of the licensed software, which applies to the FLUX model for personal, scientific, and commercial purposes.
Is there a difference in commercial use permissions between FLUX point1 and FLUX point1 Dev?
-Yes, the FLUX point1 (Schell) can be used commercially for personal use, while the FLUX point1 Dev is mentioned as being for non-commercial applications, and its commercial use permissions are not clear without additional information.
How can one access the FLUX Pro version without installing anything?
-The FLUX Pro version can be accessed through an API on a website like Replicate, where users can input their prompts and settings directly in their browser to run the model.
What is the recommended memory requirement for using the T5 XXL fp16 model?
-The T5 XXL fp16 model is recommended for users who have more than 32 gigabytes of memory. For lower memory usage, the fp8 model can be used instead.
What is the size of the FLUX point1 Dev model file, and where should it be saved?
-The FLUX point1 Dev model file is a large file, weighing in at 23.8 gigabytes. It should be saved in the 'unet' directory within the Comfy UI models folder.
How can users with low memory address memory issues when using the FLUX model in Comfy UI?
-Users with low memory can set the weight D type and the low diffusion model node to fb8, which will lower the memory usage by half, although it might slightly reduce image quality.
What is the 'flux prompt enhancer' mentioned in the script, and how can it be used?
-The 'flux prompt enhancer' is a tool created by Angry Penguin that takes a basic prompt and generates a more detailed and enhanced version of it. Users can input a simple concept and add a style, and the enhancer will create a comprehensive prompt for use with the FLUX model.
What are the current limitations of the FLUX model in terms of additional features like Control Nets or IP adapters?
-As of the script's recording, there are no Control Nets or IP adapters available for the FLUX model. However, image-to-image functionality is possible by adjusting the denoising settings and following specific workflows.
Outlines
🚀 Introduction to Black Forest Labs' Flood Model
The script introduces a new AI model named 'Flood' by Black Forest Labs, which is being hailed as a competitor to Mid Journey. The narrator shares their positive experience with the model's impressive results, contrasting it with their initial tests of SD3, which yielded mixed outcomes. Black Forest Labs, known for their work on models like Stable Diffusion XL and Stable Video Diffusion, has released three variants of the Flood model: Flux Point1 Pro, Flux Point1 Dev, and Flux Point1. The script discusses the potential commercial use of these models, with the Pro version available under the Apache 2.0 license for free use, including commercial purposes. The Dev version seems to be restricted to non-commercial applications, and the narrator expresses uncertainty about the legalities of commercial use. The paragraph concludes with instructions on downloading and installing the necessary models for use in Comfy UI, a popular AI tool, and provides tips for users with limited memory.
🎨 Exploring Flood Model's Capabilities and Community Tools
This paragraph delves into the practical use of the Flood model, specifically the Flux Point1 Dev variant, within Comfy UI. The narrator guides the audience through the process of generating detailed and high-quality images using the model, showcasing its ability to create intricate scenes like 'Darth Vader holding a sign' with remarkable clarity. The script also highlights the 'flux prompt enhancer' tool created by 'Angry Penguin,' which aids users in crafting effective prompts for the AI model, demonstrated with an example of creating a manga-style image of 'The Hulk driving a convertible.' Furthermore, the narrator discusses the potential for image-to-image generation with the model, despite the lack of control nets or IP adapters, and provides a workflow for achieving this. The paragraph concludes with the narrator's excitement for future updates, particularly the introduction of control nets and AP adapters, and their anticipation for the model's application in video generation.
🔮 Anticipating Future Developments and Closing Remarks
The final paragraph of the script reflects on the ongoing development and potential future improvements of the Flood model. The narrator expresses optimism about the model's progression and hints at creating more content as advancements are made. They encourage viewers to look forward to these updates and thank them for watching, signing off with a casual 'peace.' The paragraph is punctuated with music, adding a light-hearted tone to the closing of the video script.
Mindmap
Keywords
💡FLUX
💡Midjourney
💡Comfy UI
💡Black Forest Labs
💡Apache 2.0 license
💡Darth Vader
💡Image to Image
💡Flux Prompt Enhancer
💡Control Nets
💡Video to Video
Highlights
Introduction of a new model called FLUX by Black Forest Labs, which is being considered a competitor to Midjourney.
FLUX has three variants: Pro, Dev, and Schnell, with varying levels of creative capabilities.
The FLUX Pro variant is suggested as the best among the three versions.
Legal use and commercial viability of FLUX models are not entirely clear, with potential restrictions on the Dev version.
FLUX Pro is available as an API and can be accessed through a website without installation.
Instructions on how to download and install the necessary models for use in Comfy UI.
Recommendation to use the fp16 model for systems with more than 32GB of RAM for optimal performance.
Tips for users with low memory, suggesting adjustments to weight D type and model settings.
A demonstration of the FLUX model's ability to generate high-quality images with minimal prompts.
Comparison of FLUX's performance to the initial disappointments with Midjourney's SD3 model.
Introduction of a tool called 'flux prompt enhancer' created by Angry Penguin to help generate detailed prompts.
A showcase of the prompt enhancer's ability to create a manga-style image of the Hulk driving a convertible.
Discussion on the potential of FLUX for image-to-image generation despite the lack of control nets or IP adapters.
A workflow provided by Curo for image-to-image generation with FLUX, including tips for adjusting settings.
Anticipation for the integration of control nets and AP adapters to enhance FLUX's capabilities.
Speculation on whether Midjourney should be concerned about FLUX as a potential competitor.
A promise to update the audience on FLUX as new developments and improvements emerge.