MidJourney Version 6, finally worth the price?

VISULA by TOBY
3 Jan 202406:36

TLDRIn this video, Toby from Visual Toby reviews Mid Journey's version 6, highlighting its innovative, natural language prompting system for image generation. He compares the quality of images produced by Mid Journey with those from industry-leading models like Juggernaut XL, noting Mid Journey's superior detail and realism. Despite Mid Journey's strengths, Toby suggests there's room for improvement in customizability and privacy features, recommending a dedicated site and presets for enhanced user experience.

Takeaways

  • πŸš€ Introduction of Mid Journey version 6 with innovative features for image generation using AI.
  • 🌟 New prompting method that allows users to describe images in natural language, similar to explaining to a human.
  • 🎨 Comparison of Mid Journey's image quality with industry-leading stable diffusion models, showing Mid Journey's superior performance.
  • πŸ† Mid Journey's ability to capture detailed and realistic images, closely adhering to the user's prompt.
  • πŸ‘‰ Critique of Mid Journey's lack of customizability and limited freedom in generator options.
  • πŸ“± The generator operates on Discord, suggesting a need for a dedicated site for better user experience.
  • πŸ’‘ Suggestion for the introduction of presets or templates for ease of use, similar to other AI models.
  • πŸ’Έ Concern over the premium pricing for privacy features, which should be included in the base plan.
  • πŸ“ˆ Potential for other software to adapt similar technologies, indicating a competitive future in the AI image generation market.
  • πŸ‘ Encouragement for users to provide feedback and engage in discussions for continuous improvement of the AI model.
  • πŸŽ₯ The video serves as an educational resource for understanding the advancements in AI image generation technologies.

Q & A

  • What is the main topic of the video?

    -The main topic of the video is the introduction and review of Mid Journey version 6, an AI model for generating images, and its comparison with other AI models like Stable Diffusion and Deli.

  • How has the prompting process changed in Mid Journey version 6?

    -In Mid Journey version 6, the prompting process has been revamped to allow users to input natural language descriptions instead of using generic keywords. This allows for a more intuitive and human-like interaction with the AI.

  • What are the advantages of using natural language prompts in Mid Journey version 6?

    -Using natural language prompts in Mid Journey version 6 allows for more precise control over the generated images. It reduces the need for multiple iterations and 'prompt fiddling' as the AI can better understand and realize the user's exact ideas.

  • How does the video compare Mid Journey version 6 with Stable Diffusion and Deli?

    -The video compares the image quality and detail produced by Mid Journey version 6 with that of Stable Diffusion and Deli. It highlights how Mid Journey version 6 captures more details and produces more realistic images in line with the user's prompts.

  • What was the result of the photo realistic ice bear baby prompt comparison?

    -Both Mid Journey version 6 and Stable Diffusion struggled with interpreting the 'thumbs up' in the prompt. However, Mid Journey's version was considered better as it was more aligned with the photorealistic aspect of the prompt, despite appearing more plushy teddy bear-like.

  • What feedback does the video provide on the customizability and freedom of Mid Journey's generator?

    -The video suggests that the team behind Mid Journey needs to work on the customizability and freedom of their generator. It mentions the limitations of the current system, which runs on Discord, and the need for a dedicated site or presets/templates for more control.

  • What issue is raised regarding the privacy features of Mid Journey?

    -The video points out that privacy is an issue with Mid Journey as the default setting allows others to see the generated images. Upgrade to a higher plan is required for enhanced privacy, which the video suggests should not be a premium feature.

  • How does the video conclude about the future of AI image generation?

    -The video concludes that while Mid Journey version 6 is currently the best in terms of image generation, it doesn't have to remain that way forever. It suggests that other software will adapt similar technologies, promoting continuous improvement and competition in the field.

  • What is the reviewer's final verdict on Mid Journey version 6?

    -The reviewer is highly impressed with Mid Journey version 6, particularly with its ability to generate high-quality, detailed images based on natural language prompts. However, they also encourage the development team to address the issues of customizability, privacy, and platform accessibility.

  • How does the video address the issue of the birthmark in the prompt examples?

    -The video notes that both Mid Journey version 6 and Stable Diffusion models failed to capture the birthmark detail in the prompt examples. It indicates that while this detail was not accurately represented, it is considered acceptable given the overall quality of the generated images.

  • What does the video suggest as a potential improvement for Mid Journey's generator?

    -The video suggests that creating a dedicated site for the generator and introducing presets or templates similar to those used with Lura for Stable Diffusion could be potential improvements for Mid Journey's generator.

Outlines

00:00

πŸ–ΌοΈ Introduction to Mid Journey Version 6 and Image Prompting

This paragraph introduces the latest Mid Journey Version 6, highlighting its advanced features in image generation using AI. It discusses the new prompting method that has evolved from generic terms to natural language descriptions, akin to explaining to a human. The speaker, Toby, compares the new model with other AI models like Stable and Fusion, emphasizing the improved quality and detail in the generated images. A specific example is given, describing an image of a woman by the sea, comparing it with an industry-leading model, Juggernaut XL based on Stable Diffusion XL. The comparison shows that Mid Journey Version 6 produces more realistic and detailed images, even capturing subtleties like the texture of hair and dress, although it missed the birthmark detail.

05:01

πŸ” Quality Comparison and Sponsor Mention

The second paragraph continues the quality comparison between Mid Journey Version 6 and other models, focusing on a photo realistic image of an ice bear baby. The speaker shares their personal preference for the Stable Diffusion version but acknowledges that Mid Journey performs better in this instance. However, both models struggle with interpreting the 'thumbs up' in the prompt. The speaker also addresses the channel sponsor, encouraging viewers to subscribe for more educational content. The video concludes with a favorite prompt by the speaker, a photorealistic image of a black Porsche, comparing the outputs of Stable Diffusion and Mid Journey. Mid Journey's ability to capture the entire prompt with impressive detail is praised, showing its reliability and capability to transform ideas into high-quality images without much effort.

Mindmap

Keywords

πŸ’‘AI

Artificial Intelligence (AI) refers to the simulation of human intelligence in machines that are programmed to think and learn like humans. In the context of the video, AI is used to generate images through a platform called Mid Journey, which has been updated to its sixth version. The video discusses how this AI model has improved in terms of understanding and executing prompts to create realistic images.

πŸ’‘Prompting

Prompting in the context of AI image generation refers to the input or instructions given to the AI system to produce a specific output. The video highlights a new way of prompting introduced in Mid Journey's version six, where users can input prompts in natural language, similar to explaining to a human, instead of using generic terms or keywords.

πŸ’‘Mid Journey

Mid Journey is an AI image generation platform that has been updated to its sixth version, as discussed in the video. This platform is noted for its ability to reshape the process of generating images with AI by introducing a more natural and intuitive prompting system.

πŸ’‘Image Quality

Image quality refers to the clarity, detail, and overall visual appeal of an image. In the video, the speaker compares the image quality produced by Mid Journey version 6 with that of another industry-leading model, Juggernaut XL, to demonstrate the advancements and improvements in Mid Journey's image generation capabilities.

πŸ’‘Stable Diffusion

Stable Diffusion is a type of AI model used for image generation. The video compares the performance of Mid Journey version 6 with Stable Diffusion, specifically the Juggernaut XL model, to highlight the differences in how each model interprets and executes prompts to generate images.

πŸ’‘Photorealistic

Photorealistic refers to images that are incredibly detailed and lifelike, resembling real photographs. In the video, the speaker uses the term to describe the level of detail and realism they were aiming for in the images generated by the AI models.

πŸ’‘Natural Language

Natural language refers to the way humans naturally communicate with each other, using speech or writing. In the context of the video, natural language is used to describe the new prompting method in Mid Journey version 6, where users can describe what they want in a more conversational manner, rather than using technical or generic terms.

πŸ’‘Customizability

Customizability refers to the ability to modify or adjust a product or service to meet individual needs or preferences. The video mentions that the Mid Journey team still needs to work on improving the customizability of their image generation platform, suggesting that users may want more freedom and options in how they generate their images.

πŸ’‘Discord

Discord is a communication platform designed for communities, including gaming communities, and it allows users to interact via voice, video, and text. In the video, Discord is mentioned as the current platform where the Mid Journey generator operates, suggesting that it may not be the most ideal environment for such a service.

πŸ’‘Privacy

Privacy in this context refers to the protection of user-generated content from being visible to others. The video discusses the need for users to upgrade to a higher plan to ensure privacy, as their images can be seen by others in the base plan.

πŸ’‘Presets or Templates

Presets or templates are pre-defined settings or configurations that users can select to quickly generate content with specific characteristics. The video suggests that introducing presets or templates for the Mid Journey generator could enhance the user experience by providing quick and easy options for common image generation tasks.

Highlights

Mid Journey version 6 introduces a new way of prompting for images, moving away from generic terms.

The new prompting method allows users to describe images in natural language, as if explaining to a human.

Using old 'junk keywords' like 4K, 8K, and photo realistic is now discouraged in Mid Journey version 6.

An example prompt is provided, describing a portrait of a woman in a natural language style.

Mid Journey's images are compared to industry-leading stable diffusion models, specifically Juggernaut XL.

In the quality comparison, Mid Journey's images show superior detail and realism over stable diffusion models.

The new version captures the entire prompt better, with less need for multiple attempts to get the desired image.

Mid Journey's generator currently runs on Discord, suggesting a potential need for a dedicated site in the future.

The video mentions the potential for presets or templates similar to those available with other AI models.

Privacy concerns are raised, as users have to pay extra for a higher plan to hide their images from public view.

The introduction of Mid Journey version 6 is seen as beneficial for all, as it will push other software to adapt similar technologies.

The video encourages viewers to engage by leaving a thumbs up and asking questions in the comments section.

The video concludes by stating that while Mid Journey is the best currently, competition will drive continuous improvement.

The presenter, Toby, is from Visual Toby and provides educational content on AI and image generation.

The video includes a sponsored message promoting the channel for similar educational content.

A specific image prompt about a black Porsche with violet front lights is discussed, highlighting the generator's attention to detail.

The video emphasizes the ease of use and reliability of Mid Journey, allowing users to bring their ideas to life with precision.