Midjourney 5 must be stopped at all costs

Fireship
16 Mar 2023 · 03:24

TLDR: The video discusses the release of Midjourney's version 5 AI model, which generates hyper-realistic images. It highlights the impact on jobs and creativity: AI can now produce models and art, potentially making human creation obsolete. It mentions the U.S. Copyright Office's stance on AI-generated art and the monetization strategies of companies like Midjourney and OpenAI. The video also gives a short tutorial on using Midjourney through Discord to create AI-generated images, covering both the potential and the challenges of the technology.

Takeaways

  • 🚀 Midjourney has released its version 5 model in alpha, showcasing highly realistic AI-generated images.
  • 😲 The quality of AI-generated images is high enough to make the work of models and human artists seem obsolete.
  • 👨‍💻 The speaker, a programmer and content creator, has been exploring new career paths because of AI's impact on his industry.
  • 🏢 Various companies and projects are competing to develop the best generative image model, with Stable Diffusion leading as an open-source project.
  • 🤖 The U.S. Copyright Office has ruled that generative AI art cannot be copyrighted without proof of human authorship.
  • 💰 Companies providing AI models, like Midjourney and OpenAI, stand to profit significantly from the subscription-based services they offer.
  • 🎨 AI-generated art could devalue human creativity, as companies may steal and remix art to produce countless variations.
  • 🔍 Midjourney operates on Discord and currently doesn't have an API, but users can generate images using the /imagine slash command.
  • 🖼️ Users opt in to the version 5 alpha, which produces highly realistic human images, with the V flag (--v 5) and can increase quality with the Q flag (--q).
  • 🔄 Users can also provide a starter image via a hyperlink to generate new artwork or recreate images of people, such as long-lost relatives.

Q & A

  • What significant update did Midjourney release in mid-March 2023?

    -Midjourney released its version 5 model in alpha in mid-March 2023 (the video is dated March 16, 2023). The model is capable of producing AI images that are shockingly realistic.

  • How does the AI-generated image of the guy with a shocked face relate to the content creator who lost his job to AI?

    -The AI-generated image of the guy with a shocked face symbolizes the content creator's surprise and concern over losing his job to AI, highlighting the rapid advancement and impact of AI in various industries.

  • What is the current status of the modeling industry in light of AI-generated models?

    -The modeling industry has been significantly affected as AI can now generate models in all shapes and sizes, making it difficult to distinguish between real and AI-generated models unless one looks closely at the details like fingers.

  • Which projects are competing to be the best generative image model in 2023?

    -In 2023, several projects are competing in the generative image model space. Stable Diffusion is the leading open-source project, while numerous closed-source projects, such as DALL-E from OpenAI, are trying to monetize the space.

  • What was the U.S. Copyright Office's recent ruling on generative AI art?

    -The U.S. Copyright Office recently ruled that generative AI art cannot be copyrighted because proof of human authorship is required. However, if AI art is modified by a human, it could become eligible for copyright on a case-by-case basis.

  • How has OpenAI's status changed since its inception?

    -OpenAI started as a non-profit organization but transitioned to a for-profit model when they realized the potential for significant financial gain from their AI technologies.

  • What are the implications of AI-generated art on human creativity?

    -AI-generated art could potentially harm human creativity by reducing the incentive for individuals to create original works, as companies might steal and remix AI creations into countless variations, making it difficult to distinguish between original and derivative works.

  • How can one use Midjourney's version 5 model to create realistic images?

    -To use Midjourney's version 5 model, join the Midjourney community on Discord, use the /imagine slash command, and describe the desired image. The model generates four variations, from which users can request further variations or upscale individual images. The V flag (--v 5) selects the version 5 model, which is what produces the highly realistic human images, and the Q flag (--q) increases the quality.

  • What is the potential future development for Midjourney's platform?

    -While currently handled entirely in Discord, it is anticipated that Midjourney may introduce an API in the future, which would further expand its accessibility and integration into various applications.

  • How can users provide a starter image for Midjourney to generate new artwork?

    -Users can provide a starter image by including a hyperlink to any image URL on the internet. This allows for the creation of new artwork based on existing images, such as bringing a long-lost relative back to life in a new piece of art or photo, although it may not be deepfake accurate.

  • What is the one glimmer of hope for digital creators in the face of AI advancements?

    -The glimmer of hope for digital creators is that nobody may be able to tell whether their creations were made with AI. If human and AI-generated art become indistinguishable, that raises philosophical questions about the value and nature of human creativity in the digital age.
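
The starter-image workflow from the Q&A above can be sketched in a few lines of Python. This is only an illustration: Midjourney has no API according to the video, so the code does not talk to Midjourney at all; it just assembles the text a user would paste after /imagine in Discord. The helper name `with_starter_image` and the example URL are hypothetical, and placing the image URL at the start of the prompt is an assumption about how Midjourney expects image prompts.

```python
from urllib.parse import urlparse


def with_starter_image(image_url: str, description: str) -> str:
    """Build prompt text that starts with an image URL (hypothetical helper).

    Nothing is sent anywhere; the result is meant to be pasted after
    `/imagine` in Discord by hand.
    """
    parts = urlparse(image_url)
    # Only accept absolute web links, since the image must be reachable online.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not a web URL: {image_url!r}")
    # Assumption: the image link comes first, followed by the text description.
    return f"{image_url} {description.strip()}"


print(with_starter_image(
    "https://example.com/grandpa.jpg",  # placeholder URL, not a real image
    "oil painting of a sailor, 1920s",
))
```

Validating the URL up front catches the most common mistake, passing a local file name instead of a link, before the prompt ever reaches Discord.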

Outlines

00:00

🚀 Introduction to AI's Impact on Content Creation

The video begins by announcing the release of Midjourney's version 5 model in alpha, highlighting its startlingly realistic AI-generated images. The speaker, a programmer and content creator who jokes that he lost his job to AI, humorously considers a career in modeling before acknowledging that AI has made models obsolete too. The video discusses various companies and projects, such as Stable Diffusion and DALL-E from OpenAI, competing to be the best generative image model. It also touches on the ethical and legal implications of AI-generated art, mentioning the U.S. Copyright Office's ruling that generative AI art cannot be copyrighted without proof of human authorship.

Keywords

💡Midjourney

Midjourney is the company mentioned in the script that has released its version 5 model in alpha and specializes in AI-generated images. The release represents a technological advance in generative models. The video discusses the impressively realistic images produced by Midjourney's AI and their impact on industries such as modeling and art. The script uses Midjourney as an example of AI's growing ability to create content that was traditionally produced by humans.

💡AI-generated images

AI-generated images are visual outputs created by artificial intelligence models, typically from a text prompt, rather than drawn or photographed by a person. In the context of the video, these images are produced by models like Midjourney's version 5, which can create highly realistic and aesthetically pleasing visuals. The script emphasizes the quality of these images, noting that they can closely mimic human-made content, which raises questions about the future of human creativity and the potential obsolescence of certain professions.

💡open source project

An open source project is a collaborative effort whose source code is made publicly available, allowing anyone to view, use, or modify it. The script mentions Stable Diffusion as the leading open source project in the generative image model space. Open source projects matter because they foster innovation and community involvement, enabling a broader range of individuals and organizations to contribute to and benefit from the technology.

💡copyright

Copyright refers to the legal rights granted to creators of original works, including the exclusive right to reproduce, distribute, and display their work. The video mentions that the U.S. Copyright Office ruled generative AI art cannot be copyrighted unless human authorship can be proven. This highlights the complexities of intellectual property in the age of AI, where traditional notions of authorship and originality are challenged by AI's ability to generate new content.

💡DALL-E

DALL-E is mentioned in the script as a closed-source project developed by OpenAI that competes in the generative image model space. The reference to DALL-E signals the commercial interests and monetization strategies being pursued by companies in the AI sector. It underscores the dual nature of AI advancements: on one hand, they offer innovative tools and services, while on the other, they raise questions about the sustainability of traditional creative industries.

💡Copilot

Copilot, as mentioned in the script, most likely refers to GitHub Copilot, an AI coding assistant from GitHub that is built on OpenAI models. The reference to paying a monthly fee for Copilot points to a subscription-based model for accessing AI services, indicating the commercialization of AI technology and its integration into everyday work.

💡ChatGPT

ChatGPT, OpenAI's conversational AI chatbot, is mentioned as another service for which the speaker pays a monthly fee. Its inclusion in the script highlights the growing market for AI tools that support communication and information retrieval, and the expanding role of AI in everyday digital interactions.

💡mid-journing

The term 'mid-journing' is used in the script as a verb for using Midjourney's AI model to create images. It suggests an exploration of what AI can do in art and creativity, and reflects the changing landscape of artistic creation and human-AI collaboration.

💡Discord

Discord is a communication platform mentioned in the script as the medium through which Midjourney operates. Users generate images there with the /imagine slash command, and the platform also serves for user interaction, support, and community building around the model. By mentioning Discord, the script highlights the role of online communities and platforms in distributing and using AI technologies.

💡prompt engineering

Prompt engineering is the process of crafting specific and effective prompts or instructions so that an AI system generates the desired output. In the video, it is mentioned as a skill that users of Midjourney's model can develop, allowing them to create particular types of images by describing their ideas accurately. The concept underscores the importance of clear communication between humans and AI systems and the ability of users to shape AI-generated content according to their intentions.

💡V flag

The 'V flag' (--v) is a parameter mentioned in the script that selects which version of Midjourney's model handles a prompt. In the video it is used to opt in to the version 5 alpha (--v 5), the version that produces the highly realistic images of humans. The reference to the V flag illustrates the technical side of interacting with AI tools and the level of control available to users.

💡Q flag

The 'Q flag' is another parameter mentioned in the script for adjusting the output of AI-generated images. It is used to increase the quality of the images produced by the AI model. The inclusion of the Q flag in the discussion demonstrates the various settings and adjustments that users can make to fine-tune the AI's output, reflecting the complexity and adaptability of AI systems in catering to diverse user needs and preferences.
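
As a rough illustration of the V and Q flags described above, the following Python sketch assembles the text a user might type after /imagine. It makes several assumptions: the helper name `build_imagine_prompt` is hypothetical, the flags are written as `--v` and `--q`, and the quality value is passed through unchecked because the accepted values depend on the model version. Since Midjourney has no API according to the video, the code only builds a string and sends nothing.

```python
def build_imagine_prompt(description: str, version: int = 5, quality: float = 1.0) -> str:
    """Return the text to type after `/imagine` in Discord (hypothetical helper)."""
    description = description.strip()
    if not description:
        raise ValueError("description must not be empty")
    # `--v` picks the model version; `--q` asks for more or less rendering effort.
    # The `:g` format drops a trailing ".0", so 2.0 is written as "2".
    return f"{description} --v {version} --q {quality:g}"


prompt = build_imagine_prompt("portrait photo of an elderly fisherman", quality=2)
print("/imagine prompt:", prompt)
```

Keeping the flags at the end of the string mirrors how they are described in the video: the description comes first, and parameters are appended after it.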

Highlights

Midjourney released its version 5 model in alpha with AI-generated images that are shockingly realistic.

The man with a shocked face shown in the video is entirely AI-generated, showcasing the impressive capabilities of the technology.

The content creator, who lost his job to AI, explores the possibility of a new line of work in the face of technological advancements.

The obsolescence of models due to the ability to generate them in all shapes and sizes through AI.

The competition among companies and projects to be the best generative image model in 2023, with Stable Diffusion leading as the top open-source project.

The challenge of copyrighting generative AI art due to the need for proof of human authorship.

The potential for AI to democratize creativity by making it accessible to almost everyone through subscription services like Midjourney, Copilot, and ChatGPT.

The concern that AI-generated art may devalue human creativity and remove incentives for true human talent.

The process of using Midjourney's AI to generate images, including the use of Discord and the /imagine slash command.

The ability of the version 5 alpha, selected with the V flag (--v 5), to produce highly realistic images of humans.

The Q flag's role in increasing the quality of AI-generated images.

The ability to manipulate the output image's aspect ratio and chaos level to control the randomness and uniqueness of the generated content.

The potential of using a starter image from a hyperlink to create new artwork or bring back memories, such as long-lost relatives.

The impact of generative AI on digital creators and the potential for AI-made content that is indistinguishable from human-made.

The hope for digital creators that AI-assisted work may not be distinguishable from human-made art, which in turn challenges the special value placed on human-made art.
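
The aspect ratio and chaos controls mentioned in the highlights can be sketched the same way. This is a minimal sketch under stated assumptions: the helper name `add_layout_flags` is hypothetical, the flags are written as `--ar` and `--chaos`, and the 0 to 100 range for chaos is an assumption about what Midjourney accepts. The code only builds prompt text for pasting into Discord.

```python
import re


def add_layout_flags(prompt: str, aspect_ratio: str = "1:1", chaos: int = 0) -> str:
    """Append aspect-ratio and chaos flags to prompt text (hypothetical helper)."""
    # Aspect ratio is written as width:height using positive whole numbers.
    if not re.fullmatch(r"[1-9]\d*:[1-9]\d*", aspect_ratio):
        raise ValueError(f"aspect ratio must look like '16:9', got {aspect_ratio!r}")
    # Assumed range: 0 gives the most predictable results, 100 the most varied.
    if not 0 <= chaos <= 100:
        raise ValueError(f"chaos must be between 0 and 100, got {chaos}")
    return f"{prompt.strip()} --ar {aspect_ratio} --chaos {chaos}"


print(add_layout_flags("city skyline at dusk", aspect_ratio="16:9", chaos=40))
```

Rejecting malformed values locally is a design choice for the sketch: it gives a clear error message instead of relying on whatever the Discord bot reports.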