【Stable Diffusion】構図・アングル・視線の呪文集プロンプトをまとめて紹介
TLDRThe video script offers a comprehensive guide on creating compositions and selecting angles for AI-generated images. It emphasizes the importance of mastering specific 'spells' or prompts, such as 'head shot' and 'cowboy shot', to achieve desired compositions like full-body views or shots from above or below. The script also discusses image size adjustments and the use of prompts to control the character's line of sight, ultimately aiming to improve the quality and accuracy of AI-generated content without extensive trial and error.
Takeaways
- 🎨 Understanding composition and angle spells is crucial for efficient image generation without the need for repetitive generation processes.
- 📸 To capture an image from the chest area upwards, use the 'head shot' composition spell, which often extends to the chest area despite its name.
- 🗿 Generating a square image size, like 512x512, can facilitate easier and potentially higher quality compositions.
- 👤 For compositions starting from the thigh area, the 'cowboy shot' spell is effective, though it may initially default to images with a hat, which can be mitigated by adding ((no hat1.3)) to the prompt.
- 📏 Adjusting the image size to a vertical长方形, such as 512x1024, can improve the success rate of generating thigh-focused compositions without hats.
- 🚶♂️ To depict a full body composition, using 'full body' as a prompt along with an image size of 512x1024 or 512x768 with 'hires.fix' can enhance the quality, especially around the eyes.
- 👀 The quality of the eyes in the generated image can be a significant factor, thus using 'hires.fix' may increase processing time but dramatically improves eye detail.
- 🎯 For directional angles, using specific spells like 'shoot from front', 'above', 'below', 'side', and 'behind' allows for precise control over the perspective.
- 👀 Controlling a person's line of sight in the image can be achieved with spells like 'looking viewer', 'up', 'down', 'side', 'back', and 'looking away'.
- 🔍 If the desired composition or angle is not achieved, using methods like 'img2img', 'openpose', or changing the model may offer solutions.
- 📚 The video script serves as a comprehensive guide for users to master the use of composition and angle spells for effective image generation.
Q & A
What is the main topic of the video?
-The main topic of the video is about creating compositions and controlling angles in image generation using various spells or prompts.
How can one generate an image from the chest area upwards?
-To generate an image from the chest area upwards, you can insert the 'head shot' spell in the prompt.
Why should certain spells like 'skirt' or 'denim' be removed from the prompt?
-These spells should be removed because they can interfere with the desired composition, causing the lower body elements to not generate properly.
What is the recommended image size for generating high-quality compositions?
-A square image size, such as 512 by 512, is recommended as it makes it easier to generate high-quality images.
How can the tendency to generate a hat in a 'cowboy shot' be reduced?
-By adding '(no hat 1.3)' after 'cowboy shot' in the prompt, the tendency to generate a hat can be reduced.
What is the recommended method for composing a full-body image?
-The most recommended method for a full-body composition is to set the width to 512 and the height to 768, and use 'hires.fix' for the highest quality.
How can the angle of the generated image be controlled?
-The angle can be controlled by using specific spells like 'shoot from front', 'shoot from above', 'shoot from below', 'shoot from side', and 'shoot from behind'.
What spell can be used to make the generated character look at the camera?
-The 'looking viewer' spell can be used to make the generated character look at the camera.
If a desired composition or angle is not achieved, what alternative methods can be tried?
-If the desired composition or angle is not achieved, one can try changing the image size, using 'img2img', 'openpose', or 'openpose editor' based on a reference image, or changing the model.
What is the significance of understanding and inputting spell content?
-Understanding and inputting spell content allows for more efficient image generation, reducing the time spent until the ideal image is produced, and improving the overall quality of the generated images.
What is the role of 'Ai Gene' in the context of the video?
-Ai Gene is mentioned as a source that disseminates information about generated AI, suggesting that the video content is educational and aimed at enhancing the viewers' knowledge of AI-generated content.
Outlines
🎨 Understanding Composition and Angle Viewpoints
This paragraph discusses the intricacies of creating visual compositions using specific spells to capture the desired image. It emphasizes the importance of mastering composition and angle viewpoint spells to generate images efficiently without the need for multiple iterations. The paragraph explains how to generate images from different parts of the body, such as chest shots and cowboy shots, and how to adjust the image size for optimal results. It also highlights the use of spells to control the character's line of sight, such as looking at the camera, looking up, down, or away, and the impact of these spells on the final image. The paragraph concludes with a recommendation to use 'img2img' or 'openpose' if the desired composition or angle is not achieved through spells alone.
🔍 Advanced Techniques for Image Generation
The second paragraph delves into advanced techniques for achieving specific compositions and angles in image generation. It addresses the challenges of generating images with upward or downward gazes and the necessity of using spells like 'looking up' or 'looking down' to achieve these effects. The paragraph also explores the use of 'looking side' and 'looking back' spells to capture images from unique perspectives. It suggests increasing the value after the colon in spells like 'looking away' to ensure the desired outcome. The paragraph further discusses the potential need to adjust image sizes or switch to different models if the initial attempts do not yield satisfactory results. It concludes with a reminder to subscribe to the channel for more content on generated AI and thanks the viewer for their interest.
Mindmap
Keywords
💡Composition
💡Image Size
💡Spells
💡Angles
💡Eye Quality
💡Full Body
💡Cowboy Shot
💡Hires.Fix
💡Negative Prompt
💡Img2img
💡Openpose
Highlights
The importance of mastering composition and angle spells for efficient image generation.
Using 'head shot' to generate a composition from the chest area upwards.
The necessity of removing spells like 'skirt' or 'denim' when aiming for a specific composition.
Creating a square image size, such as 512x512, for easier generation.
Generating a high-quality image from the chest to the top without eye collapse.
The 'cowboy shot' for composing from the middle of the thigh.
Preventing the generation of a hat in the 'cowboy shot' by adding ((no hat1.3)) to the prompt.
Adjusting image size to 512x1024 for easier composition from the upper thighs without a hat.
Entering 'full body' to compose the entire body in the image.
Setting the width to 512 and height to 1024 for better quality in a standing composition.
Recommendation to use 'hires.fix' for the highest quality in a full-body image.
Entering 'shoot from front' for a frontal angle with the same line of sight as the character.
Generating an angle from above by entering 'shoot from above'.
Using 'looking viewer' to make the character look at the camera.
Specifying the angle of gaze with 'looking up: 1.2' or 'looking down: 1.2'.
The possibility of using 'img2img', 'openpose', or 'openpose editor' when spells do not yield the desired result.
Changing the model as a last resort if the desired composition or angle is not achieved.