Loki - Live Portrait - NEW TALKING FACES in ComfyUI !

FiveBelowFiveUK
7 Jul 2024 · 11:23

TLDR: In this video, the creator introduces the new Live Portrait feature in ComfyUI, which allows for real-time face swapping and animation using Loki's latest edition. The process involves creating face models, integrating them with the Trio T pose workflow, and animating them using Hedra. The video also covers the installation of Live Portrait, syncing audio with the animation, and using various models to achieve a more refined and accurate result. Viewers are guided through the workflow, from setting up to the final output, showcasing the potential for creating dynamic and expressive talking head animations.

Takeaways

  • 😀 The video introduces a new feature for Loki, a face swap tool with batch modes, allowing for animations and face model creation.
  • 🔄 The Loki tool can save and load face models using the Trio T pose workflow, ensuring consistency in face models across images.
  • 📚 The script mentions the use of Hedra for animating the created face models, demonstrating a workflow that integrates multiple tools.
  • 📝 The presenter provides a tutorial on how to create a face model in the Loki tool, including adding images and naming the model.
  • 🎨 Live Portrait KJ is highlighted as a feature that can be used for face swapping right away with the provided workflow.
  • 🔧 The script details updates to the ComfyUI workflow, including frame rate fixes and audio synchronization so that the animation matches the source video.
  • 📦 Additional models are required for the Live Portrait feature, which are small in size and can be easily integrated.
  • 🎥 The Live Portrait workflow is updated to include video and audio synchronization, simplifying the process of creating talking head animations.
  • 🤖 The use of a 'removable head' and 'tracking stick' is introduced to address resolution mismatches between heads and bodies in animations.
  • 💬 The video demonstrates the tool's ability to analyze and animate faces in real time, with the presenter providing commentary.
  • 🔍 The presenter suggests using dewarp stabilizers to solve issues of face distortion during head movements in animations.
  • 🎭 The script concludes with suggestions on using overdubbing to reduce generation costs and enhance character animations.

Q & A

  • What is the main topic discussed in the video?

    -The main topic discussed in the video is the latest edition of Loki, which includes a new feature for creating and animating talking faces using face models and the Live Portrait workflow in ComfyUI.

  • What is the purpose of the Loki face swap with batch modes?

    -The Loki face swap with batch modes is primarily used for creating animations and face swapping, but it also allows users to save and create face models that can be loaded into the Trio T pose workflow.

  • How can users create a face model in the Loki workflow?

    -Users can create a face model in the Loki workflow by selecting an image, giving the model a name, and running the process, which then creates the ReActor face model.

  • What is the role of Hedra in the animation process described in the video?

    -Hedra is used to animate the images that have been processed with the Loki workflow, allowing for the creation of talking head animations.

  • What updates were made to the Live Portrait workflow in ComfyUI?

    -The updates to the Live Portrait workflow in ComfyUI include fixing the frame rate to match the source video and incorporating audio to synchronize with the animation, making the process more seamless.

  • What are the additional models required for the Live Portrait workflow and where can they be found?

    -The additional models required for the Live Portrait workflow are six small models, which can be found in the description and the workflow links provided in the video.

  • How does the Live Portrait workflow handle audio synchronization?

    -The Live Portrait workflow uses the video info node to take the frame rate and audio from the source video, ensuring that the animation matches the speaking in the input video.

  • What is the significance of the removable head and tracking stick in the animation process?

    -The removable head and tracking stick allow for better matching of face and body resolutions, enabling more accurate and detailed animations.

  • How does the video driver in the Live Portrait workflow function?

    -The video driver in the Live Portrait workflow performs the heavy lifting by using the six models to analyze and animate the face, creating a talking head video.

  • What are some tips for improving the quality of the talking head animation?

    -To improve the quality of the talking head animation, ensure good lighting and the correct angle for the input video. Also, consider using dewarp stabilizers to reduce face distortion and overdubbing for better lip-syncing.

  • Where can viewers find the workflow for the Live Portrait feature?

    -Viewers can find the workflow for the Live Portrait feature on Civitai, with a link provided in the video description.
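As a rough illustration of why the Q&A stresses taking the frame rate from the source video, the sketch below computes how far a rendered animation would drift from its audio if it were written at a different frame rate. The `VideoInfo` class and both function names are hypothetical and are not part of any ComfyUI node; the numbers are made up for the example.

```python
from dataclasses import dataclass


@dataclass
class VideoInfo:
    """Hypothetical stand-in for what a video-info step might report."""
    fps: float
    frame_count: int


def duration_seconds(info: VideoInfo) -> float:
    """Length of the clip (and of its audio track) in seconds."""
    return info.frame_count / info.fps


def audio_drift_seconds(source: VideoInfo, output_fps: float) -> float:
    """Output video length minus audio length.

    Negative means the video finishes before the audio does.
    """
    output_duration = source.frame_count / output_fps
    return output_duration - duration_seconds(source)


# Illustrative values: a 10-second driving clip at 25 fps.
source = VideoInfo(fps=25.0, frame_count=250)
print(duration_seconds(source))           # 10.0
print(audio_drift_seconds(source, 25.0))  # 0.0 when the frame rates match
print(audio_drift_seconds(source, 30.0))  # negative: video ends early
```

With matching frame rates the drift is zero, which is the situation the updated workflow aims for by reusing the source video's frame rate and audio.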

Outlines

00:00

🎭 Introduction to Loki's Face Swap and Animation Workflow

The script begins with a warm welcome and an introduction to the latest edition of Loki, a tool for face swapping and animation. It discusses the release of a new feature that allows face models to be created and saved, then loaded into the Trio T pose workflow. The speaker demonstrates how to create a face model from images and highlights the ability to animate these models with Hedra. The workflow itself is unchanged, and a Hedra video is included so users can experiment right out of the box.

05:00

🤖 Enhancing Live Portraits with Improved Synchronization and Customization

This paragraph covers the enhancements made to the Live Portrait feature in Loki. It discusses the integration of a text-to-speech video and the process of creating 2D puppets from T-poses using ControlNet. The speaker addresses the resolution mismatch between faces and bodies, and introduces a solution with a removable head and tracking stick. The script also covers animating with a webcam and the benefits of the new workflow, including improved speed, control, and accuracy. The speaker mentions the potential for character expression and emotional traits in the animations and hints at further exploration in an upcoming deep dive video.
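The resolution mismatch mentioned above comes down to a scale factor between the animated head and the space it occupies on the body. The following is a minimal sketch of that arithmetic only, not the method used in the video; the function names and pixel sizes are assumptions chosen for illustration.

```python
def head_scale(target_head_height_px: int, rendered_head_height_px: int) -> float:
    """Factor by which the rendered head must be resized to fit the body."""
    if rendered_head_height_px <= 0:
        raise ValueError("rendered head height must be positive")
    return target_head_height_px / rendered_head_height_px


def scaled_size(width: int, height: int, scale: float) -> tuple[int, int]:
    """Resize dimensions by a uniform scale, rounded to whole pixels."""
    return (round(width * scale), round(height * scale))


# Illustrative values: a 512x512 animated head must fit a 128 px tall
# head region on a full-body image.
scale = head_scale(target_head_height_px=128, rendered_head_height_px=512)
print(scale)                         # 0.25
print(scaled_size(512, 512, scale))  # (128, 128)
```

A scale well below 1 means detail rendered at the higher head resolution is discarded when the head is pasted onto the body, which is one way to think about why the two are handled separately.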

10:02

🔧 Final Thoughts on Workflow and Future Improvements

The final paragraph wraps up the script with thoughts on the workflow and potential future improvements. It discusses the possibility of overdubbing to reduce generation costs and suggests using stock animated heads for characters that are not the focus of a shot. The speaker provides a link to the workflow on Civitai and thanks the developers of the nodes and the viewers for their support. The script ends with a humorous reference to Sir Humphry Davy and a promise to see the audience in the next video.

Keywords

💡Loki

Loki is the face swap tool discussed in the video. It offers batch modes for animation and can create and save face models. It is central to the video's theme as the primary subject of the tutorial.

💡Face Swap

Face Swap is a technique used in video editing and animation to replace the face of an individual in a video with another face, often for comedic or artistic effect. In the context of the video, it is a feature of the Loki software that allows users to create animations with swapped faces.

💡Batch Modes

Batch Modes refer to the ability to process multiple files or tasks simultaneously, which is a feature of the Loki software mentioned for creating animations or face swaps more efficiently. It is significant in the video as it speeds up the workflow for the user.

💡Face Models

Face Models in this context are digital representations of faces that can be manipulated and used in animations. The script describes the process of saving and creating these models using the Loki software, which is a key aspect of the video's tutorial.

💡Trio T Pose

Trio T Pose is a workflow or method mentioned in the script for loading and using face models in the Loki software. It is part of the process of integrating the created face models into images, which is a crucial step in the animation process.

💡Hedra

Hedra is a tool or software used in conjunction with Loki to animate the created face models. The script describes using Hedra to animate the images, indicating it as an essential component of the animation workflow presented in the video.

💡Live Portrait

Live Portrait is the feature, used here inside ComfyUI, that creates animated portraits mimicking the movements and speech of a source video. It is highlighted in the script as a new addition to the Loki workflow.

💡ComfyUI

ComfyUI is the node-based interface in which the Loki and Live Portrait workflows run. Users drag and drop elements into it to build their animations, and it is the platform through which they interact with the features shown in the video.

💡Text to Speech

Text to Speech (TTS) is a technology that converts written text into spoken words. In the video, TTS is used to create a video with a talking head from a cropped JPEG image, demonstrating the versatility of the Loki software.

💡Tracking Stick

A Tracking Stick is a tool used in the animation process to help track and animate movements, particularly the head. The script mentions a removable head with a tracking stick underneath, which helps in aligning the animated face with the body movements.

💡Deep Dives

Deep Dives refer to in-depth explorations or detailed tutorials on specific topics related to the software. The script mentions that these have been covered in previous videos, indicating that the video is part of a series that provides comprehensive guides on using the Loki software.

💡Dewarp Stabilizers

Dewarp Stabilizers are tools used in video editing to correct distortions or 'liquify' effects that can occur during the animation process. The script suggests using them to solve issues with face distortion in the animation, showing the importance of post-processing in achieving a polished result.

Highlights

Introduction of Loki's latest edition with advanced face swap and batch modes.

Ability to save and create face models and load them using Trio T pose workflow.

Use of Hedra to animate faces created in images.

Demonstration of creating a face model with the new workflow.

Inclusion of a Hedra video for users to experiment with out of the box.

Fixing frame rate issues for better synchronization with source video.

Installation of Live Portrait with updated nodes for frame rate and audio.

Requirement of extra models for Live Portrait and their minimal size.

Instructions on how to integrate models into the Live Portrait workflow.

Update on the Live Portrait workflow with new features for audio synchronization.

Use of video nodes from the ComfyUI Video Helper Suite for better video processing.

Explanation of how to use the video loader for frame rate and audio matching.

Capability to use various sources for talking head videos including text to speech.

Introduction of T-pose using face models and ControlNet for 2D puppet creation.

Solution to the problem of mismatched resolution between heads and bodies.

Demonstration of Live Portrait's speed and efficiency in processing video.

Discussion of the potential for characterization and emotional traits in animations.

Mention of future plans for deep dive part three focusing on ComfyUI enhancements.

Advice on overdubbing to reduce generation costs and improve lip sync accuracy.

Availability of the workflow through a provided link for interested users.

Acknowledgment of node developers and thanks to the viewers for their engagement.
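
Since the highlights mention that Live Portrait needs extra model files, a small pre-flight check can confirm they are all in place before running the workflow. This is a minimal sketch under stated assumptions: the function name `missing_models` is hypothetical, and the file names are placeholders, because the summary does not list the actual names or the folder they belong in.

```python
import tempfile
from pathlib import Path


def missing_models(model_dir: Path, expected: list[str]) -> list[str]:
    """Return the expected file names that are not present in model_dir."""
    return [name for name in expected if not (model_dir / name).is_file()]


# Placeholder names only; substitute the real list from the workflow links.
EXPECTED = [f"model_{i}.safetensors" for i in range(1, 7)]

with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    # Simulate a partial download: only the first four files exist.
    for name in EXPECTED[:4]:
        (folder / name).touch()
    print(missing_models(folder, EXPECTED))
    # ['model_5.safetensors', 'model_6.safetensors']
```

An empty list means every expected file was found.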