EXPLORE the NEW AI Voices: VoiceMod App Unveiled!

Bob Doyle Media
18 Jul 202330:13

TLDRThe VoiceMod app, once a novelty, has evolved with new AI voices and features. This video explores its capabilities, including voice cloning, soundboard improvements, and custom voice creation, showcasing the potential for voice enhancement in live streaming and other applications.

Takeaways

  • 😀 The VoiceMod app has made significant improvements and additions, making it more than just a toy.
  • 🎙️ Voice cloning and changing are fascinating aspects of AI technology, and VoiceMod is a major contender in this field.
  • 📱 The app allows users to convert their voice into different sounds, although earlier versions had watermarks.
  • 🔊 The soundboard feature has been improved for better organization and practical use in various scenarios.
  • 🎶 Users can adjust volume, loop, and mute sounds, and even bind them to key presses for ease of use.
  • 🔬 The Voice Lab offers a way to create custom voices from scratch by adjusting various parameters.
  • 🗣️ Text-to-speech functionality is available, allowing users to type in text and have it read back in different voices.
  • 🎭 The app includes a variety of AI realistic voices, some of which can be heavily modulated or have a UK accent.
  • 🎵 Background effects and music can be added to voices, with options to adjust volume and mix between dry and reverberated sounds.
  • 🌐 The app also features a store where users can purchase additional sounds and effects.

Q & A

  • What is the main focus of the video script discussing?

    -The main focus of the video script is discussing the VoiceMod App, its features, improvements, and the experience of using it for voice cloning and changing.

  • What was the initial impression of the VoiceMod App as mentioned in the script?

    -The initial impression of the VoiceMod App was that it was more of a toy with watermarks on the converted voices, and it didn't receive much attention for a while as other AI technologies were more fascinating.

  • What new features or improvements does the VoiceMod App have according to the script?

    -The VoiceMod App has made improvements and additions to its program, including a variety of new AI realistic voices, the ability to create custom voices, and enhancements to the soundboard for playing sound effects and music.

  • How does the VoiceMod App handle the user's own voice modulation?

    -The VoiceMod App allows users to modulate their voices with various effects, including pitch adjustment, adding reverb, and other sound enhancements to create different voice characters.

  • What is the 'Voice Box' section in the VoiceMod App?

    -The 'Voice Box' is the main section in the VoiceMod App where users can choose from a variety of voices, both old and new, to play with and modify.

  • What is the purpose of the 'Soundboard' feature in the VoiceMod App?

    -The 'Soundboard' feature allows users to play sound effects and music in the background, with improved organization and practicality for real use case scenarios.

  • What is the 'Voice Lab' and how can it be used?

    -The 'Voice Lab' is an area in the VoiceMod App where users can start from scratch and create their own custom voice by adjusting various parameters and saving it.

  • What are the limitations mentioned in the script regarding the AI voices in the VoiceMod App?

    -Some limitations mentioned include the occasional loss of consonant details, making speech sound a bit mushy, and the difficulty in using certain voices with heavy modulation or non-built-in dialects.

  • How does the script describe the 'Text to Speech' feature in the VoiceMod App?

    -The 'Text to Speech' feature allows users to type in text and have it read back by the chosen character voice, but the script suggests that the narrator is not particularly impressed with this feature.

  • What does the script suggest about the real-time performance of the VoiceMod App?

    -The script suggests that the VoiceMod App performs in near real-time, with only a slight echo or delay that can affect the user's experience, especially when trying to modulate their voice extensively.

Outlines

00:00

🎙️ Voice Cloning and Modulation Exploration

The narrator discusses their fascination with AI voice cloning and modulation, mentioning their experience with various tools on different devices. They highlight Voice Mod, an early contender in voice manipulation, which initially had watermarks but has since made significant improvements. The narrator intends to demonstrate the current capabilities of Voice Mod, focusing on new features and the ability to create custom voices, rather than installation or troubleshooting. They provide a brief tour of the app's interface, including the voice box for selecting voices, the soundboard for sound effects and music, and the voice lab for creating custom voices.

05:01

🔊 Soundboard and Voice Enhancement Features

This paragraph delves into the enhanced soundboard feature of Voice Mod, which has been reorganized into categories like space, Troopers, prankster, EDM, and more. The narrator appreciates the improved organization and practicality for real use cases. They describe the ability to adjust volume, loop, and mute sounds, and the option to include or exclude specific sounds when muting all others. The narrator also touches on the voice lab for creating custom voices and the text-to-speech feature, which they find less impressive, and briefly mentions the creator and store sections of the app.

10:01

🎭 Experimenting with AI Voice Effects

The narrator explores the AI realistic voices in Voice Mod, comparing them to older voice effects. They discuss the voice enhancer's ability to deepen or raise the voice and share their perspective as a voice-over artist on the realism and usability of these effects. The paragraph includes a demonstration of various voices, such as Magic Chords, which changes chords based on the narrator's speech, and Odin, which has a drum accompaniment. The narrator also tests the flexibility of new AI voices like Proto and their ability to adjust tone and add expressions or accents, noting the fun and potential of these tools in real-time without special video delay.

15:01

🗣️ Customizing and Testing AI Voice Characters

In this section, the narrator tests various AI voice characters, adjusting pitch and exploring the realism of each. They discuss the limitations of some voices, such as the loss of consonant details and the difficulty of using non-built-in dialects. The narrator also shares their experience with voices like Cameron, which they find grounded but lacking in crispness, and Emma, which they manipulate to sound like two different people. They highlight the challenges of maintaining enunciation due to the fractional delay in the voice effects and the overall fun of experimenting with these AI voices.

20:03

🎛️ Voice Lab Customization and Background Effects

The narrator describes the process of creating a custom voice in the Voice Lab, starting with selecting a persona and adjusting pitch. They experiment with different personas, noting the subtle differences between them and the challenge of maintaining intelligibility. The paragraph also covers the use of background effects and the ability to adjust volume and other parameters to create unique audio environments. The narrator shares their experience with various effects, such as the deep dream setting that alters the background music to sound scary, and the radio demon effect that changes the music to an evil tone when activated.

25:09

🎵 Voice Effects and Soundboard Demonstration

This paragraph showcases the narrator's experience with various voice effects and soundboard options in Voice Mod. They demonstrate the equalizer, pitch flanger, chorus, and tremolo effects, discussing their potential uses and the ease of adjusting volume and other settings. The narrator also explores the voice enhancer feature, which they find useful for live streamers to change the sound of their voice and add background music or ambient sounds. They conclude by encouraging viewers to share their thoughts and experiences with voice-changing solutions and end the video with a personal touch, mentioning their studio setup.

Mindmap

Keywords

💡Voice cloning

Voice cloning refers to the technology that replicates a person's voice, making it sound as if they are speaking even when they are not. In the video, the narrator discusses their fascination with voice cloning and how it has evolved, particularly mentioning the VoiceMod app's capabilities in this area.

💡Voice changing

Voice changing involves altering the pitch, tone, or other characteristics of a person's voice to create a different sound. The script mentions the narrator's experience with tools that change their voice, highlighting the VoiceMod app's ability to convert the user's voice into various effects.

💡VoiceMod app

The VoiceMod app is a software tool that allows users to modify their voice in real-time, often used for gaming, streaming, or other applications where voice alteration is desired. The video script describes the app's features, improvements, and the narrator's personal experience with it.

💡Watermarks

Watermarks are identifiable marks or patterns added to media to indicate its source or ownership. The narrator mentions their dislike for watermarks in the context of the VoiceMod app's earlier versions, which had these marks on the voice effects.

💡Voice enhancer

A voice enhancer is a feature that can modify the depth or resonance of a voice, often used to make it sound more authoritative or distinct. In the script, the narrator discusses the VoiceMod app's voice enhancer and its effects on their voice.

💡Soundboard

A soundboard is a device or software that allows users to play sound effects or music. The video script describes the VoiceMod app's soundboard feature, which has been improved for better organization and usability, enabling users to easily play and control various sound effects.

💡Text to speech

Text to speech (TTS) is a technology that converts written text into spoken words. The narrator briefly mentions the VoiceMod app's text to speech solution, where users can type in text and have it read aloud by the app's characters.

💡Voice lab

The voice lab in the VoiceMod app is a feature that allows users to create custom voices by adjusting various parameters. The script describes the narrator's exploration of this feature and how it enables the creation of unique voice effects.

💡AI realistic voices

AI realistic voices refer to synthesized voices created by artificial intelligence that sound human-like. The video discusses the VoiceMod app's new AI voices, which offer a range of realistic and flexible voice options for users to experiment with.

💡Background effects

Background effects in the context of voice modification are sounds or music that play in the background while the user's voice is being altered. The script mentions how these effects can be adjusted in the VoiceMod app to enhance the overall audio experience.

💡Pitch

Pitch in voice modification refers to the frequency of the voice, which determines how high or low it sounds. The narrator discusses the ability to adjust pitch in the VoiceMod app, allowing users to make their voice sound deeper or higher.

Highlights

VoiceMod App has unveiled new AI voices and improvements, offering a variety of voice cloning and changing options.

The app allows users to convert their voice with watermark-free options, unlike previous versions.

Voice cloning and machine learning algorithms enable the creation of realistic and customizable voice effects.

The VoiceMod App includes a Voice Box for selecting from a range of old and new voices.

Soundboard feature has been improved for better organization and practical use in various scenarios.

Users can adjust volume parameters and loop settings for sound effects, enhancing the user experience.

Voice Lab provides a platform to create custom voices from scratch using various parameters.

Text-to-Speech feature allows users to input text and hear it read back in different character voices.

The app offers a range of AI realistic voices with flexibility in tone adjustment.

Voice effects can be heavily modulated, affecting the natural speech patterns and expressions.

Some voice effects may not pick up on dialects or modulations when used with certain voices.

Real-time voice changing with minimal delay, allowing for immediate feedback on voice effects.

VoiceMod App includes background effects and the ability to adjust the mix of dry and reverberated sounds.

The app provides a selection of new AI voices with unique characteristics and adjustable parameters.

Users can experiment with voice effects to find the most suitable and enjoyable options for their needs.

VoiceMod App is a versatile tool for live streamers, offering voice enhancement and sound effects for an immersive experience.

The app's interface allows for easy adjustments and automation of sound effects using key presses or external devices.

VoiceMod App is continually updated with new features and improvements to meet user demands and expectations.