AI Shocks Again: KERA AI new updates, Apple AI Beats GPT-4 ? and New ChatGPT Features
TLDRThis week's tech news highlights include a new update for ChatGPT, which now allows direct editing of generated images. Crea AI introduces an image-to-image feature for customized picture creation. Stable Audio 2.0 offers enhanced audio generation for commercial use, with extended track lengths and improved user access. HEN presents realistic AI avatars that can talk and move, revolutionizing video creation. Lastly, Apple's Realm aims to improve Siri's language understanding and context grasp, hinting at significant AI advancements at the upcoming WWDC event.
Takeaways
- 🖼️ Chat GPT now allows users to generate images using the DALL-E model and edit parts of the generated images directly.
- 🎨 Crea AI's new feature, 'image to image', lets users upload multiple images and adjust their influence on the final output by changing their weights, creating a unique blend.
- 🎶 Stable Audio's update introduces enhanced audio quality, commercial use of generated tracks, longer audio lengths, and an audio-to-audio feature for transforming sounds into polished tracks.
- 👾 HEN's AI avatars can talk, walk, and move, offering a new level of realism and dynamism in AI interactions, with high-quality video creation available through their website.
- 📱 Apple's Realm technology aims to improve voice assistants like Siri by better understanding context and complex references, potentially being featured in upcoming WWDC updates.
- 🚀 Apple's commitment to AI advancements suggests a focus on enhancing everyday gadgets with improved AI capabilities.
- 🌐 The news highlights the continuous development and competition in the AI field, with various companies introducing innovative features and improvements.
- 🔍 The ability to edit generated images and blend multiple inputs into a new image showcases the growing versatility of AI in creative tasks.
- 🎥 AI avatars that can mimic realistic human movements and interactions open up new possibilities for digital content creation and social media engagement.
- 🎧 The commercial use of AI-generated music tracks can revolutionize the music production industry, offering creators new avenues for monetizing their work.
- 📱 The potential for Siri to become more contextually aware and responsive hints at a future where AI voice assistants are even more integrated into daily life.
Q & A
What is the new feature introduced in the latest Chat GPT update?
-The latest Chat GPT update introduces a feature that allows users to directly edit parts of a generated image without having to regenerate the entire image.
How does the image editing feature work in the Chat GPT plus version?
-In the Chat GPT plus version, users can click on the image, use a select tool to resize it, and then brush over the area they want to edit. They can type in their ideas for how they want to edit it and see the result immediately.
What is Crea AI and what new feature has it introduced?
-Crea AI is a tool that enables users to create pictures by describing what they want. The new feature it introduced is 'image to image', which allows users to upload multiple images and adjust their influence on the final output by changing their weights.
How does the 'image to image' feature work in Crea AI?
-The 'image to image' feature in Crea AI lets users mix parts of multiple uploaded images to create a new image. By adjusting the weights of each picture, users can influence how much of each image is used in the final output.
What are the key features of the Stable Audio update?
-The key features of the Stable Audio update include commercial use, audio length, ease of access, and an audio to audio capability. It enhances audio quality, allows for the creation of tracks up to 3 minutes long, and introduces a feature to transform recorded sounds into polished tracks.
How does the audio to audio feature in Stable Audio 2 work?
-The audio to audio feature in Stable Audio 2 allows users to convert text to audio and transform recorded sounds into polished tracks, providing creators with the ability to craft rich audio experiences with ease and precision.
What is the main function of the HEN avatars?
-HEN avatars are AI-generated virtual avatars that can talk, walk, and move around, bringing a new level of realism and dynamism to AI interactions. They can be used to create fun, high-quality videos by typing in a script and receiving an email with a video clip of the avatar in action.
How realistic is the movement of HEN avatars?
-The movement of HEN avatars is highly realistic. If quickly scrolling through social media, one might not even notice that the avatars are made by AI, as they can mimic human movements and expressions very closely.
What is Apple's new AI language technology called 'Realm'?
-Realm stands for Reference Resolution as Language Modeling. It is an AI language technology designed to improve voice assistants like Siri on phones by helping them better understand context and tricky references, thereby providing smarter and quicker responses to user queries.
What is the significance of Apple's Realm technology?
-Realm technology signifies Apple's commitment to enhancing AI capabilities in their devices. It is designed to work smoothly on phones and suggests that Apple plans to continue improving Siri with better AI for future updates.
What event is hinted at in the script for potential AI improvements?
-The script hints at Apple's Worldwide Developers Conference (WWDC) in June, where they might announce AI improvements, including a Siri with much better AI capabilities.
Outlines
🖼️ Image Editing with Chat GPT Plus
This paragraph discusses the latest update to the Chat GPT Plus version, which now enables users to generate images using the Dalle model. The update introduces a significant improvement where users can directly edit a specific part of a generated image without having to recreate the entire image. The process involves using a select tool to resize the image and a brush to modify the desired area. Users then input their ideas for the edit, and the result is displayed immediately. This feature greatly enhances the customization of generated images to suit users' needs.
🎨 Crea AI's Image to Image Feature
The second paragraph highlights the Crea AI tool, which allows users to create images by simply describing what they want. The latest update of Crea AI introduces an innovative feature called 'image to image,' enabling users to upload multiple images and adjust their influence on the final output by changing their weights. This feature lets users blend elements from different photos to create a new image. For example, by uploading three pictures and requesting an image of fish made out of porcelain, users can adjust how much of each picture is utilized, watching the new image evolve before their eyes. Crea AI's ability to blend images in this manner makes it an engaging tool for producing unique pictures.
🎵 Stable Audio: Enhancing Audio Experiences
The third paragraph focuses on Stable Audio, an AI-driven tool designed to transform the way we create and interact with sound. It excels in improving audio quality by filtering out noise and composing music based on specific inputs. This technology provides creators with the ability to craft rich audio experiences with ease and precision, applicable for podcasts, music production, or digital content creation. Key features include commercial use, as the tool now incorporates a licensed data set, making the generated tracks fully usable for commercial purposes. Users can create audio tracks up to 3 minutes long through an intuitive interface. Additionally, Stable Audio 2 introduces an audio-to-audio capability, converting text to audio and transforming recorded sounds into polished tracks. The tool is available for free with a Google login required to generate up to 20 tracks.
👾 AI Avatars by Hen
The fourth paragraph delves into an exciting advancement in AI avatars, highlighting a company named Hen. Hen allows users to create fun, high-quality videos using AI with virtual avatars that can not only talk but also walk and move around, adding a new level of realism and dynamism to AI interactions. Users can visit Hen's website, input the details they want the avatar to express, and provide their email address. Hen will then send an email with a video clip showcasing the user's personalized avatar in motion. This innovative technology represents a new era of video making with AI, where users can type their script in any language to get started. The avatar's realistic movement is so lifelike that it could easily blend in on social media platforms like Instagram.
📱 Apple's Realm Breakthrough in AI Language Tech
The final paragraph discusses a breakthrough in AI language technology by Apple, introducing a new model called Realm, short for Reference Resolution as Language Modeling. Realm is designed to enhance the performance of voice assistants like Siri on smartphones by improving their understanding of context and complex references, thereby providing smarter and quicker responses to user queries. Prior to Realm's introduction, there was speculation that Apple might adopt a different language technology, Gemini 1.5, for Siri. However, with Realm being developed by Apple and running smoothly on mobile devices, it appears that Apple plans to continue using Realm for future Siri updates. The paragraph also mentions Apple's big event, the Worldwide Developers Conference (WWDC), in June, where they might announce AI improvements, including a significantly enhanced Siri. This indicates that Apple is actively advancing in AI to enhance the everyday gadgets we use.
Mindmap
Keywords
💡chat GPT
💡DALL-E
💡Crea AI
💡image to image
💡stable audio
💡audio to audio
💡haen
💡virtual avatars
💡Apple AI
💡Realm
💡WWDC
Highlights
Chat GPT has introduced a new update that allows users to generate images with the Dolly model.
The new Chat GPT Plus version lets you directly edit parts of a generated image without recreating the whole image.
Crea AI, a tool for creating images by describing them, has been updated with an 'image to image' feature that lets users blend elements from multiple photos into one.
Stable Audio is an AI-driven tool that enhances audio quality and composes music based on specific inputs.
Stable Audio 2 now includes a licensed dataset, allowing for commercial use of the generated tracks.
Users can create audio tracks up to 3 minutes long with Stable Audio 2's intuitive interface.
Stable Audio 2 introduces an 'audio to audio' capability, transforming recorded sounds into polished tracks.
HAEN has introduced virtual avatars that can talk, walk, and move around, bringing a new level of realism to AI interactions.
HAEN's avatars can be customized with specific details and generate high-quality, realistic video clips.
Apple has introduced Realm, a new AI language tech aimed at improving voice assistants like Siri.
Realm focuses on better understanding context and references to provide smarter and quicker responses.
Apple's upcoming WWDC event may bring news of Siri updates and other AI improvements.
The Realm technology is designed to work smoothly on mobile phones.
Apple's development in AI suggests a push to enhance everyday gadgets with improved technology.
The video provides a demo of HAEN's avatar in motion, showcasing the realistic movement of the AI-generated avatar.
HAEN's avatar technology has not been matched by other companies, offering a unique and innovative service.
The new features in AI avatars and language tech aim to create more engaging and interactive experiences for users.