AI Shocks Again: Krea AI New Updates, Apple AI Beats GPT-4? and New ChatGPT Features

TechFront AI
9 Apr 2024
06:16

TLDR

This week's tech news highlights include a ChatGPT update that allows direct editing of generated images. Krea AI introduces an image-to-image feature for customized picture creation. Stable Audio 2.0 offers enhanced audio generation for commercial use, with longer tracks and easier access. HeyGen presents realistic AI avatars that can talk and move, aimed at changing how videos are made. Lastly, Apple's ReALM aims to improve Siri's grasp of language and context, hinting at significant AI announcements at the upcoming WWDC event.

Takeaways

  • 🖼️ ChatGPT now lets users generate images with the DALL-E model and edit parts of the generated images directly.
  • 🎨 Krea AI's new 'image to image' feature lets users upload multiple images and set a weight for each one, which controls how strongly it influences the blended output.
  • 🎶 Stable Audio's update introduces enhanced audio quality, commercial use of generated tracks, longer audio lengths, and an audio-to-audio feature for transforming sounds into polished tracks.
  • 👾 HeyGen's AI avatars can talk, walk, and move, offering a new level of realism and dynamism in AI interactions, with high-quality video creation available through the company's website.
  • 📱 Apple's ReALM technology aims to improve voice assistants like Siri by better understanding context and complex references, and it may feature in upcoming WWDC updates.
  • 🚀 Apple's commitment to AI advancements suggests a focus on enhancing everyday gadgets with improved AI capabilities.
  • 🌐 The news highlights the continuous development and competition in the AI field, with various companies introducing innovative features and improvements.
  • 🔍 The ability to edit generated images and blend multiple inputs into a new image showcases the growing versatility of AI in creative tasks.
  • 🎥 AI avatars that can mimic realistic human movements and interactions open up new possibilities for digital content creation and social media engagement.
  • 🎧 The commercial use of AI-generated music tracks can revolutionize the music production industry, offering creators new avenues for monetizing their work.
  • 📱 The potential for Siri to become more contextually aware and responsive hints at a future where AI voice assistants are even more integrated into daily life.

Q & A

  • What is the new feature introduced in the latest ChatGPT update?

    -The latest ChatGPT update lets users directly edit parts of a generated image without having to regenerate the entire image.

  • How does the image editing feature work in ChatGPT Plus?

    -In ChatGPT Plus, users click on the image, pick the select tool, adjust the brush size, and brush over the area they want to edit. They then type a description of the change and see the result right away.

  • What is Krea AI and what new feature has it introduced?

    -Krea AI is a tool that creates pictures from a description of what the user wants. Its new feature, 'image to image', lets users upload multiple images and adjust each one's influence on the final output by changing its weight.

  • How does the 'image to image' feature work in Krea AI?

    -The 'image to image' feature in Krea AI mixes parts of multiple uploaded images into a new image. By adjusting the weight of each picture, users control how much of each image appears in the final output.

  • What are the key features of the Stable Audio update?

    -The Stable Audio update improves audio quality, permits commercial use of generated tracks, supports tracks up to 3 minutes long, makes the tool easier to access, and adds an audio-to-audio capability that transforms recorded sounds into polished tracks.

  • How does the audio to audio feature in Stable Audio 2 work?

    -Alongside text-to-audio generation, the audio to audio feature in Stable Audio 2 lets users transform recorded sounds into polished tracks, so creators can craft rich audio experiences with ease and precision.

  • What is the main function of the HeyGen avatars?

    -HeyGen avatars are AI-generated virtual avatars that can talk, walk, and move around, bringing a new level of realism and dynamism to AI interactions. Users create fun, high-quality videos by typing in a script, then receive an email with a video clip of the avatar in action.

  • How realistic is the movement of HeyGen avatars?

    -The movement of HeyGen avatars is highly realistic. The avatars mimic human movements and expressions so closely that someone scrolling quickly through social media might not notice they were made by AI.

  • What is Apple's new AI language technology, 'ReALM'?

    -ReALM stands for Reference Resolution as Language Modeling. It is an AI language technology designed to improve voice assistants like Siri on phones by helping them better understand context and tricky references, so they can give smarter and quicker responses to user queries.

  • What is the significance of Apple's ReALM technology?

    -ReALM signals Apple's commitment to enhancing AI capabilities in its devices. It is designed to run smoothly on phones and suggests that Apple plans to keep improving Siri with better AI in future updates.

  • What event is hinted at in the script for potential AI improvements?

    -The script hints at Apple's Worldwide Developers Conference (WWDC) in June, where they might announce AI improvements, including a Siri with much better AI capabilities.

Outlines

00:00

🖼️ Image Editing with ChatGPT Plus

This paragraph discusses the latest update to ChatGPT Plus, which lets users generate images with the DALL-E model. The key improvement is that users can now edit a specific part of a generated image directly instead of recreating the whole image. To do so, they click the image, pick the select tool, adjust the brush size, and brush over the area they want to change. They then type a description of the edit, and the result is displayed immediately. This makes it much easier to tailor generated images to a user's needs.

05:01

🎨 Krea AI's Image to Image Feature

The second paragraph highlights Krea AI, a tool that creates images from a simple description of what the user wants. Its latest update introduces a feature called 'image to image', which lets users upload multiple images and adjust each one's influence on the final output by changing its weight. In this way users can blend elements from different photos into a new image. For example, after uploading three pictures and requesting an image of fish made out of porcelain, users can adjust how much of each picture is used and watch the new image evolve before their eyes. This blending makes Krea AI an engaging tool for producing unique pictures.
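
The weighting idea described above can be pictured with a small sketch. The video does not explain how the tool combines images internally, so the following Python is only an illustrative assumption: each uploaded image is stood in for by a short list of numbers, and the hypothetical `blend` function mixes those lists in proportion to user-chosen weights.

```python
def normalize_weights(weights):
    """Scale non-negative weights so that they sum to 1."""
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    return [w / total for w in weights]


def blend(features, weights):
    """Weighted average of equal-length feature lists.

    `features` is a hypothetical stand-in for whatever numeric
    representation a real tool would compute for each image.
    """
    if len(features) != len(weights):
        raise ValueError("need exactly one weight per image")
    length = len(features[0])
    if any(len(f) != length for f in features):
        raise ValueError("all feature lists must have the same length")
    shares = normalize_weights(weights)
    return [
        sum(share * feature[i] for share, feature in zip(shares, features))
        for i in range(length)
    ]


if __name__ == "__main__":
    # Two toy "images"; the first is weighted three times as heavily.
    print(blend([[1.0, 0.0], [0.0, 1.0]], [3, 1]))
```

With weights 3 and 1, the first image contributes three quarters of the blend, which mirrors how raising one picture's weight makes it dominate the result.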

🎵 Stable Audio: Enhancing Audio Experiences

The third paragraph focuses on Stable Audio, an AI-driven tool designed to transform the way we create and interact with sound. It improves audio quality by filtering out noise and composes music based on specific inputs. This gives creators a way to craft rich audio experiences with ease and precision for podcasts, music production, or digital content creation. A key feature is commercial use: the tool now relies on a licensed data set, so the generated tracks can be used commercially. Users can create audio tracks up to 3 minutes long through an intuitive interface. Stable Audio 2 also introduces an audio-to-audio capability that, alongside text-to-audio generation, transforms recorded sounds into polished tracks. The tool is free to use; after signing in with a Google account, users can generate up to 20 tracks.

👾 AI Avatars by HeyGen

The fourth paragraph covers an advance in AI avatars from a company named HeyGen. HeyGen lets users create fun, high-quality videos with virtual avatars that can not only talk but also walk and move around, adding a new level of realism and dynamism to AI interactions. Users visit HeyGen's website, type the script they want the avatar to deliver, in any language, and provide their email address. HeyGen then emails a video clip showing the personalized avatar in motion. The avatar's movement is lifelike enough that the clip could easily blend in on social media platforms like Instagram.

📱 Apple's ReALM Breakthrough in AI Language Tech

The final paragraph discusses a breakthrough in AI language technology by Apple: a new model called ReALM, short for Reference Resolution as Language Modeling. ReALM is designed to enhance voice assistants like Siri on smartphones by improving their understanding of context and complex references, so they can give smarter and quicker responses to user queries. Before ReALM was introduced, there was speculation that Apple might adopt a different language technology, Google's Gemini 1.5, for Siri. However, because ReALM is developed by Apple and runs smoothly on mobile devices, it appears that Apple plans to use ReALM for future Siri updates. The paragraph also mentions Apple's big event, the Worldwide Developers Conference (WWDC), in June, where the company might announce AI improvements, including a significantly enhanced Siri. This indicates that Apple is actively advancing in AI to improve the everyday gadgets we use.
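
To make "reference resolution as language modeling" concrete, here is a minimal Python sketch. The video does not describe Apple's actual prompt format or model interface, so everything below is an assumption for illustration: candidate entities are written out as numbered lines of text, a language model (not included here) would be asked which numbers the request refers to, and the hypothetical `parse_answer` helper reads those numbers back.

```python
import re


def build_prompt(request, entities):
    """Turn candidate entities into a plain-text prompt.

    `entities` is a list of (kind, text) pairs; the layout is a
    hypothetical example, not a documented format.
    """
    lines = [
        f"{number}. {kind}: {text}"
        for number, (kind, text) in enumerate(entities, start=1)
    ]
    return (
        "Entities:\n"
        + "\n".join(lines)
        + f"\nRequest: {request}\nRelevant entity numbers:"
    )


def parse_answer(answer, entity_count):
    """Extract valid, de-duplicated entity numbers from a model reply."""
    picked = []
    for token in re.findall(r"\d+", answer):
        number = int(token)
        if 1 <= number <= entity_count and number not in picked:
            picked.append(number)
    return picked


if __name__ == "__main__":
    candidates = [("phone_number", "555-0100"), ("address", "1 Main St")]
    print(build_prompt("call that number", candidates))
    # A real system would send the prompt to a model; the reply is faked here.
    print(parse_answer("1", len(candidates)))
```

The point of the sketch is that once the candidates are plain text, picking what "that number" refers to becomes an ordinary text-in, text-out task for a language model.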

Keywords

💡ChatGPT

ChatGPT is a conversational AI system developed by OpenAI, built on large language models and known for generating human-like text from the prompts given to it. In the video, its Plus version is described as having received an update that lets users generate images with the DALL-E model and edit them directly. This shows the product's capabilities growing beyond text generation.

💡DALL-E

DALL-E is an AI model created by OpenAI, known for generating images from textual descriptions. The video highlights that the latest update to ChatGPT Plus enables direct editing of parts of images generated with DALL-E. This combines DALL-E's image generation with ChatGPT's language understanding and gives users more precise control over visual content creation.

💡Krea AI

Krea AI is an AI-powered tool that creates images from a simple description of what the user wants. As mentioned in the video, the tool has been updated with an 'image to image' feature, which lets users upload multiple images and adjust their influence on the final output. Users can blend elements from different images to create a new one, expanding the possibilities for artistic expression and design.

💡image to image

The 'image to image' feature is an update to Krea AI that lets users upload multiple images and adjust the impact each has on the final image. By changing the 'weights' of the uploaded images, users see the new image evolve in real time as a blend of the different visual elements. The video showcases this feature as a way to create unique and personalized images.

💡Stable Audio

Stable Audio is an AI-driven tool designed to enhance the quality of audio content by filtering out noise and composing music based on specific inputs. As highlighted in the video, the tool's latest update, Stable Audio 2, allows commercial use of generated tracks, supports longer audio lengths, and offers an intuitive interface for ease of access. It is aimed at creators who want to produce rich audio experiences for podcasts, music production, or digital content creation with greater ease and precision.

💡audio to audio

The 'audio to audio' capability is a feature introduced in Stable Audio 2 that lets users transform recorded sounds into polished tracks. It extends the tool beyond text-to-audio conversion and offers a more complete solution for audio content creation. The video presents this feature as a way to enhance and refine existing audio recordings.

💡HeyGen

HeyGen is a company featured in the video that creates AI avatars capable of talking and moving, adding a new level of realism and dynamism to AI interactions. The video describes how users enter a script for an avatar to deliver and receive a video clip showing their personalized avatar in motion. The result is a lifelike virtual character that can engage viewers in a more interactive and visually appealing way.

💡virtual avatars

Virtual avatars, as discussed in the video, are AI-generated characters that can not only speak but also move and interact in a lifelike manner. HeyGen lets users create high-quality videos with avatars that deliver a script chosen by the user. These avatars are designed to be realistic enough to blend in with real social media content, such as Instagram posts.

💡Apple AI

Apple AI refers to the artificial intelligence technologies and products developed by Apple Inc. The video reports that Apple has made a breakthrough with 'ReALM', a language technology designed to improve the performance of voice assistants like Siri. ReALM focuses on better understanding context and complex references, aiming to provide smarter and quicker responses to user queries. The video also speculates about AI advancements Apple may reveal at its Worldwide Developers Conference (WWDC), pointing to Apple's commitment to integrating AI into everyday devices.

💡ReALM

ReALM, short for Reference Resolution as Language Modeling, is an AI language technology developed by Apple. It is designed to enhance voice assistants such as Siri by improving their understanding of context and complex references, so they can respond to user questions more accurately and quickly. The video suggests that ReALM's introduction indicates Apple's intention to keep refining Siri and other AI-driven features in future updates.

💡WWDC

WWDC, or the Worldwide Developers Conference, is an annual event hosted by Apple Inc. where the company typically announces new software and technologies. As mentioned in the video, there is anticipation for AI-related improvements to be unveiled at the upcoming WWDC, including potential enhancements to Siri's AI capabilities. This event is significant as it often serves as a platform for Apple to showcase its latest innovations and advancements in technology, hinting at the future direction of their products and services.

Highlights

ChatGPT has introduced a new update that allows users to generate images with the DALL-E model.

The new ChatGPT Plus version lets you directly edit parts of a generated image without recreating the whole image.

Krea AI, a tool for creating images by describing them, has been updated with an 'image to image' feature that lets users blend elements from multiple photos into one.

Stable Audio is an AI-driven tool that enhances audio quality and composes music based on specific inputs.

Stable Audio 2 now includes a licensed dataset, allowing for commercial use of the generated tracks.

Users can create audio tracks up to 3 minutes long with Stable Audio 2's intuitive interface.

Stable Audio 2 introduces an 'audio to audio' capability, transforming recorded sounds into polished tracks.

HeyGen has introduced virtual avatars that can talk, walk, and move around, bringing a new level of realism to AI interactions.

HeyGen's avatars can be customized with specific details and generate high-quality, realistic video clips.

Apple has introduced ReALM, a new AI language tech aimed at improving voice assistants like Siri.

ReALM focuses on better understanding context and references to provide smarter and quicker responses.

Apple's upcoming WWDC event may bring news of Siri updates and other AI improvements.

The ReALM technology is designed to work smoothly on mobile phones.

Apple's development in AI suggests a push to enhance everyday gadgets with improved technology.

The video provides a demo of HeyGen's avatar in motion, showcasing the realistic movement of the AI-generated avatar.

According to the video, no other company has yet matched HeyGen's avatar technology, which makes the service stand out.

The new features in AI avatars and language tech aim to create more engaging and interactive experiences for users.