Everyone's favorite chatbot can India Archivesnow see and hear and speak. On Monday, OpenAI announced new multimodal capabilities for ChatGPT. Users can now have voice conversations or share images with ChatGPT in real-time.
Audio and multimodal features have become the next phase in fierce generative AI competition. Meta recently launched AudioCraft for generating music with AI and Google Bard and Microsoft Bing have both deployed multimodal features for their chat experiences. Just last week, Amazon previewed a revamped version of Alexa that will be powered by its own LLM (large language model), and even Apple is experimenting with AI generated voice, with Personal Voice.
SEE ALSO: OpenAI expands ChatGPT 'custom instructions' to free usersVoice capabilities will be available on iOS and Android. Like Alexa or Siri, you can tap to speak to ChatGPT and it will speak back to you in one of five preferred voice options. Unlike, current voice assistants out there, ChatGPT is powered by more advanced LLMs, so what you'll hear is the same type of conversational and creative response that OpenAI's GPT-4 and GPT-3.5 is capable of creating with text. The example that OpenAI shared in the announcement is generating a bedtime story from a voice prompt. So, exhausted parents at the end of a long day can outsource their creativity to ChatGPT.
This Tweet is currently unavailable. It might be loading or has been removed.
Multimodal recognition is something that's been forecasted for a while, and is now launching in a user-friendly fashion for ChatGPT. When GPT-4 was released last March, OpenAI showcased its ability to understand and interpret images and handwritten text. Now it will be a part of everyday ChatGPT use. Users can upload an image of something and ask ChatGPT about it — identifying a cloud, or making a meal plan based on a photo of the contents of your fridge. Multimodal will be available on all platforms.
As with any generative AI advancement, there are serious ethics and privacy issues to consider. To mitigate risks of audio deepfakes, OpenAI says it is only using its audio recognition technology for the specific "voice chat" use case. Also, it was created with voice actors they have "directly worked with." That said, the announcement doesn't mention whether users' voices can be used to train the model, when you opt in to voice chat. For ChatGPT's multimodal capabilities, OpenAI says it has "taken technical measures to significantly limit ChatGPT’s ability to analyze and make direct statements about people since ChatGPT is not always accurate and these systems should respect individuals’ privacy." But the real test of nefarious uses won't be known until it's released into the wild.
Voice chat and images will roll out to ChatGPT Plus and Enterprise users in the next two weeks, and to all users "soon after."
Topics Artificial Intelligence ChatGPT
San Juan mayor fires back at Trump official's claim about Puerto RicoOscars 2024: Ryan Gosling and Emily Blunt bring Barbenheimer beef to the Academy AwardsElmo visited Southeast Asia for the first time and met a particularly curious pythonSony BOGO deal: Get a free TV with select purchases'Manipulated' photo of Kate Middleton pulled by media agencies. Why?6 White Day gifts to make up for your botched Valentine's DayIs 'Dream Scenario' streaming? How to watch the Nicolas Cage A24 filmHow to watch 'Invincible' Season 2, Part 2: streaming date, free trials, and moreAirbnb banned indoor security cameras. Here's why.This tiny, shapeScientists warned of an impending disaster in Puerto Rico 5 days aheadChristopher Nolan wins his first Best Director Oscar for 'Oppenheimer'Save 50% or more at REI through March 11Oscars 2024: Complete list of winners6 White Day gifts to make up for your botched Valentine's DayElon Musk wants SpaceX to fly you anywhere on Earth in under an hourHow to turn read receipts on or off on Instagram9 intriguing UFO claims the Pentagon just refuted as bogusFEMA removes data on water availability in Puerto Rico from websiteThe AirPods Pro are back down to a record low price Tripadvisor in Australia: Everything you need to know Frank Ocean's Coachella livestream was cancelled, but the internet finds a way How doctors' receptionists really feel about their notorious TikTok reputation What Earth was like last time CO2 levels were as high as today Contact by Adam Gilders The unexpected joy of not knowing when your package will be delivered Google's AI search engine will 'anticipate users' needs' Mark Zuckerberg 'expressed concerns' in Trump phone call, so that should fix everything The Speed of Motion by Harold Edgerton Apple might announce new MacBooks at WWDC Spotify goes down worldwide 'Succession' Season 4, episode 4: What does Shiv's pregnancy actually mean? Memo to 'The Mandalorian': This is the way (to fix the show) 'Succession' Season 4, episode 4: What's the deal with Logan's paper? Emily Fragos on Emily Dickinson’s Letters by David O'Neill Theme park food videos are perfect for a stay Chrissy Teigen promises big bail fund donation after Trump's 'MAGA Night' tweet A Week in Culture: Amélie Nothomb, Writer by Amélie Nothomb Francisco Goldman on ‘Say Her Name’ by Lila Byock New to RVing? Here's what you need to know.
2.2698s , 8222.1953125 kb
Copyright © 2025 Powered by 【India Archives】,Prosperous Times Information Network