ChatGPT Enhances User Experience: Voice Response and Photo Analysis Now Available

OpenAI announced in a blog post on Monday that its artificial intelligence chatbot, ChatGPT, is gaining voice and image recognition capabilities. Users will be able to ask questions out loud and have ChatGPT analyze photos.

With the updated version of the ChatGPT app, users can directly speak to the chatbot and receive responses in an AI-generated voice. For instance, users can ask ChatGPT to narrate a bedtime story, and it will automatically generate a narrative and read it aloud.

In terms of image analysis, ChatGPT can now recognize objects in photos, allowing users to ask questions about them. OpenAI provides examples such as taking a photo of a pantry and asking ChatGPT to analyze the available ingredients and suggest corresponding recipes. Similarly, users can take a photo of their bike and ask ChatGPT for instructions on how to lower the seat. The chatbot may then request a picture of the user’s toolbox and a relevant manual to recommend the necessary tools and instructions.

The voice recognition and picture analysis features will be rolled out to ChatGPT Plus and Enterprise users within the next two weeks.

These enhancements come as ChatGPT’s usage has declined since the beginning of this year. Although it initially gained immense popularity, becoming the fastest-growing app in history with an estimated 100 million monthly active users, its traffic had decreased by July. The decline could be attributed to a waning novelty factor, to increased competition from Google’s Bard and Bing’s AI chat, or to both.

Despite this, ChatGPT has had a significant impact on the tech industry. Many companies, including Duolingo and makers of fitness apps, have integrated generative AI technology into their software.

However, generative AI is not without flaws. AI-generated information can be misleading or incorrect yet presented in a convincing, confident manner. This is known as a hallucination, a term that is part of the growing glossary needed to understand AI. Additionally, malicious actors can exploit AI-generated voices to deceive people. To mitigate this risk, OpenAI has limited ChatGPT to five voices.

To enable ChatGPT’s voice conversations feature, users can follow these steps:
1. Go to Settings.
2. Tap on “New Features.”
3. Opt in to Voice Conversations.
4. Tap the headphone button located at the top right corner of the home screen.
5. Choose from the available voices.

Editor’s note: CNET is using an AI engine to help create some stories.