OpenAI, the company behind the widely used chatbot ChatGPT, has added voice capabilities to the tool alongside image functionality. Users can now go beyond text-based interactions and hold spoken conversations with the chatbot.
Over the next two weeks, OpenAI plans to gradually roll out these features to its Plus and Enterprise users. The capability will be available on both iOS and Android devices, and users can opt in through their application settings.
In an official announcement made on Monday, OpenAI expressed their enthusiasm for these features, stating:
“We are pleased to announce the introduction of innovative voice and image capabilities to ChatGPT. These functionalities offer a more intuitive and dynamic interface, enabling users to engage in voice conversations with ChatGPT or visually convey their ideas to the AI.”
The integration of voice and image functionalities greatly expands the versatility of ChatGPT, enabling users to utilize it in various aspects of their lives. For instance, one can capture a snapshot of a landmark while traveling and initiate a real-time discussion about its significance.
Similarly, users can photograph the contents of their refrigerator and pantry to facilitate dinner planning, even seeking step-by-step recipe assistance through follow-up inquiries. After a meal, parents can assist their children with math problems by capturing an image, highlighting the problem, and receiving hints from ChatGPT.
Google Bard Also Receives Fresh Enhancements
In a parallel development, Google updated its own AI tool, Bard, just one week before OpenAI’s announcement. The update expands Bard’s capabilities by drawing on real-time information from the Google apps and services that people rely on daily. Google detailed these enhancements, explaining:
“Bard is now equipped to access and assist with real-time data from Maps, YouTube, Hotels, and Flights. Users can effortlessly amalgamate information from diverse sources, transforming ideas into reality more efficiently. These extensions are activated by default, yet users maintain the flexibility to deactivate them at their discretion.
This update empowers users to collaborate not only with the wealth of information available globally but also with their personal data, all within a unified platform with Bard as their creative companion. Users can grant Bard permission to interact with their Gmail, Docs, and Drive, allowing them to locate, summarize, and address queries related to their personal content.
It’s essential to note that user data from Google Workspace is not employed to train Bard’s public model, and users can disable this feature at any time.”
Google emphasized that Bard’s newfound ability to interface with other apps and services represents the initial step toward a transformative capability for the AI tool.
These advancements come at a time of mounting global concern about the risks of unchecked AI growth and its impact on humanity.