Artificial Intelligence has been rapidly advancing across various industries, with OpenAI, a San Francisco-based company, taking a leading role
The success of ChatGPT has sparked a competitive race among major tech companies to integrate generative AI into their products and services.
On September 25, OpenAI, under the leadership of Sam Altman, introduced voice and image capabilities for its widely acclaimed chatbot.
OpenAI stated, “The addition of voice and image capabilities expands the range of applications for ChatGPT.
At home, users can take pictures of their fridge and pantry to determine dinner options, and even request step-by-step recipes.
In July, Google introduced multi-modality in its chatbot, Google Bard, in an effort to remain competitive with OpenAI and other players in the field.
AI enthusiasts have already showcased the tool’s remarkable abilities, from identifying locations in images to offering interior design suggestions.
One noteworthy feature of ChatGPT Vision is its ability to interpret and explain complex diagrams, making it a valuable educational tool.
The vision-based models undergo rigorous testing, and OpenAI collaborates with organizations like ‘Be My Eyes’ to improve accessibility for visually impaired users.