OpenAI showcases ChatGPT’s new voice and image processing features
OpenAI has introduced new voice and image processing features for ChatGPT, enhancing its multimodal capabilities. Users can now interact with the app through voice while sharing images and screens in real time. The updates aim to create more natural and conversational interactions, with broader access planned for the future.
- ▪OpenAI demonstrated ChatGPT's ability to fill out paperwork using voice conversations and image uploads.
- ▪The multimodal capabilities were first announced on September 25, 2023, targeting Plus and Enterprise users.
- ▪Voice interactions are available on iOS and Android, while image uploads are enabled across all platforms.
Opening excerpt (first ~120 words) tap to expand
OpenAI showcases ChatGPT’s new voice and image processing features The AI giant is pushing ChatGPT beyond text with multimodal capabilities that let users talk to the app while sharing images and screens in real time. Share Add us on Google by Editorial Team May. 23, 2026 window.sevioads = window.sevioads || []; var sevioads_preferences = []; sevioads_preferences[0] = {}; sevioads_preferences[0].zone = "01f21ccf-2092-46b1-9ac7-8c44cc782e0f"; sevioads_preferences[0].adType = "native"; sevioads_preferences[0].inventoryId = "c5700508-581b-472c-8fdd-a931cdbfc8e1"; sevioads_preferences[0].accountId = "1e47efc1-ec2d-4fca-a8b9-354e249e5095"; sevioads.push(sevioads_preferences); OpenAI has been demonstrating ChatGPT’s ability to fill out paperwork using a combination of voice conversations and…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Crypto Briefing.