ChatGPT Can Now Analyze Live Videos: How It Works
To close out the year, OpenAI is bringing new features to its users as part of its "12 Days of OpenAI" campaign. Among the most anticipated updates, one has just been announced: the ability for ChatGPT to analyze live video from your smartphone camera. The feature, currently available to premium subscribers, is set to roll out in Europe soon.
A New Era of Interaction with ChatGPT
Since May, when GPT-4o and its advanced voice mode were first demonstrated, OpenAI has hinted at future developments related to video integration. Now it’s official: ChatGPT can not only hear and understand voice commands but also "see" and analyze visual input in real time.
The process is simple: users activate the “advanced voice” option and then access a new video button. This feature allows them to switch on the front or rear camera of their device. From that moment on, users can film their surroundings while interacting with the AI.
During a demo, OpenAI showcased ChatGPT assisting a user in making coffee. The AI recognized the tools and ingredients available in the kitchen and provided step-by-step guidance based on what it "saw" through the camera. The interaction resembles a video call, with ChatGPT acting as an intelligent assistant.
How Does Live Video Work?
The live video feature allows users to film their surroundings while conversing naturally with ChatGPT. Here’s how it works:
- Activate Voice Mode: The user activates the advanced voice mode on ChatGPT.
- Launch Video Mode: A video button appears, letting users turn on their front or rear camera.
- Real-Time Interaction: Users can now show objects, environments, or situations to ChatGPT, which recognizes and analyzes them to provide appropriate responses.
This feature lets users get real-time guidance on tasks such as cooking, home repair, or product identification.
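For readers who think in code, the three steps above can be modeled as a small state machine. This is only an illustrative sketch, not OpenAI's implementation or API: the names `Session`, `Mode`, `Camera`, `activate_voice`, `launch_video`, and `can_show_objects` are all hypothetical. It captures the one ordering rule the steps describe, namely that the video button is reached from advanced voice mode.

```python
from enum import Enum, auto


class Mode(Enum):
    TEXT = auto()
    VOICE = auto()
    VIDEO = auto()


class Camera(Enum):
    FRONT = auto()
    REAR = auto()


class Session:
    """Hypothetical model of the three-step flow (not OpenAI's code)."""

    def __init__(self) -> None:
        self.mode = Mode.TEXT
        self.camera = None

    def activate_voice(self) -> None:
        # Step 1: the user turns on advanced voice mode.
        self.mode = Mode.VOICE

    def launch_video(self, camera: Camera) -> None:
        # Step 2: the video button is only reachable from voice mode.
        if self.mode is Mode.TEXT:
            raise RuntimeError("activate voice mode before launching video")
        self.mode = Mode.VIDEO
        self.camera = camera

    def can_show_objects(self) -> bool:
        # Step 3: real-time visual interaction needs an active camera.
        return self.mode is Mode.VIDEO and self.camera is not None


session = Session()
session.activate_voice()
session.launch_video(Camera.REAR)
print(session.can_show_objects())  # True
```

Calling `launch_video` on a fresh session, before `activate_voice`, raises an error in this sketch, mirroring the order in which the app presents the options.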
A Game-Changer for AI-User Interactions
With this new feature, OpenAI has opened the door to a new form of interaction between humans and AI. Users no longer have to rely solely on text prompts or voice commands. They can now show ChatGPT their surroundings directly, which makes for a much richer and more intuitive experience.
Imagine the potential applications:
- Home Assistance: Get help identifying objects, fixing appliances, or organizing spaces.
- Learning and Education: Receive interactive lessons where ChatGPT responds to what you’re showing.
- Product Reviews: Show products live to ChatGPT and ask for more details or comparisons.
This feature marks a significant shift from traditional chatbot interactions to something closer to a "visual assistant" model.
When Will It Be Available?
While the functionality is currently available to premium subscribers, OpenAI has confirmed that it will soon roll out to users in Europe. This move aligns with OpenAI’s broader strategy to provide cutting-edge AI tools to a global audience.
The ability to share your environment with ChatGPT takes human-AI interaction to the next level. From offering advice on daily tasks to serving as a visual guide, ChatGPT has become more than just a chatbot: it's now a visual assistant ready to help in real time.