This week, Trevor Lohrbeer talks with ChatGPT through the Hume AI Empathic Voice Interface (EVI) about the advancements introduced with OpenAI's new GPT-4o model. They discuss the model's impressive responsiveness, its multimodal capabilities, and its potential to revolutionize AI interactions by understanding text, audio, images, and video in real time.
They also explore how GPT-4o might be a stepping stone to training GPT-5 with native audio and video, as well as the possibility of incorporating additional sensory data from robots. They emphasize how these innovations could transform human-AI interactions, making them more natural and intuitive than anything we have seen before, while also introducing new potential risks.