Called "GPT 4o." (The "o" stands for "omni"), this new flagship model was designed, the company said, to "reason" across audio, vision, and text in real time. OpenAI also announced the release of the desktop version of ChatGPT, and a refreshed UI designed to make it simpler to use and more natural. The new iteration was designed to accept as input any combination of text, audio, and image, and to generate any combination of text, audio, and image outputs. In a blog post, the company said it can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, "which is similar to human response time in a conversation."