Social Mixer 2024 Singapore
OpenAI unveils new GPT-4o AI model

OpenAI unveils new GPT-4o AI model

share on

OpenAI is releasing a new AI model called GPT-4o. The new model will be capable of realistic voice conversation and can interact across audio, vision and text in real time.

In a statement by OpenAI, GPT-4o is able to accept any combination of text, audio and image as input and generate any combination of text, audio, and image outputs. 

In addition, it can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversation, said OpenAI.

Don’t miss: Could OpenAI's text to video AI model Sora change the game for brand films? 

GPT-4o is also able to understand over 20 languages including different language families such as gujarati, telugu and marathi. 

The new AI model has safety built-in, through techniques such as filtering training data and refining the model’s behaviour through post-training. A new safety system was also created to provide guardrails on voice outputs. 

In the upcoming weeks and months, OpenAI will be working on the technical infrastructure, usability via post-training, and safety of the model. 

GPT-4o’s text and image capabilities have been rolled out on ChatGPT. It will be made available in the free tier and to Plus users. OpenAI aims to roll out a new version of Voice Mode with GPT-4o in alpha within ChatGPT Plus in the coming weeks. 

OpenAI made the announcement a day before Google’s developers’ conference where the tech giant is expected to show off Alphabet’s new AI-related features. 

According to Reuters, shares of Alphabet were down 0.4%, after falling nearing 3% earlier in the day. This is likely due to OpenAI’s unveiling of GPT-4o. 

Meanwhile, Microsoft shares were down 0.2%. 

Last year, Google introduced its AI model Gemini, a multimodal platform that can generalise and understand, operate across  and combine different types of information including text, code, audio, image and video. 

The system, which comprises Gemini Ultra, Gemini Pro and Gemini Nano, is also a flexible one that can run on everything from data centers to mobile devices. 

Join us on 12 June 2024 for an exciting experience as Content360 makes its debut in Malaysia! Brace yourself to join the crème de la crème of the content marketing industry hailing from across the region. Immerse yourself in a dynamic atmosphere, and uncover the latest trends with thought leaders and solution providers from the realm of content.

Related articles: 
Financial Times partners OpenAI to enhance AI platform
Sora for dummies: 101 on OpenAI's new text to video AI model
Google's Gemini for dummies: Why experts are divided on its potential success

share on

Follow us on our Telegram channel for the latest updates in the marketing and advertising scene.

Free newsletter

Get the daily lowdown on Asia's top marketing stories.

We break down the big and messy topics of the day so you're updated on the most important developments in Asia's marketing development – for free.

subscribe now open in new window