Jump to content

GPT-4o released with improved text, audio and vision capabilities

By Jubair Hossin Babu, 05/14/2024


gsmarena_000.jpg

GPT-4o (“o” for “omni”) is OpenAI’s latest multimodal large language model (LLM) and it brings major advancements in text, voice, and image content generation to offer more natural interaction between users and the assistant. OpenAI claims its new AI model can respond to audio inputs in as little as 232 milliseconds and it is significantly faster in text response in non-English prompts with support for over 50 languages. You can also interrupt the model with new questions or clarifications while it is talking. GPT-4o also features a more capable, human-sounding voice assistant that...

View the full article

  • 45 Views
  • 0 Comments



Comments

There are no comments to display.

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
IP.Board News by DevFuse
×
×
  • Create New...

Write what you are looking for and press enter or click the search icon to begin your search