OpenAI announces GPT-4, says it beats 90% of people on the SAT

Sam Altman, CEO of OpenAI, walks from lunch during the Allen & Company Sun Valley Conference on July 6, 2022 in Sun Valley, Idaho.

Kevin Dietsch | Getty Images News | Getty Images

OpenAI on Tuesday announced GPT-4, the latest version of its primary large language model, which it says demonstrates “human-level performance” on many professional tests.

GPT-4 is “bigger” than previous versions, meaning it was trained on more data and has more weights in its model file, which also makes it more expensive to run.

Many researchers in the field believe that much of the recent progress in AI comes from running increasingly large models on thousands of supercomputers in training processes that can cost tens of millions of dollars. GPT-4 is an example of an approach in which “scaling up” is central to achieving better results.

OpenAI said it used Microsoft Azure to train the model; Microsoft has invested billions in the startup. OpenAI has not published details about the specific model size or the hardware it used to train it, which could be used to recreate the model, citing “the competitive landscape”.

OpenAI’s GPT large language models power many of the artificial intelligence demos that have impressed people in the tech industry over the past six months, including Bing’s AI chat and ChatGPT. GPT-4’s advances could reach consumer products such as chatbots in the coming weeks. Bing’s AI chatbot uses GPT-4, Microsoft said Tuesday.

OpenAI says the new model will produce fewer factually incorrect answers, go off the rails less, talk about forbidden topics less often, and even outperform humans on many standardized tests.

GPT-4 performed at the 90th percentile on a simulated bar exam, the 93rd percentile on an SAT reading exam and the 89th percentile on the SAT math exam, OpenAI claimed.

However, OpenAI warns that the new software is not yet perfect and is less capable than humans in many scenarios. It still has a major problem with “hallucination,” or making things up, and is factually unreliable, the company said. It still tends to insist that it is right when it is wrong.

“GPT-4 still has many known limitations that we are working to address, such as social biases, hallucinations, and adversarial prompts,” the company said in a blog post.

“In casual conversation, the distinction between GPT-3.5 and GPT-4 can be subtle. The difference emerges when the complexity of the task reaches a sufficient threshold: GPT-4 is more reliable, more creative, and can handle much more nuanced instructions than GPT-3.5,” OpenAI wrote in a blog post.

The new model will be available to paying ChatGPT subscribers and will also be available as part of an API that allows programmers to integrate the AI into their apps. OpenAI charges about 3 cents for about 750 words of prompts and 6 cents for about 750 words of answers.
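
To make the quoted rates concrete, here is a rough back-of-the-envelope sketch in Python. It assumes cost scales linearly with word count at the approximate figures above (about 3 cents per 750 prompt words and 6 cents per 750 answer words). The function and constant names are hypothetical and are not part of any OpenAI library, and actual billing may be calculated differently.

```python
# Approximate rates quoted in the article (assumptions, not official billing logic).
PROMPT_CENTS_PER_BLOCK = 3      # about 3 cents per ~750 words of prompt
RESPONSE_CENTS_PER_BLOCK = 6    # about 6 cents per ~750 words of answer
WORDS_PER_BLOCK = 750


def estimate_cost_cents(prompt_words: int, response_words: int) -> float:
    """Estimate the cost of one request in cents, assuming linear scaling by word count."""
    if prompt_words < 0 or response_words < 0:
        raise ValueError("word counts must be non-negative")
    prompt_cost = prompt_words / WORDS_PER_BLOCK * PROMPT_CENTS_PER_BLOCK
    response_cost = response_words / WORDS_PER_BLOCK * RESPONSE_CENTS_PER_BLOCK
    return prompt_cost + response_cost


if __name__ == "__main__":
    # A 1,500-word prompt with a 750-word answer.
    print(f"{estimate_cost_cents(1500, 750):.1f} cents")  # prints "12.0 cents"
```

Under these assumptions, a 1,500-word prompt with a 750-word answer would cost roughly 12 cents, split evenly between the prompt and the answer.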
