OpenAI’s GPT-5, the newest version of the AI tech that drove the ChatGPT phenomenon in 2022, is about to be released soon, and users will be keen to see if this upgrade from GPT-4 matches the advancements made by the research lab before.

The advancement of GPT-4 was driven by increased computing power and data, and the company was optimistic that a similar approach of “scaling up” would reliably enhance AI models.

However, OpenAI, which has Microsoft as a backer and is currently valued at $300 billion, faced challenges in scaling up. One of the issues was the data barrier the company encountered, and OpenAI’s former chief scientist Ilya Sutskever mentioned last year that even though processing power was on the rise, the volume of data was not keeping pace.

He pointed out that large language models rely on extensive datasets that pull information from the entire internet, and AI research labs lack alternative sources for substantial amounts of human generated text data.

Besides the data shortage, another issue is that large model ‘training runs’ are more prone to hardware failures due to the system’s complexity.

Researchers often won’t know how well the models perform until the run is complete, which can take several months.

OpenAI hasn’t announced a release date for GPT-5.

Almost three years back, ChatGPT brought generative AI into the spotlight, impressing users with its knack for crafting human like writing and poetry, and it quickly became one of the fastest growing apps ever.

In March 2023, OpenAI launched GPT-4, a large language model that made significant advancements in intelligence. While GPT-3.5, an earlier version, scored in the bottom 10% on a bar exam, GPT-4 managed to pass the simulated bar exam in the top 10%.

GPT-4 then set the standard, and everyone realized that AI models could actually outperform humans in various tasks.

Before long, other companies started to take notice. That same year, Alphabet’s Google and Anthropic, which is supported by Amazon and Google, rolled out their own competitive models against GPT-4. Within a year, open-source models comparable to GPT-4, like Meta Platforms’ Llama 3 models, were introduced.

In addition to training large models, OpenAI has now ventured into another area called “test-time compute,” which directs more processing power to tackle tough tasks like math or complex operations that require human like reasoning and decision making.

Earlier this year, the company’s CEO Sam Altman mentioned that GPT-5 would merge both test time compute and its large models. He also noted that OpenAI’s model and product offerings had become “complicated’.

Subscribe My Channel





Discover more from Connect2ConnectOnline

Subscribe now to keep reading and get access to the full archive.

Continue reading