Google researchers announced that they have created a new machine learning model featuring a self-modifying architecture.

Named ‘HOPE’, this model reportedly excels in managing long-context memory compared to current top-tier AI models.

This model is intended to act as a proof-of-concept for an innovative approach called ‘nested learning’, developed by Google researchers.

According to Google, the new “nested learning” idea may help overcome the shortcomings of the large language models (LLMs) of today, especially with regard to continuous learning, which is a crucial step in the development of artificial general intelligence (AGI) or intelligence comparable to that of humans..

Google is making waves with its new AI model ‘HOPE’, which marks a significant advancement in continual learning.

For years, Google’s researchers have been focused on solving the issue of catastrophic forgetting in LLMs.

What exactly is continual learning? And why is it so challenging?

LLMs that drive AI chatbots can currently write sonnets and generate code in seconds. However, they still lack the basic ability to learn from their experiences.

Unlike the human brain, which learns and improves continuously, today’s LLMs can’t acquire new knowledge or skills without losing what they already know. This limitation is known as ‘catastrophic forgetting’ (CF).

Researchers have been attempting to address CF for years by either improving optimization methods or altering the model’s architecture. But according to Google’s researchers, the training rules- that is, the optimization algorithm and the model’s architecture are essentially the same thing.

What’s nested learning all about?

One of the more complicated machine learning models is nested learning, which is made up of “a set of coherent, interconnected optimization problems that are either nested within one another or operating simultaneously”.

Each of these internal problems has its own context flow – a unique set of information it’s trying to learn from.

By leveraging these ideas, developers can create learning components in LLMs that have greater computational depth, as stated by Google.

They added, “The resulting models, like the Hope architecture, show that a careful approach to combining these components can produce learning algorithms that are more expressive, capable, and effective”.

Tests on a range of widely used language modeling and common-sense reasoning tasks revealed that the proof-of-concept model, HOPE, demonstrated higher accuracy and lower perplexity when compared to contemporary LLMs.

Subscribe My Channel





Discover more from Connect2ConnectOnline

Subscribe now to keep reading and get access to the full archive.

Continue reading