AI News

Meet GOAT-7B-Community Model: An AI Model Fine-Tuned LLaMA-2 7B Model on Dataset Collected from GoatChat App

July 28, 2023

403

Recently, scientists at the AI Research Lab unveiled the GOAT-7B-Community model, which refines the LLaMA-2 7B model using data from the GoatChat app. Meta’s LLaMA v2 7B was fine-tuned to become the state-of-the-art GOAT-7B-Community model by utilizing the novel, fine-grained dataset obtained from the application GoatChat.

‘Alignment’ is crucial in creating large language models (LLMs). It’s the idea that a model can decline to answer questions it considers unethical or illegal based on its education and experience. Alignment is essential for ethical AI implementation but poses new obstacles for model optimization.

Researchers have noticed that alignment-generated responses rarely provide the precise details the customers require. These reactions are typically more subdued and indicative of a reluctance to elaborate. Taking care of this is essential if one is going to build a reliable model that provides insightful and complete responses to questions. They have found that the alignment filter eliminates not all improper suggestions. Because of this, alignment often results in discarding a large dataset. This amounts to around a third of the total information in the case.

In light of this problem, researchers have developed a new technique for cleaning datasets. In addition, they ran a regulated experiment to thoroughly comprehend the effect of aligned replies on the model’s performance.

How Scientists Are Taught

An eight-A100 NVIDIA GPU-equipped high-performance node provided the backbone of the deep learning computations. The researchers chose the bfloat16 floating-point format and the DeepSpeed ZeRO-3 optimization as the basis for the training procedure. They put the models through three iterations, saving their progress every other epoch. Empirical evidence, however, showed that after a single epoch of execution, the quality began to degrade. This led them to rethink their strategy and settle on a single training epoch with a halfway point check. Common criteria for evaluating language models, such as MMLU and BigBench Hard, are used to assess the GOAT-7B-Community model. The team is still analyzing all the models and will release its findings soon.

Uses

Research on big language models and chatbots is GOAT-7B-Community’s primary focus. Natural language processing, machine learning, and artificial intelligence scholars and enthusiasts will find it especially useful.

Limitations

Despite its impressive reasoning abilities, the model suffers from the issues associated with its relatively tiny size (7B models are considered a “small” LLM). Hallucinations are the most noticeable kind. These ‘hallucinations’ are an ongoing obstacle to solving as LLMs are improved and expanded.

Hallucinations are a persistent problem highly emphasized in artificial intelligence studies. The ultimate objective is to develop models capable of producing logical, grammatically sound answers and true to the facts presented.

Risk and Biases

The GOAT-7B-Community model is unreliable since it may return results that are at odds with reality. The model was educated using both public and proprietary data. So, the GOAT-7B-Community model can produce inaccurate, biased, or even objectionable results.

Principal Observations

There are few better free 7B models than this one.
The key to good MMLU results is a diverse and high-quality data set.
When compared to current 13B models, the 7B performs admirably.
However, size constraints still apply.

Way Forward

Researchers have several exciting projects in the pipeline that will take the AI research to new heights. They are crafting a scientific paper that delves into the fresh findings on how different dataset processing and collection methods can substantially enhance a model’s reasoning abilities. They have discovered that how to curate and process the data substantially impacts the success of supervised instruction fine-tuning. The insights they have gleaned could be pivotal in advancing the field of AI, and researchers are eager to share them with the broader community. They are also setting their sights on even more ambitious goals in deep learning. Researchers are already developing larger LLaMA v2 models, specifically the 13B and 70B variants. These grand-scale models will allow us to experiment further and push the boundaries of what’s currently possible in AI modeling.

The journey into deep learning research and model training is just beginning. Researchers are fully committed to researching all the critical challenges around LLMs and the AI Twin technologies, aiming to unlock the extraordinary potential of reinforcement learning from human feedback (RLHF).

Check out the Blog and Demo. All Credit For This Research Goes To the Researchers on This Project. Also, don’t forget to join our 26k+ ML SubReddit, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more.

Dhanshree Shenwai is a Computer Science Engineer and has a good experience in FinTech companies covering Financial, Cards & Payments and Banking domain with keen interest in applications of AI. She is enthusiastic about exploring new technologies and advancements in today’s evolving world making everyone’s life easy.

🔥 Gain a competitive
edge with data: Actionable market intelligence for global brands, retailers, analysts, and investors. (Sponsored)

Credit: Source link

Meet GOAT-7B-Community Model: An AI Model Fine-Tuned LLaMA-2 7B Model on Dataset Collected from GoatChat App

Recommended

Apple and Duke Researchers Present a Reinforcement Learning Approach That Enables...

Ethereum Price Forms Bullish Pennant; Can ETH Hit $4000 by June?

Sui (SUI) Whales Spark Sell-Off, While Ruvi AI (RUVI) Attracts Growing...

Bitcoin Short-Term Holders Are Distributing Again — Local Price Top Imminent?

BitMEX to Launch MOONPIGUSDT Perpetual Swap with 50x Leverage

EDITOR PICKS

Who’s Buying Access to Trump? Jamie Raskin Demands Guest List of...

Bitcoin MVRV Ratio Signals Price Hot, But Not In Danger Zone...

Krypto Reserve: Laut David Sacks könnten nun doch aktive Zukäufe möglich...

POPULAR POSTS

Sorare 2023-24: New Gameplay Formats & Experiences

What Does it Mean to Deploy a Machine Learning Model?

Ruliad AI Releases DeepThought-8B: A New Small Language Model Built on...

POPULAR CATEGORY