Golden has been acquired by ComplyAdvantage.Read about it here ⟶

LLaMA

LLaMA (Large Language Model Meta AI) is a foundational large language model (LLM) released by Meta AI.

Overview Structured Data Issues Contributors Activity

All edits

Edits on 22 Feb, 2024

"update inverses"

Golden AI

edited on 22 Feb, 2024

Edits made to:

Infobox (+1 properties)

Infobox

Competitors

Gemma (Google)

Edits on 11 Oct, 2023

"update citations for inverse infoboxes"

Golden AI

edited on 11 Oct, 2023

Infobox

Competitors

Mistral 7B

Edits on 10 Oct, 2023

"update inverses"

Golden AI

edited on 10 Oct, 2023

Edits made to:

Infobox (+1 properties)

Infobox

Competitors

Mistral 7B

Edits on 1 Sep, 2023

Jen English

edited on 1 Sep, 2023

Edits made to:

Infobox (+1 properties)

Infobox

Product Parent Company

Meta AI

Edits on 30 Aug, 2023

Jen English

edited on 30 Aug, 2023

Edits made to:

Infobox (+1 properties)

Infobox

Also Known As

LLaMA 2

Edits on 19 Jul, 2023

Amy Tomlinson Gayle

edited on 19 Jul, 2023

Edits made to:

Timeline (+9/-13 characters)

Article (+53/-38 characters)

Article

Meta first released LLaMA in February 2023. Unlike language models from Open AI/Microsoft and Google, which are conversational chatbots, LLaMA is not a system users can talk to; it is a tool to help researchers working in the field. Meta is releasing LLaMA under a noncommercial license focused on research use cases, with access granted to groups like universities, NGOs, and industry labs. Like other large language models, LLaMA takes a sequence of words as an input to predict the next word, recursively generating text. LLaMA was trained using text from twenty languages, focusing on those with Latin and Cyrillic alphabets. In the announcement of LLaMA, Meta stated:

modelsModels such as LLaMA enable others in the research community who don’t have access to large amounts of infrastructure to study these models, further democratizing access in this important, fast-changing field.

...

On July 18, 2023, Meta and Microsoft introduced the Llama 2 family of models, a group of open-source LLMs free to use for research and commercial use. Llama 2 represents an expansion of the partnership between Microsoft and Meta, with availability through the Azure AI model catalog and optimization for running running locally on Windows. Llama 2 is also available via AWSAWS, Hugging FaceHugging Face, and other providers. Meta opened access to Llama 2 with the support of a broad set of companies and researchers across tech, academia, and policy who believe in open innovation of AI technologies.

...

The release includes model weights and starting code for pre-trained and fine-tuned Llama models -models— ranging from 7B to 70B parameters. Llama 2 was trained on 40% percent more data compared to Llama 2. Llama 2 was pre-trained on publicly available online data, totaling over 2 trillion tokens. The fine-tuned version of the model (for dialogue use), Llama-2-chat, uses reinforcement learning from human feedback (RLHF), leveraging publicly available instruction datasets and over 1 million human annotations.

...

As part of the Llama 2 release, each model comes with the following:

Timeline

July 18, 2023

Meta and Microsoft Introduce theintroduce Llama 2.

Arthur Smalley

edited on 19 Jul, 2023

Edits made to:

Infobox (+5 properties)

Timeline (+1 events) (+79 characters)

Description (+13/-19 characters)

Article (+1977/-181 characters)

Further Resources (+2 rows) (+9 cells) (+527 characters)

LLaMA

LLaMA ( Large Language Model Meta AI ) is a foundational large language model (LLM) released by Meta AI in February 2023.

Article

Overview

LLaMA (Large Language Model Meta AI) is a foundational large language model (LLM) released by Meta AI. LLaMa is designed to help researchers advance the field of AI, through access to smaller more performant models without the need for large amounts of infrastructure and computing power.

LLaMA 1

InMeta first released LLaMA in February 2023, Meta announced a 65-billion-parameter large language model called Large Language Model Meta AI (LLaMA), designed to help researchers advance their AI work. Unlike language models from Open AI/Microsoft and Google thatwhich are conversational chatbots, LLaMA is not a system users can talk to; it is a research tool to help researchers working in the field. Meta is releasing LLaMA under a noncommercial license focused on research use cases, with access granted to groups like universities, NGOs, and industry labs. Like other large language models, LLaMA takes a sequence of words as an input to predict the next word, recursively generating text. LLaMA was trained using text from twenty languages focusing on those with Latin and Cyrillic alphabets. In the announcement of LLaMA, Meta stated:

...

4chan leak

A week after the first announcement of LLaMA on March 3, 2023, the model was leaked. A downloadable torrent of the system was posted on 4chan before spreading to other online AI communities. On March 6, 2023, Meta announced it would continue to release its AI tools to approved researchers despite the leak to unauthorized users. In a statement, the company said:

...

Llama 2

On July 18, 2023, Meta and Microsoft introduced the Llama 2 family of models, a group of open-source LLMs free to use for research and commercial use. Llama 2 represents an expansion of the partnership between Microsoft and Meta, with availability through the Azure AI model catalog and optimization for running running locally on Windows. Llama 2 is also available via AWS, Hugging Face, and other providers. Meta opened access to Llama 2 with the support of a broad set of companies and researchers across tech, academia, and policy who believe in open innovation of AI technologies.

The release includes model weights and starting code for pre-trained and fine-tuned Llama models - ranging from 7B to 70B parameters. Llama 2 was trained on 40% more data compared to Llama 2. Llama 2 was pre-trained on publicly available online data, totaling over 2 trillion tokens. The fine-tuned version of the model (for dialogue use), Llama-2-chat, uses reinforcement learning from human feedback (RLHF) leveraging publicly available instruction datasets and over 1 million human annotations.

As part of the Llama 2 release, each model comes with:

Model code
Model weights
User guide
Responsible use guide
License
Acceptable use policy
Model card

...

Llama 2 models have a context window of 4,096 tokens. While Llama 2 does not reach the same performance as GPT-4, Meta research shows it performs well against other open-source models. Use cases for Llama 2 focus on commercial and research in English. Tuned models are intended for assistant-like chatbots, while pre-trained models can be adapted for a wide range of natural language tasks.

Further Resources

Title

Author

Link

Type

Date

Llama 2: Open Foundation and Fine-Tuned Chat Models | Meta AI Research

Hugo Touvron, et al

https://ai.meta.com/research/publications/llama-2-open-foundation-and-fine-tuned-chat-models/

Web

July 18, 2023

LLaMA: Open and Efficient Foundation Language Models

Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timothée Lacroix, Baptiste Rozière, Naman Goyal, Eric Hambro, Faisal Azhar, Aurelien Rodriguez, Armand Joulin, Edouard Grave, Guillaume Lample

https://arxiv.org/abs/2302.13971

February 27, 2023

Infobox

Is a

Industry

Competitors

Technologies Used

First Release

February 18, 2023

Timeline

February 24, 2023

Meta releases LLaMA, a foundational, 65-billion-parameter large language model.

Edits on 18 Jul, 2023

Jen English

edited on 18 Jul, 2023

Edits made to:

Timeline (+1/-1 events) (+41/-81 characters)

Timeline

July 18, 2023

Microsoft and Meta expand their AI partnership with Llama 2 on Azure and Windows.

July 18, 2023

Meta and Microsoft Introduce the Llama 2.

Jen English

edited on 18 Jul, 2023

Edits made to:

Infobox (+1 properties)

Timeline (+1 events) (+81 characters)

Infobox

Official Website

https://ai.meta.com/llama/

Timeline

July 18, 2023

Microsoft and Meta expand their AI partnership with Llama 2 on Azure and Windows.

Edits on 12 May, 2023

Jen English

edited on 12 May, 2023

Edits made to:

Article (+2158 characters)

Article

In February 2023, Meta announced a 65-billion-parameter large language model called Large Language Model Meta AI (LLaMA), designed to help researchers advance their AI work. Unlike language models from Open AI/Microsoft and Google that are conversational chatbots, LLaMA is not a system users can talk to; it is a research tool to help researchers working in the field. Meta is releasing LLaMA under a noncommercial license focused on research use cases, with access granted to groups like universities, NGOs, and industry labs. Like other large language models, LLaMA takes a sequence of words as an input to predict the next word, recursively generating text. LLaMA was trained using text from twenty languages focusing on those with Latin and Cyrillic alphabets. In the announcement of LLaMA, Meta stated:

models such as LLaMA enable others in the research community who don’t have access to large amounts of infrastructure to study these models, further democratizing access in this important, fast-changing field.

As the size of the model impacts the computing power and resources required to test new approaches, Meta is making LLaMA available in several sizes:

7 billion
13 billion
33 billion
65 billion

The release came alongside a paper with more details on the model titled "LLaMA: Open and Efficient Foundation Language Models." In the paper, Meta claims the 13 billion parameter model (LLaMA-13B) performs better than OpenAI’s popular GPT-3 model on most benchmarks, while the largest model, LLaMA-65B, is “competitive with the best models,” such as DeepMind’s Chinchilla70B and Google’s PaLM 540B.

4chan leak

A week after the announcement of LLaMA on March 3, 2023, the model was leaked. A downloadable torrent of the system was posted on 4chan before spreading to other online AI communities. On March 6, 2023, Meta announced it would continue to release its AI tools to approved researchers despite the leak to unauthorized users. In a statement, the company said:

While the model is not accessible to all, and some have tried to circumvent the approval process, we believe the current release strategy allows us to balance responsibility and openness,

Jen English

edited on 12 May, 2023

Edits made to:

Timeline (-4 events) (-219 characters)

Timeline

May 4, 2023

Introducing LLaMA: A foundational, 65-billion-parameter language model

May 4, 2023

dmca/2023-03-21-meta.md at master · github/dmca

March 6, 2023

Meta's LLaMA Leaked to the Public, Thanks To 4chan

February 27, 2023

LLaMA: Open and Efficient Foundation Language Models

Jen English

edited on 12 May, 2023

Edits made to:

Infobox (-1 properties)

Infobox

Official Website

https://ai.facebook.com/

Jen English

edited on 12 May, 2023

Edits made to:

Infobox (+1/-1 properties)

Article (-738 characters)

Article

LLaMA(Large Language Model Meta AI) is alarge language model(LLM) released byMeta AIin February 2023. A variety of model sizes were trained ranging from 7 billion to 65 billion parameters. LLaMA's developers reported that the 13 billion parameter model's performance on mostNLPbenchmarks exceeded that of the much largerGPT-3(with 175 billion parameters) and that the largest model was competitive with state of the art models such as PaLM andChinchilla.Whereas the most powerful LLMs have generally been accessible only through limitedAPIs(if at all), Meta released LLaMA's model weights to the research community under a noncommercial license.Within a week of LLaMA's release, its weights wereleakedto the public on 4chan viaBitTorrent.

Infobox

Community Forum

https://ai.facebook.com/

Announcement URL

https://ai.facebook.com/blog/large-language-model-llama-meta-ai/

Edits on 4 May, 2023