Gorilla is a fine-tuned, open-source large language model (LLM) that generates API calls given a natural language query. Initially based on LLaMA-7B, it was created by researchers from UC Berkeley and Microsoft Research.
An end-to-end model tailored to serving correct API calls without additional coding, Gorilla is designed to work as part of a wider ecosystem, integrating with other tools. The model's website describes it as "An Appstore for LLMs."
The team behind Gorilla claims it outperforms several baseline models for code generation, including GPT-4. Gorilla is fine-tuned on APIBench, a new dataset of API descriptions drawn from three machine learning hubs: Torch Hub, TensorFlow Hub, and HuggingFace. Gorilla can also call out to an external document database containing API definitions, accessing new APIs without retraining.
The first Gorilla model (for HuggingFace API descriptions) was released on May 27, 2023, based on LLaMA-7B, a 7-billion-parameter LLM created by Meta. The next day, two more LLaMA-based versions were released for Torch Hub and TensorFlow Hub API descriptions. On June 6, 2023, two further versions were released under the Apache 2.0 license, allowing commercial use. Unlike the earlier releases, these were not based on LLaMA-7B: one is based on MPT-7B from MosaicML and the other on Falcon-7B from the Technology Innovation Institute.
A paper describing Gorilla was first released on May 24, 2023. Titled "Gorilla: Large Language Model Connected with Massive APIs," the paper is authored by Shishir G. Patil, Tianjun Zhang, Xin Wang, and Joseph E. Gonzalez. At the time of publication, the two lead authors, Patil and Zhang, were fourth-year PhD students under Professor Gonzalez at UC Berkeley. Wang, a former UC Berkeley PhD student who worked with Gonzalez, is a senior researcher at Microsoft Research in the Physics of AGI group; she was previously part of the Computer Vision Group. Gonzalez is a professor in the Department of Electrical Engineering and Computer Science at UC Berkeley, a co-director and founding member of the UC Berkeley RISE Lab, and a member of the Berkeley AI Research (BAIR) group.
APIBench is a large corpus of APIs developed by the team behind Gorilla. Built by scraping machine learning APIs from public model hubs, it contains complicated and often overlapping functionality. The researchers chose three major model hubs to construct APIBench: Torch Hub, TensorFlow Hub, and HuggingFace. Torch Hub and TensorFlow Hub were scraped exhaustively. HuggingFace, however, hosts a very large number of models (over 200,000), many of which have poor documentation, lack dependencies, or have limited information on their model cards, so only the twenty most downloaded models per task category were used. The task categories comprised seven in multimodal data, eight in computer vision, twelve in natural language processing, five in audio, two in tabular data, and two in reinforcement learning. The resulting corpus contains over 1,600 API calls, and the team plans to add new domains, including Kubernetes, GCP, AWS, and OpenAPI.
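The top-20-per-task filtering step can be illustrated with a short sketch. This is not the authors' published scraping code; it assumes the modern huggingface_hub client and an illustrative subset of task names.

```python
# A minimal sketch of keeping only the 20 most-downloaded HuggingFace
# models per task category (client library and task names are assumptions).
from huggingface_hub import list_models

TASKS = ["text-classification", "image-classification", "text-to-image"]  # illustrative subset

top_models = {}
for task in TASKS:
    # Sort by downloads, descending, and keep the top 20 per category.
    top_models[task] = [
        m.id for m in list_models(filter=task, sort="downloads",
                                  direction=-1, limit=20)
    ]
```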
Ten synthetic user question prompts per API were also generated using Self-Instruct, so that each entry in the dataset becomes an instruction-reference API pair. GPT-4 was used to generate the synthetic instruction data: three in-context examples were provided along with the reference API documentation, and GPT-4 was tasked to refrain from using API names or hints when generating the instructions.
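A sketch of that Self-Instruct step might look as follows; the helper name, prompt wording, and use of the OpenAI client are assumptions for illustration, not the paper's exact pipeline.

```python
# Hypothetical Self-Instruct generation: ask GPT-4 for user questions an
# API could answer, seeded with three in-context examples and the docs.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def generate_instructions(api_doc: str, examples: list[str], n: int = 10) -> list[str]:
    demos = "\n".join(f"- {e}" for e in examples[:3])  # three in-context examples
    prompt = (
        "Here are three example user instructions:\n"
        f"{demos}\n\n"
        "Reference API documentation:\n"
        f"{api_doc}\n\n"
        f"Write {n} realistic user instructions that this API could satisfy. "
        "Do not mention the API name or give any other hints about the API."
    )
    resp = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
    )
    return [line for line in resp.choices[0].message.content.splitlines() if line.strip()]
```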
Gorilla is a retrieve-aware model fine-tuned specifically for API calls. The model employs Self-Instruct to generate instruction/API pairs. To fine-tune the base model (LLaMA for the initial releases), these pairs are converted into a user-agent chat-style conversation, with each round of the conversation making up a data point. Standard instruction fine-tuning was then performed on the base model. Gorilla was trained both with and without the retriever.
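As a rough illustration, converting one pair into a chat-style data point could look like this; the field names are assumptions, not the released training schema.

```python
# Minimal sketch: turn an (instruction, API call) pair into a user-agent
# chat-style example for instruction fine-tuning (schema is hypothetical).
from typing import Optional

def to_chat_example(instruction: str, api_call: str,
                    retrieved_doc: Optional[str] = None) -> dict:
    user_turn = instruction
    if retrieved_doc is not None:
        # Retriever-aware training: API docs are appended to the user turn.
        user_turn += f"\nUse this API documentation for reference: {retrieved_doc}"
    return {
        "conversations": [
            {"from": "user", "value": user_turn},
            {"from": "agent", "value": api_call},
        ]
    }
```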
API calls often come with constraints, so Gorilla must not only comprehend the functionality of an API call but also categorize calls according to different constraint parameters. The paper shows that augmenting an LLM with retrieval does not always improve performance. During inference, the user provides a natural language prompt, which can describe a simple task or something vaguer. Gorilla has two inference modes: zero-shot and retrieval. In zero-shot mode, the prompt is passed directly to the Gorilla LLM, which returns the API call that helps accomplish the task or goal. In retrieval mode, the retriever (either BM25 or GPT-Index) returns the most up-to-date API documentation stored in the API database. This documentation is concatenated to the prompt along with the message "Use this API documentation for reference" before being passed to Gorilla. Beyond this concatenation, no further prompt tuning is performed.
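A minimal sketch of the two modes, assuming a retriever callable (standing in for BM25 or GPT-Index) and a gorilla function wrapping the model; both names are placeholders:

```python
# Sketch of zero-shot vs. retrieval-mode prompt assembly.
def build_prompt(query: str, retriever=None) -> str:
    if retriever is None:
        return query  # zero-shot mode: the prompt is passed as-is
    doc = retriever(query)  # retrieval mode: fetch current API docs
    # The fixed concatenation below is the only prompt engineering applied.
    return f"{query}\nUse this API documentation for reference: {doc}"

# Example (hypothetical model wrapper):
# api_call = gorilla(build_prompt("I want to generate an image from text"))
```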
Given a natural language prompt, there are a number of different APIs Gorilla could provide to complete the task; for example, there are many different image generation models. To evaluate Gorilla's performance and verify the APIs it delivers, their functional equivalence is checked against the collected dataset, using an AST (abstract syntax tree) matching strategy to trace which API in the dataset is being called. AST matching is also used to directly identify hallucinations, which the paper defines as an API call that is not a sub-tree of any API in the database, meaning the model returned an imagined tool. This is distinct from invoking an incorrect API, which the paper describes as an error rather than a hallucination.
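The sub-tree check can be sketched with Python's built-in ast module. This simplified version tests exact sub-tree equality, whereas the paper's matcher is more permissive about optional arguments:

```python
# Simplified hallucination check: a generated call that matches no API
# in the database is an imagined tool.
import ast

def contains_subtree(code: str, reference_call: str) -> bool:
    target = ast.dump(ast.parse(reference_call, mode="eval").body)
    return any(ast.dump(node) == target for node in ast.walk(ast.parse(code)))

def is_hallucination(generated: str, api_database: list[str]) -> bool:
    return not any(contains_subtree(generated, api) for api in api_database)

db = ["torch.hub.load('pytorch/vision', 'resnet50')"]
print(is_hallucination("x = torch.hub.load('pytorch/vision', 'resnet50')", db))  # False
print(is_hallucination("x = torch.hub.fetch_model('resnet50')", db))             # True
```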
Diagram showing how Gorilla works.
The Gorilla code and model files are available on GitHub, along with a Google Colab notebook demo that lets users launch the three LLaMA-7B-based models, with a hosted endpoint for the MPT-7B-based model and plans to add a Falcon-7B version. Gorilla lead author Shishir Patil has stated that the Gorilla models based on LLaMA are not licensed for commercial use, while the ones based on MPT-7B and Falcon-7B are. Users can also run Gorilla from the command line: on June 29, 2023, a research prototype of Gorilla CLI was released under the Apache 2.0 license, building on the Gorilla LLMs to suggest commands for execution based on natural language queries.
The day after the paper was submitted (May 25, 2023), the APIBench dataset and Gorilla's evaluation code were released. On May 27, 2023, the team released the first Gorilla model (for HuggingFace APIs) as well as the APIZoo contribution guide for community API contributions. On May 28, they released two more versions of the Gorilla model, based on Torch Hub and TensorFlow Hub APIs. On May 30, 2023, the CLI to chat with Gorilla was introduced. On June 6, two commercially usable, Apache 2.0 licensed Gorilla models were released, based on MPT-7B and Falcon-7B. On June 29, 2023, Gorilla CLI was released, an LLM-based tool providing CLI commands from natural language queries.
Gorilla is a retreive-aware, finetuned LLM specifically for API calls. The initial three releases were finetuned versions of the LLaMA-7B model. The model employs self-instruct to generate API instruction pairs.
June 29, 2023: Gorilla CLI released, providing CLI commands for execution based on natural language queries.
June 6, 2023: Two commercially usable, Apache 2.0 licensed Gorilla models released, based on MPT-7B and Falcon-7B.
May 28, 2023: Two more Gorilla models released, for Torch Hub and TensorFlow Hub APIs, based on LLaMA-7B.
May 27, 2023: First Gorilla model released (HuggingFace APIs), based on LLaMA-7B.
May 25, 2023: APIBench dataset and evaluation code released.
May 24, 2023: Paper released, titled "Gorilla: Large Language Model Connected with Massive APIs," authored by Shishir G. Patil, Tianjun Zhang, Xin Wang, and Joseph E. Gonzalez.