🔓 Open-source

Gradient gives you easy access to open-source LLMs. We host the models and you can focus on inference and fine-tuning.

What we offer

  1. Ownership & Control: You own your model completely. This includes your private data used for inference and fine-tuning (i.e., the model weights generated during the fine-tuning process).
  2. Security & Privacy: When you create your own instance of an open-source model no one else can access it unless you decide to share it. Models created within a Gradient workspace can only be accessed by members of that workspace.
  3. More Customization: When using open-source models you have full access the model's architecture and code. This allows you to better understand the model and completely customize it to fit your needs.
  4. Open-Source Community: Open-source comes with a built-in community of developers who are actively building and sharing resources with each other.

☁️ Gradient-hosted LLMs

The Gradient API hosts a broad range of language models for inference and fine-tuning. Fine-tuning is only supported for specific models.

Currently supported by Gradient

Model NameRelease DateContext LengthFeature SupportLicense
Bloom-560May 20222048Inference; Fine-tuningThe BigScience RAIL License
Llama-2-Chat-7BJuly 20234096Inference; Fine-tuningLLaMA 2 License; Cannot use LLaMA outputs to train other LLMs besides LLaMA and its derivatives
Llama-2-Chat-70BJanuary 20244096Inference; Fine-tuning**LLaMA 2 License ; Cannot use LLaMA outputs to train other LLMs besides LLaMA and its derivatives
Llama-3-Chat-8BApril 20248kInference; Fine-tuning**LLaMA 3 License
Llama-3-Chat-70BApril 20248kInference; Fine-tuning**LLaMA 3 License
Mixtral-8x7B InstructJanuary 202432kInference onlyApache-2.0 License
Nous-Hermes-Llama-2-13BJuly 20234096Inference; Fine-tuningN/A

** Requires Enterprise License. If you don't already have one, please reach out to [email protected]

Model IDs for reference in the API and CLI

Language ModelModel IDSlug ID
Bloom-56099148c6d-c2a0-4fbe-a4a7-e7c05bdb8a09_base_ml_modelbloom-560m
Llama-2-Chat-7Bf0b97d96-51a8-4040-8b22-7940ee1fa24e_base_ml_modelllama2-7b-chat
Llama-2-Chat-70B780dc245-41cc-4398-99ad-ea1a7d6ad011_base_ml_modelllama2-70b-chat
Llama-3-Chat-8B62163345-b63f-4826-88f1-acd8e289f85e_base_ml_modelllama3-8b-chat
Llama-3-Chat-70B07396fc3-64f1-43e7-b194-f10843e741f0_base_ml_modelllama3-70b-chat
Mixtral-8x7B Instruct86898eca-d7eb-4ed5-aee4-6fe4f67c426b_base_ml_modelmixtral-8x7b-instruct
Nous-Hermes-Llama-2-13Bcc2dafce-9e6e-4a23-a918-cad6ba89e42e_base_ml_modelnous-hermes2

Expected to be supported in the future

Language ModelParamsRelease DateContext LengthLicense
Starcoder15.5BMay 20248192OpenRAIL-M v1
Code Llama7B, 13B, 34BMay 20244096LLaMA 2 License; Cannot use LLaMA outputs to train other LLMs besides LLaMA and its derivatives

Note: Supported models are subject to change


🔢 Gradient-hosted embeddings models

The Gradient API also hosts embeddings models to convert your text data into generated embeddings.

Currently supported by Gradient

ModelRelease DateLicense
bge-largeAug 2023MIT License

Note: Supported models are subject to change