mistralai / mistral-large

Model Overview

Description:

Mistral Large is MistralAI's new cutting-edge text generation model. It reaches top-tier reasoning capabilities. It can be used for complex multilingual reasoning tasks, including text understanding, transformation, and code generation.
Mistral Large has the following capabilities.

  • It is natively fluent in English, French, Spanish, German, and Italian, with a nuanced understanding of grammar and cultural context.
  • Its 32K tokens context window allows precise information recall from large documents.
  • Its precise instruction-following enables developers to design their moderation policies – we used it to set up the system-level moderation of le Chat.
  • It is natively capable of function calling. This, along with constrained output mode, implemented on la Plateforme, enables application development and tech stack modernisation at scale.

Third-Party Community Consideration:

This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see the Mistral Large Blog.

Terms of use

By using this software or model, you are agreeing to the terms and conditions of the license, acceptable use policy and Mistral's privacy policy.

References(s):

Mistral Large Blog | Mistral AI

Model Architecture:

Architecture Type: Transformer

Model Version: 0.1

Input:

Input Format: Text

Input Parameters: Max Tokens, Temperature, Top P

Output:

Output Format: Text

Output Parameters: None

Software Integration:

Supported Hardware Platform(s): Hopper, Ampere, Ada

Supported Operating System(s): Linux

Inference:

Engine: Triton

Test Hardware: Other