Model Overview
Description:
MathΣtral is designed specifically for math reasoning and scientific discovery, inspired by the legacy of Archimedes. It is a 7B model, aimed at solving advanced mathematical problems and supporting scientific research. Mathstral is released under the Apache 2.0 license and is part of a broader effort to support academic projects, produced in collaboration with Project Numina.
This model is ready for commercial use.
Third-Party Community Consideration:
Mathstral is developed by Mistral AI and is intended for use by the scientific and academic community. The model is available on Hugging Face.
Terms of Use
By using this software or model, you agree to the terms and conditions, acceptable use policy, and Mistral's privacy policy. Mathstral-7B-v0.1 is released under the Apache 2.0 license.
References(s):
Mathstral blogpost
Model Architecture:
Architecture Type: Transformer
Network Architecture: Mathstral 7B v0.1
Model Version: 0.1
Input:
Input Type(s): Text
Input Format: String
Input Parameters: Max Tokens, Temperature, Top P
Max Input Tokens: 4096 Tokens
Output:
Output Type(s): Text
Output Format: String
Max Output Tokens: 4096 Tokens
Software Integration:
Supported Hardware Platform(s): NVIDIA Ampere, NVIDIA Hopper, NVIDIA Turing
Supported Operating System(s): Linux
Inference:
Engine: TRT-LLM
Test Hardware: L40S
Benchmarks:
Mathstral achieves state-of-the-art reasoning capacities in its size category across various industry-standard benchmarks. It scores 56.6% on MATH and 63.47% on MMLU. With more inference-time computation, Mathstral 7B scores 68.37% on MATH with majority voting and 74.59% with a strong reward model among 64 candidates.
Ethical Considerations
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. Please report security vulnerabilities or NVIDIA AI Concerns here.