abacusai / dracarys-llama-3.1-70b-instruct

Dracarys-Llama-3.1-70B-Instruct

Introduction

We introduce the latest in the Smaug series, the Dracarys family of finetunes targeting coding performance improvements across a variety of base models.

This variant is a finetune of meta-llama/Meta-Llama-3.1-70B-Instruct that allows to generate code and answer questions about code.

Compared to meta-llama/Meta-Llama-3.1-70B-Instruct, Dracarys has better LiveCodeBench scores (see evaluation results below).

This model is ready for commercial and non-commercial use.

Third-Party Community Consideration

This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA Model Card.

License

META LLAMA 3 COMMUNITY LICENSE

Model Details

Model Architecture

Architecture Type: Transformer

Input

Input Type(s): Text

Input Format(s): String

Input Parameters: One Dimensional (1D)

Output

Output Type(s): Text

Output Format: String

Output Parameters: 1D

Supported Hardware Microarchitecture Compatibility:

  • NVIDIA Hopper
  • NVIDIA Lovelace

Preferred Operating System(s):

  • Linux

Evaluation Results

LiveCodeBench

ModelCode GenerationCode ExecutionTest Output Prediction
Dracarys-Llama-3.1-70B-Instruct37.0839.0049.90
Meta-Llama-3.1-70B-Instruct31.8055.5041.40

Data Collection Method by dataset:

  • [Unknown]

Labeling Method by dataset:

  • [Unknown]

Inference

Engine: TensorRT-LLM

Test Hardware:

  • NVIDIA H100x2

Ethical Considerations

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

Please report security vulnerabilities or NVIDIA AI Concerns here.