Llama3-ChatQA-1.5-8B Model card
Model Information
Model Summary
Author: NVIDIA
Description
Llama3-ChatQA-1.5 excels at conversational question answering (QA) and retrieval-augmented generation (RAG). Llama3-ChatQA-1.5 is developed using an improved training recipe from ChatQA paper, and it is built on top of Llama-3 base model. Specifically, we incorporate more conversational QA data to enhance its tabular and arithmetic calculation capability. Llama3-ChatQA-1.5 has two variants: Llama3-ChatQA-1.5-8B and Llama3-ChatQA-1.5-70B.
Terms of Use
By accessing this model, you are agreeing to the NVIDIA AI Foundation Models Community License
Additional Information: META LLAMA 3 COMMUNITY LICENSE AGREEMENT.
Reference:
@article{liu2024chatqa,
title={ChatQA: Surpassing GPT-4 on Conversational QA and RAG},
author={Liu, Zihan and Ping, Wei and Roy, Rajarshi and Xu, Peng and Lee, Chankyu and Shoeybi, Mohammad and Catanzaro, Bryan},
journal={arXiv preprint arXiv:2401.10225},
year={2024}}
Resources and Technical Documentation
Model Architecture:
Architecture Type: Transformer Decoder Network
Network Architecture: Llama-3
Inputs and outputs
Input:
Input Type(s): Text
Input Format(s): String
Input Parameters: One-Dimensional (1D)
Output:
Output Type(s): Text
Output Format(s): String
Output Parameters: One-Dimensional (1D)
Ethical Considerations (For NVIDIA Models Only):
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse. For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards [Insert Link to Model Card++ here]. Please report security vulnerabilities or NVIDIA AI Concerns here.