Overview
The Large Language Model (LLM) NIM API endpoints provide simple access to use natural language based generative AI. This single API endpoint provides access to top models for use in a wide range of tasks including: chat, instruction following, question answering, summarization, creative text generation, and code generation.
NOTE: Select models are available as downloadable container images and supported with an NVIDIA AI Enterprise entitlement. These select models have additional OpenAI API spec details for running self-hosted localized NIMs. Please refer to the Downloadable NIM documentation for additional information.
URL: https://integrate.api.nvidia.com
Endpoint: POST /v1/chat/completions
Models
01-ai
abacusai
aisingapore
Model | Endpoint |
---|---|
aisingapore / sea-lion-7b-instruct | [Create a chat completion (sea-lion-7b-instruct)](ref:create_chat_completion_v1_chat_completions_post |
bigcode
databricks
deepseek
google
ibm
mediatek
meta
microsoft
mistralai
nvidia
qwen
rakuten
seallms
snowflake
Model | Endpoint |
---|---|
snowflake / arctic | Create chat completion (arctic) |