LLM APIs

Overview

The Large Language Model (LLM) NIM API endpoints provide simple access to use natural language based generative AI. This single API endpoint provides access to top models for use in a wide range of tasks including: chat, instruction following, question answering, summarization, creative text generation, and code generation.

NOTE: Select models are available as downloadable container images and supported with an NVIDIA AI Enterprise entitlement. These select models have additional OpenAI API spec details for running self-hosted localized NIMs. Please refer to the Downloadable NIM documentation for additional information.

URL: https://integrate.api.nvidia.com

Endpoint: POST /v1/chat/completions

Models

01-ai


abacusai


aisingapore

ModelEndpoint
aisingapore / sea-lion-7b-instruct[Create a chat completion (sea-lion-7b-instruct)](ref:create_chat_completion_v1_chat_completions_post

bigcode


databricks


deepseek


google


ibm


mediatek


meta


microsoft


mistralai


nvidia


qwen


rakuten


seallms


snowflake


upstage