Jump to Content
API Reference
API Reference
API Reference
Status polling
Search
JUMP TO
Introduction
Models
Large Language models
LLM APIs
01-ai / yi-large
Creates a model response for the given chat conversation.
post
abacusai / dracarys-llama-3.1-70b-instruct
Creates a model response for the given chat conversation.
post
ai21labs / jamba-1.5-large-instruct
Creates a model response for the given chat conversation.
post
ai21labs / jamba-1.5-mini-instruct
Creates a model response for the given chat conversation.
post
aisingapore / sea-lion-7b-instruct
Create a chat completion
post
baichuan-inc / baichuan2-13b-chat
Creates a model response for the given chat conversation.
post
bigcode / starcoder2-7b
Create Completion
post
bigcode / starcoder2-15b
Create Completion
post
databricks / dbrx-instruct
Create a chat completion
post
deepseek-ai / deepseek-r1
Creates a model response for the given chat conversation.
post
deepseek-ai / deepseek-r1-distill-llama-8b
Creates a model response for the given chat conversation.
post
deepseek-ai / deepseek-r1-distill-qwen-7b
Creates a model response for the given chat conversation.
post
deepseek-ai / deepseek-r1-distill-qwen-14b
Creates a model response for the given chat conversation.
post
deepseek-ai / deepseek-r1-distill-qwen-32b
Creates a model response for the given chat conversation.
post
google / codegemma-1.1-7b
Creates a model response for the given chat conversation.
post
google / codegemma-7b
Create a chat completion
post
google / gemma-2b
Create a chat completion
post
google / gemma-7b
Create a chat completion
post
google / gemma-2-2b-it
Creates a model response for the given chat conversation.
post
google / gemma-2-9b-it
Creates a model response for the given chat conversation.
post
google / gemma-2-27b-it
Creates a model response for the given chat conversation.
post
google / gemma-3-1b-it
Creates a model response for the given chat conversation.
post
google / recurrentgemma-2b
Create a chat completion
post
google / shieldgemma-9b
Creates a model response for the given chat conversation.
post
ibm / granite-3.0-3b-a800m-instruct
Creates a model response for the given chat conversation.
post
ibm / granite-3.0-8b-instruct
Creates a model response for the given chat conversation.
post
ibm / granite-34b-code-instruct
Creates a model response for the given chat conversation.
post
ibm / granite-8b-code-instruct
Creates a model response for the given chat conversation.
post
ibm / granite-guardian-3.0-8b
Creates a model response for the given chat conversation.
post
igenius / colosseum_355b_instruct_16k
Creates a model response for the given chat conversation.
post
igenius / italia_10b_instruct_16k
Creates a model response for the given chat conversation.
post
institute-of-science-tokyo / llama-3.1-swallow-70b-instruct-v01
Creates a model response for the given chat conversation.
post
institute-of-science-tokyo / llama-3.1-swallow-8b-instruct-v0.1
Creates a model response for the given chat conversation.
post
mediatek / breeze-7b-instruct
Creates a model response for the given chat conversation.
post
meta / codellama-70b
Create a chat completion
post
meta / llama2-70b
Create a chat completion
post
meta / llama3-8b
Creates a chat completion
post
meta / llama3-70b
Creates a chat completion
post
meta / llama-3.1-8b-instruct
Creates a model response for the given chat conversation.
post
meta / llama-3.1-70b-instruct
Creates a model response for the given chat conversation.
post
meta / llama-3.1-405b-instruct
Creates a model response for the given chat conversation.
post
meta / llama-3.2-1b-instruct
Creates a model response for the given chat conversation.
post
meta / llama-3.2-3b-instruct
Creates a model response for the given chat conversation.
post
meta / llama-3.3-70b-instruct
Creates a model response for the given chat conversation.
post
microsoft / phi-3-medium-128k-instruct
Creates a model response for the given chat conversation.
post
microsoft / phi-3-medium-4k-instruct
Creates a chat completion
post
microsoft / phi-3-mini-128k-instruct
Creates a model response for the given chat conversation.
post
microsoft / phi-3-mini-4k-instruct
Creates a model response for the given chat conversation.
post
microsoft / phi-3-small-128k-instruct
Creates a chat completion
post
microsoft / phi-3-small-8k-instruct
Create a chat completion
post
microsoft / phi-3.5-mini
Creates a model response for the given chat conversation.
post
microsoft / phi-3.5-moe-instruct
Creates a model response for the given chat conversation.
post
microsoft / phi-4-mini-instruct
Creates a model response for the given chat conversation.
post
mistralai / codestral-22b-instruct-v0.1
Creates a model response for the given chat conversation.
post
mistralai / mamba-codestral-7b-v0.1
Creates a model response for the given chat conversation.
post
mistralai / mistral-2-large-instruct
Creates a model response for the given chat conversation.
post
mistralai / mathstral-7b-v01
Creates a model response for the given chat conversation.
post
mistralai / mistral-7b-instruct
Create a chat completion
post
mistralai / mistral-7b-instruct-v0.3
Creates a model response for the given chat conversation.
post
mistralai / mixtral-8x7b-instruct
Create a chat completion
post
mistralai / mixtral-8x22b-instruct
Create a chat completion
post
mistralai / mistral-large
Create a chat completion
post
mistralai / mistral-small-24b-instruct
Creates a model response for the given chat conversation.
post
nvidia / llama3-chatqa-1.5-8b
Creates a model response for the given chat conversation.
post
nvidia / llama-3.1-nemoguard-8b-content-safety
Creates a model response for the given chat conversation.
post
nvidia / llama-3.1-nemoguard-8b-topic-control
Creates a model response for the given chat conversation.
post
nvidia/llama-3.1-nemotron-nano-8b-v1
Creates a model response for the given chat conversation.
post
nvidia / llama-3.1-nemotron-51b-instruct
Creates a model response for the given chat conversation.
post
nvidia/llama-3.1-nemotron-70b-instruct
Creates a model response for the given chat conversation.
post
nvidia / llama-3.1-nemotron-70b-reward
Creates a model response for the given chat conversation.
post
nvidia / llama-3_1-nemotron-ultra-253b-v1
Creates a model response for the given chat conversation.
post
nvidia/llama-3.3-nemotron-super-49b-v1
Creates a model response for the given chat conversation.
post
nvidia / llama3-chatqa-1.5-70b
Creates a model response for the given chat conversation.
post
nvidia / mistral-nemo-minitron-8b-base
Create Completion
post
nvidia / mistral-nemo-minitron-8b-8k-instruct
Creates a model response for the given chat conversation.
post
nvidia / nemoguard-jailbreak-detect
Classify text for jailbreak attempt.
post
nvidia / nemotron-4-340b-instruct
Creates a model response for the given chat conversation.
post
nvidia / nemotron-4-340b-reward
Creates a model response for the given chat conversation.
post
nvidia / nemotron-4-mini-hindi-4b-instruct
Creates a model response for the given chat conversation.
post
nvidia / nemotron-mini-4b-instruct
Creates a model response for the given chat conversation.
post
nvidia / usdcode
Creates a model response for the given chat conversation.
post
nvidia / usdsearch
Search Post
post
nv-mistralai / mistral-nemo-12b-instruct
Creates a model response for the given chat conversation.
post
qwen / qwen2-7b-instruct
Creates a model response for the given chat conversation.
post
qwen / qwen2.5-7b-instruct
Creates a model response for the given chat conversation.
post
qwen / qwen2.5-coder-7b-instruct
Creates a model response for the given chat conversation.
post
qwen / qwen2.5-coder-32b-instruct
Creates a model response for the given chat conversation.
post
qwen / qwq-32b
Creates a model response for the given chat conversation.
post
rakuten / rakutenai-7b-chat
Creates a model response for the given chat conversation.
post
rakuten / rakutenai-7b-instruct
Creates a model response for the given chat conversation.
post
seallms / seallm-7b-v2.5
Create a chat completion
post
snowflake / arctic
Create a chat completion
post
tokyotech-llm / llama-3-swallow-70b-instruct-v01
Creates a model response for the given chat conversation.
post
thudm / chatglm3-6b
Creates a model response for the given chat conversation.
post
tiiuae / falcon3-7b-instruct
Creates a model response for the given chat conversation.
post
upstage / solar-10.7b-instruct
Creates a model response for the given chat conversation.
post
writer / palmyra-creative-122b
Creates a model response for the given chat conversation.
post
writer / palmyra-fin-70b-32k
Creates a model response for the given chat conversation.
post
writer / palmyra-med-70b-32k
Create a chat completion
post
writer / palmyra-med-70b
Create a chat completion
post
yentinglin / llama-3-taiwan-70b-instruct
Creates a model response for the given chat conversation.
post
zyphra/zamba2-7b-instruct
Creates a model response for the given chat conversation.
post
Retrieval
Retrieval APIs
baai / bge-m3
Creates an embedding vector from the input text.
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / embed-qa-4
Create embedding vector
post
nvidia / llama-3.2-nv-embedqa-1b-v1
Creates an embedding vector from the input text.
post
nvidia / llama-3.2-nv-embedqa-1b-v2
Creates an embedding vector from the input text.
post
nvidia / llama-3.2-nv-rerankqa-1b-v1
Rank passages by their relation to a query.
post
nvidia / llama-3.2-nv-rerankqa-1b-v2
Rank passages by their relation to a query.
post
nvidia / nvclip
Creates an embedding vector representing the input text or image.
post
nvidia / nv-embed-v1
Creates an embedding vector from the input text.
post
nvidia / nv-embedcode-7b-v1
Creates an embedding vector from the input text.
post
nvidia / nv-embedqa-e5-v5
Creates an embedding vector from the input text.
post
nvidia / nv-embedqa-mistral-7b-v2
Creates an embedding vector from the input text.
post
nvidia / nv-rerankqa-mistral-4b-v3
Rank passages by their relation to a query.
post
nvidia / rerank-qa-mistral-4b
Create ranking
post
snowflake / arctic-embed-l
Creates an embedding vector from the input text.
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
Visual Models
Visual Models APIs
black-forest-labs / flux.1-dev
Infer
post
briaai / bria-2.3
Request generation
post
google / gemma-3-27b-it
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
hive / ai-generated-image-detection
Request response from the model
post
hive / deepfake-image-detection
Infer
post
meta / sam2
Run inference on the input image/video for a given prompt.
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
microsoft / phi-4-multimodal-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
mistralai / mistral-medium-3-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / bevformer
Post V1 Bevformer Process
post
nvidia / consistory
Request generation
post
nvidia / cosmos-1.0-7b-diffusion-text2world
nvidia / nemoretriever-parse
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / nv-dinov2
Run inference on the input image.
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / nv-grounding-dino
Run inference on the input image/video for a given text prompt
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / ocdrnet
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / retail-object-detection
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / sparsedrive
Post V1 Sparsedrive Inference
post
nvidia / vila
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / visual-changenet
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
stabilityai / stable-diffusion-3-medium
Request generation
post
stabilityai / sdxl-turbo
Request generation
post
stabilityai / stable-diffusion-xl
Request generation
post
stabilityai / stable-video-diffusion
Request generation
post
multimodAl
Multimodal APIs
adept / fuyu-8b
Request response from the model
post
Status polling
get
google / deplot
Request response from the model
post
Status polling
get
google / paligemma
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
meta/llama-3.2-11b-vision-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
meta/llama-3.2-90b-vision-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
meta / llama-4-maverick-17b-128e-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
meta / llama-4-scout-17b-16e-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
microsoft / florence-2
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
microsoft / kosmos-2
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
microsoft / phi-3-vision-128k-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
microsoft / phi-3.5-vision-instruct
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
nvidia / neva-22b
Request response from the model
post
Gets the result of an earlier function invocation request that returned a status of 202.
get
Healthcare
Healthcare APIs
arc / evo2-40b
Generate DNA sequences
post
colabfold / msa-search
Nim Api Post Call Msa Search Post
post
deepmind / alphafold2
Predict Structure From Sequence Post
post
deepmind / alphafold2-multimer
Predict Structure From Sequence Post
post
ipd / proteinmpnn
Predict amino acid sequences
post
ipd / rfdiffusion
Run RFdiffusion Protein Generation
post
meta / esmfold
Predict protein structure (alignment-free)
post
meta / esm2-650m
Protein Embeddings
post
mit / diffdock
Molecular Docking Pose Generation
post
nvidia / deepvariant
Run Parabricks Universal Variant Calling
post
nvidia / fq2bam
Run Parabricks fq2bam to align sequence reads
post
nvidia / genmol
Molecular Generation
post
nvidia / maisi
Generate Image
post
nvidia / molmim
Perform molecule generation
post
nvidia / vista3d
Run Inference
post
openfold / openfold2
Nim Api Post Call Monomer Structure From Msa And Template
post
route optimization
Route Optimization APIs
nvidia / cuOpt
Submit to solver
post
Status polling
get
climate simulation
Climate Simulation APIs
nvidia / corrdiff
Inference
post
nvidia / fourcastnet
Runs FourCastNet inference.
post
Status polling
get
https://ai.api.nvidia.com/v1/status/
{requestId}
Language
Shell
Node
Ruby
PHP
Python
Credentials
Bearer
Bearer
RESPONSE
Click
Try It!
to start a request and see the response here!