post https://ai.api.nvidia.com/v1/vlm/microsoft/phi-3-vision-128k-instruct
Invokes inference using the model chat parameters. If uploading large images, this POST should be used in conjunction with the NVCF API which allows for the upload of large assets.
You can find details on how to use NVCF Asset APIs here: https://docs.api.nvidia.com/cloud-functions/reference/createasset