google / deplot

Model Overview

Description:

The Google DePlot model is a one-shot visual language understanding solution
that translates images of plots or charts into linearized tables

📗

Note

This API is used in conjunction with the NVCF large assets API.

Terms of use

By using this model, you are agreeing to the terms and conditions of the
license,
acceptable use policy and
Google Research privacy policy.

References(s):

Model Architecture:

Architecture Type: Transformer

Network Architecture: Pix2Struct

Input:

Input Format: Red, Green, Blue (RGB) Image + Text

Input Parameters: None

Other Properties Related to Input: None

Output:

Output Format: Text

Output Parameters: temperature, top_p, max_tokens

Other Properties Related to Output: stream

Supported Operating System(s):

Linux

Inference:

Engine: Triton

Test Hardware: Other