PublicEndpoint - Go SDK

PublicEndpoint type definition

The Go SDK and docs are currently in beta. Report issues on GitHub.

Information about a specific model endpoint

Fields

| Field | Type | Required | Description | Example |
| --- | --- | --- | --- | --- |
| `ContextLength` | `int64` | ✔️ | N/A | |
| `LatencyLast30m` | `*components.PercentileStats` | ✔️ | Latency percentiles in milliseconds over the last 30 minutes. Latency measures time to first token. Only visible when authenticated with an API key or cookie; returns null for unauthenticated requests. | {"p50": 25.5,"p75": 35.2,"p90": 48.7,"p99": 85.3} |
| `MaxCompletionTokens` | `int64` | ✔️ | N/A | |
| `MaxPromptTokens` | `int64` | ✔️ | N/A | |
| `ModelID` | `string` | ✔️ | The unique identifier for the model (permaslug) | openai/gpt-4 |
| `ModelName` | `string` | ✔️ | N/A | |
| `Name` | `string` | ✔️ | N/A | |
| `Pricing` | `components.Pricing` | ✔️ | N/A | |
| `ProviderName` | `components.ProviderName` | ✔️ | N/A | OpenAI |
| `Quantization` | `*components.PublicEndpointQuantization` | ✔️ | N/A | fp16 |
| `Status` | `*components.EndpointStatus` | | N/A | 0 |
| `SupportedParameters` | `[]components.Parameter` | ✔️ | N/A | |
| `SupportsImplicitCaching` | `bool` | ✔️ | N/A | |
| `Tag` | `string` | ✔️ | N/A | |
| `ThroughputLast30m` | `*components.PercentileStats` | ✔️ | N/A | {"p50": 25.5,"p75": 35.2,"p90": 48.7,"p99": 85.3} |
| `UptimeLast1d` | `float64` | ✔️ | Uptime percentage over the last 1 day, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. | |
| `UptimeLast30m` | `float64` | ✔️ | N/A | |
| `UptimeLast5m` | `float64` | ✔️ | Uptime percentage over the last 5 minutes, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. | |
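
The uptime formula and the nullable latency percentiles described in the field list can be illustrated with a minimal sketch. It does not import the SDK: `percentileStats` is a hypothetical local stand-in for `components.PercentileStats`, with JSON keys assumed from the example values shown, and `uptimePercent` and `describeLatency` are illustrative helpers that are not part of the SDK.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// percentileStats is a hypothetical stand-in for components.PercentileStats.
// The JSON keys are assumed from the example value in the field list.
type percentileStats struct {
	P50 float64 `json:"p50"`
	P75 float64 `json:"p75"`
	P90 float64 `json:"p90"`
	P99 float64 `json:"p99"`
}

// uptimePercent applies the documented formula:
// successful / (successful + error) * 100.
// Rate-limited requests are excluded simply by not being counted in either
// argument. ok is false when there is no data, mirroring the documented
// null result for insufficient data.
func uptimePercent(successful, failed int64) (pct float64, ok bool) {
	total := successful + failed
	if total == 0 {
		return 0, false
	}
	return float64(successful) / float64(total) * 100, true
}

// describeLatency guards against a nil pointer, which is how a null
// latency value (for example, on an unauthenticated request) would
// surface in a pointer-typed field.
func describeLatency(stats *percentileStats) string {
	if stats == nil {
		return "latency unavailable"
	}
	return fmt.Sprintf("p50=%.1fms p99=%.1fms", stats.P50, stats.P99)
}

func main() {
	// Example payload copied from the field list.
	raw := []byte(`{"p50": 25.5, "p75": 35.2, "p90": 48.7, "p99": 85.3}`)

	var stats percentileStats
	if err := json.Unmarshal(raw, &stats); err != nil {
		panic(err)
	}

	fmt.Println(describeLatency(&stats))
	fmt.Println(describeLatency(nil))

	if pct, ok := uptimePercent(3, 1); ok {
		fmt.Printf("uptime: %.1f%%\n", pct)
	}
}
```

Pointer-typed fields such as `LatencyLast30m` and `ThroughputLast30m` should be checked for nil before dereferencing, as `describeLatency` does here.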