Go SDKAPI ReferenceOperations
ListEndpointsResponse - Go SDK
ListEndpointsResponse - Go SDK
ListEndpointsResponse type definition
The Go SDK and docs are currently in beta. Report issues on GitHub.
Returns a list of endpoints
Fields
| Field | Type | Required | Description | Example |
|---|---|---|---|---|
Data | components.ListEndpointsResponse | ✔️ | List of available endpoints for a model | {"architecture": {"input_modalities": ["text"],"instruct_type": "chatml","modality": "text-\u003etext","output_modalities": ["text"],"tokenizer": "GPT"},“created”: 1692901234, “description”: “GPT-4 is a large multimodal model that can solve difficult problems with greater accuracy.”, “endpoints”: [ {"context_length": 8192,"latency_last_30m": {"p50": 0.25,"p75": 0.35,"p90": 0.48,"p99": 0.85},“max_completion_tokens”: 4096, “max_prompt_tokens”: 8192, “model_name”: “GPT-4”, “name”: “OpenAI: GPT-4”, “pricing”: {"completion": "0.00006","image": "0","prompt": "0.00003","request": "0"},“provider_name”: “OpenAI”, “quantization”: “fp16”, “status”: “default”, “supported_parameters”: [ “temperature”, “top_p”, “max_tokens”, “frequency_penalty”, “presence_penalty” ], “supports_implicit_caching”: true, “tag”: “openai”, “throughput_last_30m”: {"p50": 45.2,"p75": 38.5,"p90": 28.3,"p99": 15.1},“uptime_last_1d”: 99.8, “uptime_last_30m”: 99.5, “uptime_last_5m”: 100<br />}], “id”: “openai/gpt-4”, “name”: “GPT-4” } |