Skip to content

Nebius

Nebius is an OpenAI-compatible provider offering comprehensive API support through DeepIntShield. Key features:

  • Full OpenAI compatibility - Chat, text completion, embeddings, and responses
  • Streaming support - Server-Sent Events with delta-based updates
  • AI Project ID - Nebius-specific project identifier support
  • Tool calling - Complete function definition and execution
  • Parameter filtering - Removes unsupported OpenAI-specific fields
OperationNon-StreamingStreamingEndpoint
Chat Completions/v1/chat/completions
Responses API/v1/chat/completions
Text Completions/v1/completions
Embeddings-/v1/embeddings
Image Generation-/v1/images/generations
List Models-/v1/models
Speech (TTS)-
Transcriptions (STT)-
Files-
Batch-

Nebius supports all standard OpenAI chat completion parameters. For full parameter reference and behavior, see OpenAI Chat Completions.

ai_project_id (Optional):

Nebius allows specifying a project ID for resource organization:

Terminal window
curl -X POST https://app.deepintshield.com/v1/chat/completions \
-H "Content-Type: application/json" \
-H "x-bf-vk: $DEEPINTSHIELD_VIRTUAL_KEY" \
-d '{
"model": "nebius/model-name",
"messages": [...],
"ai_project_id": "project-123"
}'

The following parameters are not supported by Nebius and are ignored: prompt_cache_key, verbosity, store, service_tier.

Nebius supports all standard OpenAI message types, tools, responses, and streaming formats. For details on message handling, tools, responses, and streaming, refer to OpenAI Chat Completions.


Nebius supports the Responses API with the same parameter support and message handling as Chat Completions, including ai_project_id via extra_params.


Nebius supports legacy text completion format:

ParameterMapping
promptDirect pass-through
max_tokensmax_tokens
temperature, top_pDirect pass-through
stopStop sequences
frequency_penalty, presence_penaltyPenalty parameters

Nebius supports text embeddings:

ParameterNotes
inputText or array of texts
modelEmbedding model name
encoding_format”float” or “base64”
dimensionsCustom output dimensions (optional)

Response returns embedding vectors with usage information.


Request Parameters

ParameterTypeRequiredNotes
modelstringModel identifier
promptstringText description of the image to generate
sizestringImage size in WxH format (e.g., "1024x1024")
output_formatstringOutput format: "png", "jpeg", "webp"
response_formatstringResponse format: "url" or "b64_json"
seedintSeed for reproducible generation
negative_promptstringNegative prompt
num_inference_stepsintNumber of inference steps
extra_paramsobjectNebius-specific parameters (see below)

Extra Parameters (via extra_params)

ParameterTypeNotes
guidance_scaleintGuidance scale (0-100)
ai_project_idstringNebius project ID

Response Behavior

Each returned image includes url, b64_json, revised_prompt, and a sequential index (0, 1, 2, …).

Endpoint: /v1/images/generations

Streaming: Image generation streaming is not supported by Nebius.


Lists available Nebius models with capabilities and context lengths.


FeatureReason
Speech/TTSNot offered by Nebius API
Transcription/STTNot offered by Nebius API
Batch OperationsNot offered by Nebius API
File ManagementNot offered by Nebius API

Cache Control Stripped

Severity: Medium Behavior: Cache control directives are removed from messages Impact: Prompt caching features don’t work

Parameter Filtering

Severity: Low Behavior: OpenAI-specific fields filtered out Impact: prompt_cache_key, verbosity, store removed

User Field Size Limit

Severity: Low Behavior: User field > 64 characters silently dropped Impact: Longer user identifiers are lost