Skip to content

Parasail

Parasail is an OpenAI-compatible provider offering high-performance inference through DeepIntShield. Key characteristics:

  • OpenAI API compatibility - Identical request/response format
  • Full streaming support - Server-Sent Events with usage tracking
  • Tool calling - Complete function definition and execution
  • Parameter filtering - Unsupported OpenAI-specific fields are removed before the call
  • Responses API - Supported with the same parameters as Chat Completions
OperationNon-StreamingStreamingEndpoint
Chat Completions/v1/chat/completions
Responses API/v1/chat/completions
List Models-/v1/models
Text Completions-
Embeddings-
Image Generation-
Speech (TTS)-
Transcriptions (STT)-
Files-
Batch-

Parasail supports all standard OpenAI chat completion parameters. For full parameter reference and behavior, see OpenAI Chat Completions.

The following parameters are not supported by Parasail and are ignored: prompt_cache_key, verbosity, store, service_tier.

Reasoning via standard OpenAI format:

{
"model": "parasail-llama-33-70b-fp8",
"messages": [...],
"reasoning_effort": "high"
}

Parasail supports all standard OpenAI message types, tools, responses, and streaming formats. For details on message handling, tools, responses, and streaming, refer to OpenAI Chat Completions.


Parasail supports the Responses API with the same parameter support as Chat Completions.


Lists available Parasail models with capabilities and context information.


FeatureReason
Text CompletionsNot offered by Parasail API
EmbeddingNot offered by Parasail API
Image GenerationNot offered by Parasail API
Speech/TTSNot offered by Parasail API
Transcription/STTNot offered by Parasail API
Batch OperationsNot offered by Parasail API
File ManagementNot offered by Parasail API

Cache Control Stripped

Severity: Medium Behavior: Cache control directives are removed from messages Impact: Prompt caching features don’t work

Parameter Filtering

Severity: Low Behavior: OpenAI-specific parameters filtered out Impact: prompt_cache_key, verbosity, store removed

User Field Size Limit

Severity: Low Behavior: User field > 64 characters silently dropped Impact: Longer user identifiers are lost