Skip to content

Groq

Groq is an OpenAI-compatible provider offering the same API interface with identical parameter handling. Key features:

  • Full OpenAI compatibility - Identical request/response format
  • Streaming support - Server-Sent Events with delta-based updates
  • Tool calling - Complete function definition and execution support
  • Text completion fallback - Via litellm compatibility mode when enabled
OperationNon-StreamingStreamingEndpoint
Chat Completions/v1/chat/completions
Responses API/v1/chat/completions
Text Completions⚠️⚠️Served via Chat Completions
List Models-/v1/models
Embeddings-
Image Generation-
Speech (TTS)-
Transcriptions (STT)-
Files-
Batch-

Groq supports all standard OpenAI chat completion parameters. For full parameter reference and behavior, see OpenAI Chat Completions. The following parameters are not supported and are ignored: prompt_cache_key, verbosity, store, service_tier.

Groq supports reasoning via the standard reasoning_effort field:

{
"model": "llama-3.3-70b-versatile",
"messages": [...],
"reasoning_effort": "high"
}

Groq uses OpenAI message format with the following content type support:

  • ✅ Text content (strings)
  • ❌ Images (neither URL nor base64)
  • ❌ Audio input
  • ❌ Files

For all other message handling, tools, responses, and streaming formats, refer to OpenAI Chat Completions.


Groq supports the Responses API with the same parameters and message handling as Chat Completions. Responses are returned in Responses format, with output items instead of message content.


When enabled, text completion requests are served through Groq’s Chat Completions API.

Limitations:

  • Behaves like the chat API (different from a native text completion)
  • Single choice only (n=1)
  • Streaming not available

Groq’s model listing endpoint returns available models with their context lengths and capabilities.


FeatureReason
Image URLsGroq doesn’t support image inputs
Image Base64Groq doesn’t support image inputs
Multiple ImagesGroq doesn’t support image inputs
EmbeddingNot offered by Groq API
Speech/TTSNot offered by Groq API
Transcription/STTNot offered by Groq API
Batch OperationsNot offered by Groq API
File ManagementNot offered by Groq API

User Field Size Limit

Severity: Low Behavior: User field > 64 characters is silently dropped Impact: Longer user identifiers are lost