v1.0.0 - Initial release

The first release of DeepintShield - one API that fronts every model with real-time guardrails, cost optimization, agentic tool-call authorization, and full observability.

Highlights

Unified AI gateway

One OpenAI-compatible endpoint to 22+ providers and 2500+ models - OpenAI, Anthropic, Google, Bedrock, and more.
Drop-in SDK compatibility: point OpenAI, Anthropic, Google GenAI, Bedrock, LangChain, LiteLLM, or PydanticAI at DeepintShield unchanged.
Automatic fallbacks, retries, load balancing, and conditional (rule-based) routing.

Real-time AI guardrails

Prompt-injection, jailbreak, toxicity, PII, and secret detection across input, output, RAG, and tool stages - with sub-5ms latency on clean traffic.
One-click OWASP LLM Top 10 cards and industry domain packs.
Five decision outcomes: allow, redact, require approval, sandbox, deny.

PII redaction & data protection

Inline detection and masking of PII, PHI, and secrets at the request, response, and tool boundaries.

RAG security

Per-chunk allow / redact / reject / quarantine with source trust scoring, citation enforcement, and one-line retriever/embedder guards in the SDK.

Agentic security (PEP/PDP)

Authorize every agent tool-call (allow / deny / require approval / mask) with a visual policy builder, agent identity, tool integrity checks, AIBOM export, and a tamper-evident decision audit. Roll out in Shadow, Canary, or Enforce mode.

Cost optimization

Semantic caching, provider prompt caching, cascade routing, request coalescing, prompt compression, and reasoning-effort throttling - cutting LLM spend by up to 90%.

MCP gateway

Broker Model Context Protocol servers, group and scope tools, per-tool auth, and a sandboxed code mode.

Governance & accuracy

Virtual keys with budgets, rate limits, model allow-lists, and guardrail/cache policy; hallucination defense and response-consistency controls.

Observability

Real-time dashboards, per-request tracing, OpenTelemetry export, and agentic traces with scoring.

SDKs

Unified Python SDK with one-line tool-gating binders for LangGraph, CrewAI, OpenAI Agents, LlamaIndex, AutoGen, and PydanticAI.

Deployment

Cloud - hosted at app.deepintshield.com; start in minutes.
Enterprise VPC / Self-Hosted - run the data plane in your own environment. See the deployment guides.