t3.medium

Instance Configuration

AWS t3.medium Specifications:

DeepIntShield Configuration:

Metric	Value	Notes
Success Rate	100.00%	Perfect reliability under high load
Average Request Size	0.13 KB	Lightweight request payload
Average Response Size	1.37 KB	Standard response size for testing
Average Latency	2.12s	Total end-to-end response time
Peak Memory Usage	1,312.79 MB	~33% of available 4GB RAM

Almost all end-to-end latency is the upstream provider API call - the gateway itself adds only microseconds.

Component	Latency	Notes
Upstream provider call	1.56s	The actual model API request (unavoidable in any setup)
DeepIntShield overhead	59 µs	Added latency from the gateway

DeepIntShield’s Total Overhead: 59 µs*

*Excludes the provider API call and JSON serialization, which are required in any implementation

Memory Usage: Very efficient at 1,312.79 MB peak usage
CPU Performance: Handles 5,000 RPS workload effectively
Stability: No failed requests or throughput degradation under sustained load

Based on test results, these configurations work well:

{
  "client": {
    "initial_pool_size": 10000,
    "buffer_size": 15000
  }
}

For Lower Memory Usage:

For Better Performance:

Metric	t3.medium	t3.xlarge	Difference
DeepIntShield Overhead	59 µs	11 µs	+81% slower
Average Latency	2.12s	1.61s	+24% slower
Memory Usage	1,312.79 MB	3,340.44 MB	-61% usage

Key Insights:

When to upgrade to t3.xlarge: