Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

PolicyService CheckAction RPC — Latency Benchmark Results

Environment

ParameterValue
CPUApple M3 Max
Memory128 GB
OSmacOS 26.2 (Darwin)
Rust1.95.0 (2026-04-14)
Tonic0.13.1
TransportTCP loopback (127.0.0.1)
Profile--release (optimized)

SLA Target

p99 < 5ms end-to-end round-trip (serialize + transport + evaluate + respond).

Criterion Micro-Benchmarks

Reused TCP connection, single client, 100 samples per variant.

Payload VariantDescriptionMeanStd Dev
minimal_llm_callLlmCallContext, no PII77.9 us~1 us
full_tool_call_1kbToolCallContext, ~1KB args_json82.2 us~1 us
worst_case_networkNetworkCallContext, long URL (~400 bytes)81.9 us~1 us

Sustained Load Test (60 seconds)

1,000 req/sec sustained for 60 seconds, 10 concurrent clients, ToolCallContext payload.

MetricValuevs SLA
Total requests60,000
Actual RPS999
p50144 us34x headroom
p95357 us14x headroom
p99803 us6.2x headroom
p9992.65 ms1.9x headroom
max10.89 ms

Verdict

PASS — p99 latency of 803 us is well under the 5ms SLA target with 6.2x headroom.

The max latency (10.89 ms) exceeds 5ms but this is expected for a single outlier in 60,000 requests on a non-isolated workstation. The p999 (2.65 ms) confirms the tail is well-bounded for all practical purposes.


Last updated: 2026-05-04 by Chisanan232