OPS — Control Planedashboard.nata.onl — Control Plane
Route zoneOps
production / modelsops.models / simulated saturation
Alerts 1
Available3

Models ready for routing.

Guarded1

Models behind saturation or budget guardrails.

Avg latency1358 ms

Average latency across all visible models.

Queue depth14

Combined queue depth on the compute layer.

Model status grid

Name, host, queue and latency.

local-llama-70b

Keep-alive, fallback route and recent errors.

Host mac-studio-mainParallel limit 8Keep-alive 15m
Fallback route

gpt-5.2-mini

Config snapshot
  • context 64k
  • quant q6_k
  • semantic-recall=true
Recent errors
  • none in last 6h

Routing matrix

Primary-to-fallback mapping required by the models contract.

ModelHostFallbackQueue depth
local-llama-70bmac-studio-maingpt-5.2-mini6
gpt-5.2premium-remotegpt-5.2-mini2
claude-sonnetpremium-remotelocal-llama-70b4
local-mistralvps-prod-2gpt-5.2-mini2

Rollback action modal

Config rollback readiness and guarded change notes.

Selected config snapshot
  • context 64k
  • quant q6_k
  • semantic-recall=true
Recent errors
  • none in last 6h