Infrastructure Monitoring

Monitor AI processing infrastructure and system health.

Total GPU Workers: 24 (change: 9%)
Active Workers: 18 (change: 12%)
Jobs in Queue: 47 (change: 15%)
Average Processing Latency: 2.3s (change: 8%)
System Uptime: 99.8% (change: 0.2%)

Worker Activity Over Time

[Chart: active vs. idle workers over time; series: Active Workers, Idle Workers]

System Alerts & Warnings

Worker offline

12 minutes ago

gpu-worker-005 has been offline for 12 minutes

Queue backlog detected

18 minutes ago

47 jobs waiting in queue, exceeding normal threshold

High latency detected

35 minutes ago

Average processing time increased to 2.8s (normal: 2.1s)
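
The queue and latency alerts above could be produced by a rule check along the following lines. This is only a sketch under stated assumptions: the dashboard does not say what the "normal threshold" for the queue is or how far latency must rise before an alert fires, so `queue_threshold=40` and `latency_factor=1.25` are hypothetical values, and `Snapshot` and `evaluate_alerts` are illustrative names rather than part of the monitored system. Only the 2.1s baseline comes from the alert text.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Snapshot:
    """One reading of the metrics shown on the dashboard."""
    queue_depth: int        # jobs waiting in queue
    avg_latency_s: float    # average processing time in seconds


def evaluate_alerts(
    snapshot: Snapshot,
    queue_threshold: int = 40,          # ASSUMPTION: not stated on the dashboard
    baseline_latency_s: float = 2.1,    # "normal" latency from the alert text
    latency_factor: float = 1.25,       # ASSUMPTION: alert at 25% above baseline
) -> List[str]:
    """Return alert messages for any metric outside its assumed limit."""
    alerts = []
    if snapshot.queue_depth > queue_threshold:
        alerts.append(
            f"Queue backlog detected: {snapshot.queue_depth} jobs waiting in queue"
        )
    if snapshot.avg_latency_s > baseline_latency_s * latency_factor:
        alerts.append(
            f"High latency detected: {snapshot.avg_latency_s:.1f}s "
            f"(normal: {baseline_latency_s:.1f}s)"
        )
    return alerts


if __name__ == "__main__":
    for message in evaluate_alerts(Snapshot(queue_depth=47, avg_latency_s=2.8)):
        print(message)
```

With the assumed limits, a reading of 47 queued jobs and 2.8s latency raises both alerts, while a reading at the 2.1s baseline with a short queue raises none.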

Autoscaling Activity

Worker Added

12 minutes ago

Added gpu-worker-024 in US-East region due to high queue load

Scaling Trigger

15 minutes ago

Queue threshold exceeded: 50+ jobs waiting

Worker Added

28 minutes ago

Added gpu-worker-023 in EU-Central region

Worker Removed

1 hour ago

Removed gpu-worker-019 due to low demand

Scaling Trigger

1 hour ago

CPU usage dropped below 30% threshold
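
The two triggers in this log (queue at 50+ jobs, CPU usage below 30%) suggest a simple threshold-based scaling rule. The sketch below is one way such a rule might look, not the system's actual autoscaler: the function name `scaling_decision`, the use of an average CPU figure, and the choice to let the scale-up check take precedence over the scale-down check are all assumptions.

```python
def scaling_decision(
    queue_depth: int,
    avg_cpu_percent: float,
    scale_up_queue: int = 50,       # from the log: "50+ jobs waiting"
    scale_down_cpu: float = 30.0,   # from the log: "below 30% threshold"
) -> str:
    """Return 'add_worker', 'remove_worker', or 'hold'.

    ASSUMPTION: a backlog always wins over low CPU usage, so the
    scale-up check runs first.
    """
    if queue_depth >= scale_up_queue:
        return "add_worker"
    if avg_cpu_percent < scale_down_cpu:
        return "remove_worker"
    return "hold"


if __name__ == "__main__":
    print(scaling_decision(queue_depth=52, avg_cpu_percent=70.0))
    print(scaling_decision(queue_depth=5, avg_cpu_percent=20.0))
    print(scaling_decision(queue_depth=20, avg_cpu_percent=55.0))
```

A production autoscaler would normally add a cooldown period between actions so that workers are not added and removed in quick succession; that is omitted here for brevity.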

Job Queue Status

Jobs Waiting in Queue: 47
Jobs Currently Processing: 18
Estimated Wait Time: 4.2 min

Worker ID       Region        Status  Current Jobs  CPU Usage  GPU Usage  Memory Usage  Last Heartbeat
gpu-worker-001  US-East       Active  3             68%        92%        74%           2 seconds ago
gpu-worker-002  US-West       Active  2             54%        87%        69%           1 second ago
gpu-worker-003  EU-Central    Idle    0             12%        8%         34%           5 seconds ago
gpu-worker-004  Asia-Pacific  Active  4             78%        95%        81%           3 seconds ago
gpu-worker-005  US-East       Error   0             0%         0%         0%            12 minutes ago
gpu-worker-006  EU-West       Active  1             45%        76%        58%           4 seconds ago
gpu-worker-007  US-West       Idle    0             15%        10%        38%           6 seconds ago
gpu-worker-008  Asia-Pacific  Active  2             62%        88%        72%           2 seconds ago

Showing 1 to 8 of 10 workers
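
The Last Heartbeat column is what makes an offline worker such as gpu-worker-005 visible. As a rough illustration, the relative timestamps in the table can be converted to seconds and compared against a staleness limit. The 60-second limit and the helper names `heartbeat_age_seconds` and `stale_workers` are assumptions made for this sketch; the dashboard does not state how long a worker may be silent before it is marked as in error.

```python
import re
from typing import Dict, List

# Seconds per unit for the relative timestamps shown in the table.
_UNIT_SECONDS = {"second": 1, "minute": 60, "hour": 3600}


def heartbeat_age_seconds(text: str) -> int:
    """Convert a string such as '12 minutes ago' into a number of seconds."""
    match = re.fullmatch(r"(\d+) (second|minute|hour)s? ago", text.strip())
    if match is None:
        raise ValueError(f"unrecognized heartbeat text: {text!r}")
    return int(match.group(1)) * _UNIT_SECONDS[match.group(2)]


def stale_workers(heartbeats: Dict[str, str], max_age_s: int = 60) -> List[str]:
    """Return IDs of workers whose last heartbeat is older than max_age_s.

    ASSUMPTION: 60 seconds is a hypothetical limit, not taken from the dashboard.
    """
    return [
        worker_id
        for worker_id, text in heartbeats.items()
        if heartbeat_age_seconds(text) > max_age_s
    ]


if __name__ == "__main__":
    table = {
        "gpu-worker-001": "2 seconds ago",
        "gpu-worker-003": "5 seconds ago",
        "gpu-worker-005": "12 minutes ago",
    }
    print(stale_workers(table))
```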