System Status

All systems operational

Operational

Last checked: · Notifications: Subscribe to incidents

Services

Platform components

Telemetry Ingestion Pipeline

Operational

SCADA historian connectors (OSIsoft PI, Wonderware), sensor node data reception, and edge-agent telemetry relay.

90-day uptime
99.7%

Risk Scoring Engine

Operational

ML inference pipeline, hourly segment risk score updates, and spatial join with GIS pipe network data.

90-day uptime
99.9%

Alert Routing Service

Operational

Email and SMS alert dispatch, PagerDuty integration relay, and SCADA historian write-back service.

90-day uptime
99.8%

Customer Dashboard

Operational

Risk map visualization, segment detail views, alert history, and ROI dig-cost dashboard.

90-day uptime
99.6%

REST API

Operational

Platform API endpoints for risk score export, segment query, and GIS integration webhooks.

90-day uptime
100%
History

Recent incidents (90 days)

Resolved Alert Routing Service

Delayed PagerDuty relay — 47 minutes

An upstream PagerDuty API rate limit was reached during a simultaneous multi-utility alert burst at 06:14 MT. Alerts queued and delivered with a 47-minute delay. No alerts were dropped. PagerDuty notified and rate limit configuration adjusted. Post-incident: added exponential back-off and alert queue depth monitoring.

Resolved at 07:01 MT · Duration: 47 min · Impact: Delayed alert delivery to 3 customers

Resolved Telemetry Ingestion

OSIsoft PI connector authentication timeout

A security certificate renewal on the customer-side PI server at a Maricopa County utility triggered authentication failures in the Watsynq edge agent. Telemetry ingestion from that customer paused for 2h 14m while the certificate was re-registered. Risk scoring continued using the last valid telemetry window; no stale alerts were generated.

Resolved at 11:32 MT · Duration: 2h 14min · Impact: Single customer data gap

Resolved Customer Dashboard

Scheduled maintenance — risk map tile cache rebuild

Planned maintenance window for rebuilding the spatial tile cache after a GIS schema update. Dashboard risk map layer was unavailable for 38 minutes (22:05–22:43 MT). Advance notice provided to all customers 48 hours prior. No telemetry or alert routing impact.

Completed at 22:43 MT · Duration: 38 min · Impact: Dashboard map visualization only

No other incidents in the past 90 days.
Notifications

Get email notification for any future incidents affecting your utility.

Email [email protected] with "Status Subscribe" in the subject to be added to the incident notification list.