Loading...
Loading...
System Health
Know the health of every component, every second. Track incidents from discovery through resolution with automated alerting.
Capabilities
Track health of database, API, queue, cache, storage, and integrations with response time metrics.
Full incident lifecycle from investigating through identified, monitoring, and resolved with severity tracking.
Historical uptime data for every component. Track reliability trends and SLA compliance.
Response time tracking for every health check. Identify performance degradation before it impacts users.
Automated alerts when components degrade or go down. Route to the right team instantly.
Public or private status page showing current system health and recent incident history.
Getting Started
Set up health checks for each infrastructure component with check frequency and thresholds.
View real-time status of all components on the health dashboard with response time metrics.
Create and track incidents through their lifecycle. Associate affected components and notify stakeholders.
Developer Integration
import { DRD } from '@drd/sdk';
const drd = new DRD({ token: 'drd_live_sk_...' });
// Check current system health
const health = await drd.systemHealth.getCurrent();
health.forEach(c => console.log(c.component, c.status, c.responseTimeMs));
// Create incident
await drd.systemHealth.createIncident({
title: 'API latency spike',
severity: 'high',
affectedComponents: ['api', 'cache'],
});Component monitoring, incident management, and automated alerting. Keep your infrastructure reliable.