SLOs
Measurable service commitments
Last reviewed: 2025-01-25
SLOs
Service Level Objectives define measurable commitments for ΔOS performance and reliability.
SLOs are measurable targets. Guarantees are properties enforced by design. SLOs can be missed (rarely); guarantees cannot be violated.
Availability SLOs
Intent Evaluation Availability
| Metric | Target | Measurement |
|---|---|---|
| Availability | 99.95% | Successful evaluations / attempted evaluations |
| Measurement window | 30 days rolling | |
| Exclusions | Scheduled maintenance (announced 7 days prior) |
What this means: In a 30-day period, you can expect less than 22 minutes of evaluation unavailability.
Audit Recording Availability
| Metric | Target | Measurement |
|---|---|---|
| Availability | 99.99% | Successful recordings / attempted recordings |
| Measurement window | 30 days rolling |
What this means: Audit records are nearly always recorded synchronously. In rare cases, recording may be delayed but data is never lost.
Latency SLOs
Intent Evaluation Latency
| Percentile | Target | Conditions |
|---|---|---|
| P50 | < 20ms | Standard evaluation |
| P95 | < 50ms | Standard evaluation |
| P99 | < 100ms | Standard evaluation |
| P99.9 | < 500ms | Complex evaluation or high load |
Measurement: Time from Intent submission to Judgment return.
Exclusions:
- External evidence collection (customer-provided sources)
- Escalation routing (by design, waits for human)
Evidence Collection Latency
| Source Type | P99 Target |
|---|---|
| Built-in evidence | < 10ms |
| Customer HTTP source | < 200ms + network latency |
| Customer database source | < 100ms + query time |
Throughput SLOs
Intent Processing Rate
| Tier | Guaranteed Throughput |
|---|---|
| Starter | 100 intents/second |
| Growth | 1,000 intents/second |
| Enterprise | 10,000+ intents/second |
Burst handling: Systems can handle 5x sustained rate for 60-second bursts.
Escalation Processing
| Metric | Target |
|---|---|
| Escalation routing | < 1 second |
| Notification delivery | < 5 seconds |
| Decision recording | < 1 second |
Data Durability SLOs
Audit Trail Durability
| Metric | Target |
|---|---|
| Durability | 99.999999999% (11 9s) |
| Replication | 3+ geographic regions |
| Retention | As configured (default: 7 years) |
What this means: Audit records are effectively permanent. You will not lose audit data.
Intent Data Durability
| Metric | Target |
|---|---|
| Durability | 99.999999999% (11 9s) |
| Hot storage | 90 days |
| Archive storage | 7 years |
Consistency SLOs
Judgment Consistency
| Metric | Target |
|---|---|
| Replay match rate | 100% |
| Cross-region consistency | < 100ms |
What this means: If you replay a historical Intent with its original evidence, you will get the same Judgment. Always.
Policy Propagation
| Metric | Target |
|---|---|
| Policy update propagation | < 5 seconds |
| Global consistency | < 30 seconds |
Escalation SLOs
Routing Performance
| Metric | Target |
|---|---|
| Route determination | < 100ms |
| Queue placement | < 500ms |
| Notification dispatch | < 5 seconds |
SLA Tracking
| Metric | Target |
|---|---|
| SLA timer accuracy | ± 1 second |
| Breach detection | < 5 seconds |
| Re-escalation | < 30 seconds |
Monitoring
Real-Time Metrics
Access current SLO status:
const sloStatus = await deltaos.slos.current();
console.log(sloStatus);
// {
// evaluation: {
// availability: { current: 0.9998, target: 0.9995, status: 'healthy' },
// latencyP99: { current: '45ms', target: '100ms', status: 'healthy' }
// },
// audit: {
// availability: { current: 0.99995, target: 0.9999, status: 'healthy' },
// durability: { current: 1.0, target: 0.99999999999, status: 'healthy' }
// }
// }
Historical SLO Data
const sloHistory = await deltaos.slos.history({
metric: 'evaluation.latencyP99',
period: '30d',
granularity: '1h'
});
// Returns hourly P99 latency for the past 30 days
Status Page
Current SLO status is always available at status.deltaos.ai.
SLO Breaches
What Happens
When an SLO is breached:
- Detection — Automated monitoring detects the breach
- Alert — Engineering is notified immediately
- Response — Incident response begins
- Communication — Customers are notified via status page
- Resolution — Issue is resolved
- Review — Post-incident review conducted
Error Budgets
| SLO | Monthly Budget | Tracking |
|---|---|---|
| 99.95% availability | 22 minutes | Dashboard |
| P99 < 100ms | 43.2 minutes > target | Dashboard |
Customer Notification
// Configure breach notifications
await deltaos.notifications.configure({
sloBreaches: {
channels: ['email:ops@company.com', 'slack:#alerts'],
severity: ['critical', 'warning']
}
});
SLO Reporting
Monthly Reports
Enterprise customers receive monthly SLO reports:
- Actual vs target for all SLOs
- Incident summary
- Trend analysis
- Capacity projections
Export Data
const report = await deltaos.slos.report({
period: '2025-01',
format: 'pdf' // or 'json', 'csv'
});
See Also
- Guarantees — Properties that cannot be violated
- Incident Posture — What happens when things fail
- Architecture: Components — System design