Quickstart for Evaluators
Evaluate ΔOS for your organization
Last reviewed: 2025-01-25
Quickstart for Evaluators
You're evaluating ΔOS for your organization. This guide helps you understand what ΔOS does, verify its claims, and determine fit.
Full evaluation takes 2-4 hours. This quickstart provides a structured path through the most important information.
Step 1: Understand the Problem (15 minutes)
ΔOS addresses a specific problem: AI agents taking actions without systematic governance.
Read these pages to understand the problem space:
- What is ΔOS? — Core concept
- What ΔOS Is Not — Category boundaries
Key Questions to Answer
- Do you have AI agents taking actions in production?
- Are those actions currently governed before execution?
- Do you have audit trails for agent decisions?
- Can you quantify the risk of agent actions?
If you answered "no" to any of these, ΔOS may be relevant.
Step 2: Verify Our Claims (30 minutes)
We make specific claims. Here's how to verify them:
Claim: Pre-Execution Evaluation
What we claim: Every Intent is evaluated before the associated action executes.
How to verify:
- Review Guarantees for enforcement mechanism
- Request access to our test environment
- Submit Intents and observe evaluation timing
Claim: Deterministic Decisions
What we claim: Same inputs always produce same outputs.
How to verify:
- Review LIMs architecture
- Use replay feature on historical decisions
- Compare outputs across multiple replays
Claim: Human Authority Preserved
What we claim: Humans can always override any automated judgment.
How to verify:
- Review Authority Boundaries
- Test override functionality in trial environment
- Verify kill switch availability
Step 3: Assess Fit (45 minutes)
Technical Fit
| Requirement | ΔOS Capability |
|---|---|
| Agent integration | SDK for major frameworks |
| Latency requirements | P99 < 50ms for evaluation |
| Scale requirements | 100k+ Intents/second |
| Deployment options | SaaS, VPC, On-prem |
Use Case Fit
ΔOS is designed for:
- High-stakes agent actions — Payments, data modifications, external calls
- Regulatory requirements — Audit trails, human oversight
- Risk management — Quantified value attribution
ΔOS is not designed for:
- AI safety/content filtering
- Application monitoring
- Workflow orchestration
Organizational Fit
Consider:
- Do you have teams who will define policies?
- Who will handle escalations?
- What's your risk tolerance for agent autonomy?
Step 4: Evaluate Cost/Value (30 minutes)
Pricing Model
ΔOS pricing is based on Intent volume. See Pricing for current rates.
Value Framework
We don't claim ROI. We show observable value:
- Prevented Actions — Value of blocked high-risk actions
- Audit Efficiency — Time saved on compliance reporting
- Risk Quantification — Measurable risk reduction
Read Value Attribution for methodology.
Step 5: Trial (1-2 hours)
Evidence-Only Mode
Start without blocking. Observe what ΔOS would govern:
deltaos.configure({
mode: 'evidence-only'
});
This records all Intents without affecting production behavior.
Review Dashboard
After 24-48 hours of evidence collection:
- How many Intents were submitted?
- What actions are agents taking?
- What would have been escalated?
- What would have been blocked?
Selective Governance
Enable governance for specific, low-risk actions:
deltaos.configure({
mode: 'selective',
govern: {
actions: ['notification.send'], // Start small
defaultJudgment: 'allow'
}
});
Evaluation Checklist
Technical
- Reviewed architecture documentation
- Understood integration requirements
- Verified latency meets requirements
- Confirmed deployment options work
Security
- Reviewed Guarantees
- Reviewed Authority Boundaries
- Reviewed Incident Posture
- Understood data handling
Business
- Identified use cases
- Understood pricing
- Identified stakeholders
- Assessed organizational readiness
Next Steps
If ΔOS Fits
- Complete technical trial
- Define initial policy set
- Plan integration roadmap
- Identify escalation team
If ΔOS Doesn't Fit
That's fine. ΔOS solves a specific problem. If your problem is different:
- AI safety → Guardrails, content filters
- Monitoring → Observability platforms
- Workflow → Orchestration tools
Questions?
- Technical questions: See Developer Quickstart
- Integration questions: Contact solutions team
- Security questions: Request security review documentation
See Also
- What is ΔOS? — Full introduction
- Guarantees — What we promise
- Developer Quickstart — Integration guide