Server Uptime, Data Backup, and Disaster Recovery
Server Uptime
MyQ Roger targets 99.9% uptime. This figure is a target rather than a guarantee: service level agreements (SLAs) are tailored per customer and do not explicitly guarantee it. To keep the service available, we use high-availability (HA) strategies built on multi-zone deployments and multi-pod architectures, which maintain service continuity and limit downtime when operational disruptions occur.
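As a rough illustration of what a 99.9% target means in practice, the sketch below converts an availability percentage into a downtime budget. The function name `downtime_budget_minutes` and the 30-day and 365-day periods are assumptions made for this example; the measurement window that actually applies is defined in each customer's SLA.

```python
def downtime_budget_minutes(availability_pct: float, period_days: float) -> float:
    """Return the maximum downtime, in minutes, that an availability target allows."""
    if not 0 <= availability_pct <= 100:
        raise ValueError("availability_pct must be between 0 and 100")
    total_minutes = period_days * 24 * 60
    # The downtime budget is the fraction of the period NOT covered by the target.
    return total_minutes * (1 - availability_pct / 100)


if __name__ == "__main__":
    # Illustrative periods only (assumption): a 30-day month and a 365-day year.
    for label, days in (("30-day month", 30), ("365-day year", 365)):
        minutes = downtime_budget_minutes(99.9, days)
        print(f"{label}: {minutes:.1f} minutes")
```

Under these assumptions, a 99.9% target corresponds to about 43.2 minutes of downtime per 30-day month and about 525.6 minutes (roughly 8.8 hours) per 365-day year.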
Real-Time System Status Monitoring
We maintain a dedicated status site that reports service health in real time. Customers can use it to:
Monitor Live System Status: View real-time operational metrics and incident updates.
Receive Proactive Notifications: Subscribe to alerts for service degradation or planned maintenance events.
Access Historical Uptime Reports: Review past incidents and performance trends to assess reliability.
The status site gives customers a single source of timely, accurate information, so they can plan and respond early when disruptions occur.
Backup Strategies
Azure SQL Backup Strategy
We employ a geo-redundant backup strategy for Azure SQL databases, ensuring data resilience and recoverability. Key aspects include:
Geo-Redundant Storage: Backups are replicated across multiple regions for disaster recovery.
Retention Policy: Backups are retained for several months, allowing historical data recovery.
Incremental Backups: Incremental backups provide a 14-day point-in-time recovery (PITR) window, so lost data can be restored to a chosen moment within the last 14 days.
SLA-Based Customization: Retention periods and recovery time objectives (RTO) vary based on cluster-specific SLAs and can be tailored to meet customer needs.
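The 14-day PITR window can be pictured with a small check that decides whether a requested restore time falls inside the window or needs an older retained backup. This is a minimal sketch, not how Azure SQL evaluates restore points: the function `can_restore_with_pitr` and the constant `PITR_WINDOW` are hypothetical names, and the actual earliest restore point is determined by the database service and the applicable SLA.

```python
from datetime import datetime, timedelta, timezone

# Assumption: the 14-day window stated in the text; SLAs may set a different value.
PITR_WINDOW = timedelta(days=14)


def can_restore_with_pitr(
    target: datetime, now: datetime, window: timedelta = PITR_WINDOW
) -> bool:
    """Return True if `target` lies inside the PITR window that ends at `now`."""
    if target.tzinfo is None or now.tzinfo is None:
        raise ValueError("use timezone-aware datetimes")
    # A restore point must not be in the future and must not predate the window.
    return now - window <= target <= now


if __name__ == "__main__":
    now = datetime(2024, 6, 15, 12, 0, tzinfo=timezone.utc)  # illustrative date
    for days_ago in (10, 20):
        target = now - timedelta(days=days_ago)
        source = "PITR" if can_restore_with_pitr(target, now) else "retained backup"
        print(f"{days_ago} days ago: restore from {source}")
```

In this sketch, a restore point 10 days in the past falls inside the window, while one 20 days in the past would have to come from a longer-retained backup.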
Azure Cosmos DB Backup Strategy
For Azure Cosmos DB, we implement comprehensive data protection mechanisms, particularly for accounting records, ensuring data integrity and traceability:
Full Accounting Data Reports: All accounting transactions, including job accounting for print, scan, copy, and fax activities, are fully recorded.
Historical Data Retracing: Our system allows tracking and retracing of any reported accounting records to ensure compliance and auditability.
High Availability and Redundancy: Built-in multi-region replication ensures continuity and minimizes data loss risks.
By implementing these robust backup strategies, we safeguard critical business data while providing flexible recovery options tailored to customer-specific SLAs.
Disaster Scenarios & Response Plan
| Disaster Scenario | Mitigation Strategy | Recovery Process |
|---|---|---|
| Data Corruption | Point-in-time recovery (PITR) | Restore the database to the last known good state |
| Cloud Region Outage | Geo-redundant backups and failover | Switch to the secondary Azure region |
| Kubernetes Node Failure | Auto-replication of services in Azure Kubernetes Service (AKS) | Pods are rescheduled to healthy nodes |
| Cyberattack (DDoS, Ransomware, etc.) | Web application firewall (WAF), Azure DDoS Protection, and security monitoring | Isolate compromised systems and restore from clean backups |