Reliability and Security Principles
At Voltade, software reliability and security are critical. Our products and services are used globally, often by users’ customers. This means our services must be reliable, fast, and secure at all times. The following principles outline how we ensure this.
Secured Perimeter
Ensuring secure interfaces for users, protecting exposed endpoints, closing unused access points, and enforcing security measures such as 2FA and access control.
Availability Monitoring
Continuous monitoring of our systems using third-party services like Better Uptime and independent checks to ensure the systems are responsive 24/7.
Alert Systems
A systematic alerting process through tools like OpsGenie and PagerDuty ensures on-call engineers can respond to issues promptly while keeping users informed.
Service Recovery
Reverting to stable versions of code or infrastructure when issues arise, with rigorous testing to avoid exacerbating problems.
User Feedback
We maintain channels for direct user feedback to catch issues beyond technical failures, such as API failures or interface problems.
Logging
Comprehensive logging on both client and server sides, using tools like Google Analytics, Sentry, and AWS CloudWatch to track errors and issues.
Decoupled End-to-End Checks
Independent systems verify key outcomes, ensuring bugs in the main system don’t affect checks.
Isolated Services
Reducing dependencies among services to minimize cascading failures, ensuring the resilience of APIs and app servers.
Redundant Infrastructure
Multiple stateless servers and failover plans ensure service continuity even if individual components or entire systems fail.
Backups
Backups are in place to prevent data loss, with strategies to minimize Recovery Time Objective (RTO) and Recovery Point Objective (RPO).
Test Setup
Automated testing environments are critical to ensuring systems work as expected, with integration and unit tests covering key functionalities.
DDOS Protection & Caching
We use CDNs like Cloudflare to handle large-scale traffic attacks and filter out malicious requests while ensuring legitimate users can still access our services.
Incremental Rollout System
New features and updates are gradually rolled out to small groups of users to gather feedback and catch issues early before a full release.
Updated: May 2023