Deployment architecture

Modified on Fri, 5 Sep at 11:07 AM

This article provides a comprehensive overview of the platform's cloud native deployment architecture that incorporates best practices for high availability, scalability, data security and comprehensive networking strategy.

Deployment Architecture

The architecture presented is a multi-layered, cloud-native design that seamlessly integrates various services to create a resilient and scalable system. Let's break down each component to understand how they function together.

Core Infrastructure

At the heart of this design is a Virtual Private Cloud (VPC) that provides network isolation and security. Within this VPC lies a compute layer structured for maximum reliability:

Design ready for High availability and redundancy: Today, the system employs two separate availability zones (Zone A and Zone B) with relevant compute instances. This architecture is fundamentally ready for high availability and redundancy in the future ensuring that the application continues to function even if an entire zone experiences an outage.
1. Note: Each availability zone will have compute engine instances, creating redundancy which will eliminate single points of failure and allows for load balancing.
NAT Instance: The NAT instance in the public subnet allows the private instances to initiate outbound internet traffic and other managed services workloads leveraged by the Cubyts architecture.

User Access and Load Distribution

The architecture implements intelligent access management:

Cloud Load Balancer: Acts as the entry point for user traffic, distributing incoming requests evenly across available compute instances. This not only improves performance but also enhances stability during high-traffic periods.
Internet Connectivity: A secure connection to the internet allows for external service integration while maintaining security protocols.

Orchestration Layer

The orchestration layer provides automation and coordination capabilities:

Scheduler Component: Manages automated tasks, workflow orchestration, and scheduled activities, reducing manual intervention and ensuring consistent operations.

Data Management

The architecture employs a multi-database strategy to handle different data requirements:

SQL Database: Handles structured data that requires relational integrity and complex queries.
NoSQL Database: Provides document-oriented storage for semi-structured and unstructured data, offering flexibility and scalability.
Cloud Storage: Manages large files, backups, media assets, and other unstructured data that doesn't fit neatly into database paradigms.

Communication and Messaging

Modern applications require robust messaging capabilities:

Pub-Sub: Implements a publish-subscribe pattern for asynchronous communication between system components.
SMTP: Handles email communications for notifications, alerts, and user interactions.
Functions: Both subscription and email processes are handled by dedicated serverless functions, allowing for event-driven architecture and automatic scaling.

AI and ML Integration

The architecture embraces artificial intelligence capabilities:

Google Vertex AI: Provides machine learning capabilities for advanced analytics, predictions, and intelligent features without requiring extensive ML infrastructure management.

Monitoring and Observability

A comprehensive observability stack ensures system health:

Datadog Integration: Provides monitoring, logging, and observability tools to track system performance, identify issues before they impact users, and ensure compliance with service level objectives.

Key Benefits of this Architecture

This cloud-native architecture delivers several significant advantages:

High Availability: With multiple availability zones and redundant components, the system can withstand individual component failures without downtime.
Security: The VPC provides network isolation, with additional security measures possible at each layer.
Flexibility: The multi-database approach and serverless components allow for choosing the right tool for each specific requirement.
Future-Proof: AI integration capabilities prepare the system for advanced analytics and intelligent features.

Security considerations for the Deployment Architecture

This chapter outlines the security measures implemented in the deployment architecture. Our deployment architecture incorporates multiple security layers and controls to protect data and services at every level. The following sections detail the security measures implemented throughout each component of the architecture.

Core Infrastructure Security

At the heart of the design is a Virtual Private Cloud (VPC) that provides critical network isolation and security boundary controls:

Network Segmentation: The VPC establishes a clear security perimeter with controlled ingress/egress points, enabling comprehensive network traffic monitoring and filtering.
Multi-Zone Security Strategy: Two separate availability zones (Zone A and Zone B), are implemented not only for reliability but as an important security control that prevents single-zone security compromises from affecting the entire system.
Compute Instance Isolation: Each availability zone contains compute engine instances (for various workloads) which are completely isolated from the external world thanks to security groups and well configured access controls.

Perimeter Security and Access Controls

The architecture implements multiple layers of access controls to protect the system perimeter:

Cloud Load Balancer: We've implemented a secure load balancer as our first line of defense against attacks, with properly configured TLS & HTTP security headers. Our implementation includes request filtering, rate limiting, and anomaly detection to prevent common web attacks.
Internet Gateway Controls: All connections to the internet are secured through our comprehensive egress filtering, outbound connection monitoring, and appropriate security protocols to prevent data exfiltration and command-and-control attacks.

User Access and Load Distribution

The architecture implements intelligent access management:

Cloud Load Balancer: Acts as the entry point for user traffic, distributing incoming requests evenly across available compute instances. This not only improves performance but also enhances stability during high-traffic periods.
Internet Connectivity: A secure connection to the internet allows for external service integration while maintaining security protocols.

Orchestration Layer Security

The orchestration layer includes specific security controls to protect automated processes:

Scheduler Component Authentication: Strong authentication mechanisms and least privilege access controls for our scheduler component that manages automated tasks and workflows. All credentials are securely managed through our enterprise secrets management system.
Orchestration Audit Logging: Comprehensive logging of all orchestration activities is enabled, particularly for changes to scheduled tasks, with log data centrally collected and monitored for suspicious activities.

Data Security Controls

The architecture employs a multi-database strategy to handle different data requirements with specific security controls for each data store:

SQL Database Security: Structured data is protected with encryption at rest and in transit, robust authentication controls, comprehensive privilege management, prepared statements to prevent SQL injection, and continuous database activity monitoring. The database is fully secured behind the VPC.
NoSQL Database Security: NoSQL database is secured with authentication enabled by default, role-based access controls, network security controls to restrict access, and proper encryption for sensitive data fields. The database is fully secured behind the VPC.
Cloud Storage Security: All storage buckets/containers have properly configured access policies, sensitive data is encrypted, access credentials are securely managed, object lifecycle policies are in place, and access logging is enabled for all operations.

AI and ML Security Implementation

All AI components in the deployment architecture include specialized security measures to address unique challenges:

Google Vertex AI Data Protection: All sensitive data used for model training and inference is properly protected, with controls for data minimization, anonymization where appropriate, and secure transfer between components.
AI Ethics and Compliance: Our governance framework ensures AI outputs comply with regulatory requirements and ethical standards, particularly for decisions affecting users. Regular reviews ensure fair and unbiased operation.