Architecture Overview

System Architecture

flowchart TB
    subgraph Developer["Developer Workflow"]
        DEV[Developer] -->|push code| GIT[GitHub]
        GIT -->|trigger| CI[GitHub Actions]
        CI -->|build & push| ECR[ECR Registry]
        CI -->|update| VALUES[Helm Values]
    end

    subgraph GitOps["GitOps Control Plane"]
        VALUES -->|watch| ARGO[ArgoCD]
        ARGO -->|sync| K8S[Kubernetes Cluster]
    end

    subgraph Cluster["EKS Cluster"]
        K8S --> NS_DEV[dev namespace]
        K8S --> NS_STG[staging namespace]
        K8S --> NS_PROD[production namespace]
        
        NS_PROD --> ROLLOUT[Argo Rollouts]
        ROLLOUT -->|canary| CANARY[10% traffic]
        ROLLOUT -->|stable| STABLE[90% traffic]
    end

    subgraph Observability["Observability"]
        K8S --> PROM[Prometheus]
        PROM --> GRAF[Grafana]
        PROM --> ALERT[AlertManager]
    end

    subgraph Security["Security"]
        K8S --> ESO[External Secrets Operator]
        ESO --> ASM[AWS Secrets Manager]
        K8S --> KYVER[Kyverno Policies]
    end

Component Overview

Infrastructure Layer (Terraform)

Component	Purpose	Module
VPC	Network isolation with public/private subnets	`terraform/modules/vpc`
EKS	Managed Kubernetes control plane	`terraform/modules/eks`
ArgoCD	GitOps controller bootstrap	`terraform/modules/argocd`

GitOps Layer (ArgoCD)

Component	Purpose	Location
App of Apps	Root application managing all others	`argocd/apps/`
ApplicationSets	Dynamic multi-environment generation	`argocd/applicationsets/`
Projects	RBAC and resource boundaries	`argocd/projects/`

Application Layer (Helm)

Component	Purpose	Location
Sample App	Reference application with best practices	`helm/sample-app/`

Data Flow

Code Change: Developer pushes to feature branch
CI Pipeline: GitHub Actions builds, tests, scans, pushes image
Image Promotion: CI updates Helm values with new image tag
GitOps Sync: ArgoCD detects drift, syncs to cluster
Progressive Rollout: Argo Rollouts manages canary deployment
Validation: Prometheus metrics validate deployment health
Promotion/Rollback: Automatic promotion or rollback based on metrics

Security Architecture

flowchart LR
    subgraph External["External"]
        USER[User] -->|HTTPS| ALB[AWS ALB]
    end

    subgraph Cluster["EKS Cluster"]
        ALB -->|TLS| INGRESS[Ingress Controller]
        INGRESS -->|mTLS| SVC[Service]
        SVC -->|NetworkPolicy| POD[Pod]
        
        POD -->|IRSA| AWS[AWS APIs]
        
        KYVER[Kyverno] -->|enforce| POD
        ESO[External Secrets] -->|inject| POD
    end

    subgraph Secrets["Secret Management"]
        ESO -->|fetch| ASM[AWS Secrets Manager]
    end

Disaster Recovery

Scenario	RTO	RPO	Strategy
Pod failure	< 1 min	0	Kubernetes self-healing
Node failure	< 5 min	0	Cluster Autoscaler + PodDisruptionBudgets
AZ failure	< 10 min	0	Multi-AZ deployment
Region failure	< 1 hour	< 5 min	GitOps replay to DR cluster
Cluster corruption	< 30 min	< 5 min	Velero backup restore

Cost Optimization

Karpenter: Right-sized nodes based on pod requirements
Spot Instances: Non-production workloads on spot
Resource Quotas: Prevent runaway resource consumption
HPA: Scale down during low traffic periods

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Architecture Overview

System Architecture

Component Overview

Infrastructure Layer (Terraform)

GitOps Layer (ArgoCD)

Application Layer (Helm)

Data Flow

Security Architecture

Disaster Recovery

Cost Optimization

FilesExpand file tree

architecture.md

Latest commit

History

architecture.md

File metadata and controls

Architecture Overview

System Architecture

Component Overview

Infrastructure Layer (Terraform)

GitOps Layer (ArgoCD)

Application Layer (Helm)

Data Flow

Security Architecture

Disaster Recovery

Cost Optimization