Chapter 5: Deep-Dive into Compute

EC2, Containers, Serverless & Pricing Strategies

Amazon EC2 Instance Families

General Purpose (T, M)
Balanced CPU/RAM. Best for web servers, small DBs, and dev environments.

Compute Optimized (C)
High-performance processors. Best for batch processing, scientific modeling, video encoding.

Memory Optimized (R, X, z)
Fast performance for large datasets in memory. Best for high-perf DBs and real-time analytics.

Storage Optimized (I, D, H)
High sequential read/write throughput to large datasets on local storage. Best for NoSQL DBs, data warehousing, and Hadoop.

Accelerated Computing (P, G, Inf, F)
Hardware accelerators (GPUs, AWS Inferentia chips, FPGAs). Best for machine learning and graphics rendering.

Burstable (T2/T3)
Part of the General Purpose T family. Provides a baseline level of CPU performance and can burst above it by spending accrued CPU credits.
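
The credit mechanism can be sketched as simple bookkeeping. One CPU credit equals one vCPU running at 100% for one minute. The earn rate, balance cap, and utilization figures below are illustrative assumptions rather than the specification of any particular instance type, and the class name is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class BurstableInstance:
    vcpus: int                # number of vCPUs on the instance
    credits_per_hour: float   # credits earned per hour (assumed value)
    max_balance: float        # cap on accrued credits (assumed value)
    balance: float = 0.0

    def run_hour(self, utilization: float) -> float:
        """Advance one hour at an average per-vCPU utilization (0.0 to 1.0)."""
        # One CPU credit = one vCPU at 100% utilization for one minute.
        spent = self.vcpus * utilization * 60
        new_balance = self.balance + self.credits_per_hour - spent
        # The balance is capped above and, in this simplified model, floored at zero.
        self.balance = max(0.0, min(self.max_balance, new_balance))
        return self.balance


inst = BurstableInstance(vcpus=2, credits_per_hour=12, max_balance=288)
for _ in range(10):
    inst.run_hour(0.05)        # below baseline: credits accumulate
idle_balance = inst.balance
inst.run_hour(0.50)            # above baseline: credits drain
print(idle_balance, inst.balance)
```

With these assumed numbers the break-even (baseline) utilization is 12 / (2 × 60) = 10% per vCPU: running below it accrues credits, running above it spends them.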

Feature	Amazon ECS	Amazon EKS	AWS Fargate
Model	AWS-native orchestration	Managed Kubernetes	Serverless compute engine (runs ECS and EKS workloads)
Complexity	Low (Easier to set up)	High (K8s expertise needed)	Minimal (No infra management)
Control	Deep AWS Integration	Open-source standard/Portable	Abstracted Infrastructure
Use Case	AWS-centric applications	Multi-cloud/K8s workflows	Event-driven/Unpredictable loads

API Gateway: Managed entry point for REST, HTTP, and WebSocket APIs; handles authorization, throttling, and scaling.
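
As a rough illustration of what sits behind such an entry point, here is a minimal sketch of a Lambda handler, assuming API Gateway is configured with a Lambda proxy integration. The greeting logic and the `name` query parameter are hypothetical. The response shape (`statusCode`, `headers`, `body` as a string) is what the proxy integration expects.

```python
import json


def handler(event, context):
    """Handle a request forwarded by an API Gateway Lambda proxy integration."""
    # queryStringParameters is None when the request carries no query string.
    params = event.get("queryStringParameters") or {}
    name = params.get("name", "world")
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"message": f"Hello, {name}!"}),  # body must be a string
    }


# Local invocation with a hand-built event; no AWS account is needed.
fake_event = {"queryStringParameters": {"name": "compute"}}
response = handler(fake_event, None)
print(response["statusCode"], response["body"])
```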

DynamoDB: Managed NoSQL key-value and document database; single-digit millisecond latency; scales automatically.

Step Functions: Visual workflows (state machines) that coordinate Lambda functions and other AWS services.
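
A state machine is defined in Amazon States Language, which is JSON. The sketch below builds a two-step definition as a Python dictionary and serializes it. The state names, the workflow itself, and the Lambda ARNs (including the account ID) are hypothetical placeholders.

```python
import json

# Hypothetical Lambda ARNs; substitute real function ARNs before deploying.
VALIDATE_ARN = "arn:aws:lambda:us-east-1:123456789012:function:validate-order"
CHARGE_ARN = "arn:aws:lambda:us-east-1:123456789012:function:charge-card"

state_machine = {
    "Comment": "Validate an order, then charge the card",
    "StartAt": "ValidateOrder",
    "States": {
        "ValidateOrder": {
            "Type": "Task",
            "Resource": VALIDATE_ARN,
            "Next": "ChargeCard",
        },
        "ChargeCard": {
            "Type": "Task",
            "Resource": CHARGE_ARN,
            # Retry failed attempts with exponential backoff.
            "Retry": [{
                "ErrorEquals": ["States.TaskFailed"],
                "IntervalSeconds": 2,
                "MaxAttempts": 3,
                "BackoffRate": 2.0,
            }],
            "End": True,
        },
    },
}

definition = json.dumps(state_machine, indent=2)
print(definition)
```

The resulting JSON string is the kind of definition that would be supplied when creating the state machine, for example through the console or an infrastructure template.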

SAM: AWS Serverless Application Model; a template-based framework (YAML or JSON, built on CloudFormation) for deploying serverless stacks.

Vertical Scaling: Upgrading the instance size (More RAM/CPU on one box).

Horizontal Scaling: Adding more instances (Auto Scaling Groups). Preferred for high availability.
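
To make horizontal scaling concrete, the sketch below estimates how many instances a group would need to bring an average metric back to a target value. This is a simplified proportional rule, not the exact algorithm Auto Scaling uses. The function name and all numbers are assumptions for illustration.

```python
import math


def desired_capacity(current: int, metric: float, target: float,
                     min_size: int, max_size: int) -> int:
    """Estimate instance count so the average metric approaches the target."""
    if current <= 0 or target <= 0:
        raise ValueError("current and target must be positive")
    # Total load is roughly current * metric; spread it so each instance sits at target.
    raw = math.ceil(current * metric / target)
    # Clamp to the group's configured minimum and maximum size.
    return max(min_size, min(max_size, raw))


print(desired_capacity(4, 80, 50, min_size=2, max_size=10))  # scale out
print(desired_capacity(4, 20, 50, min_size=2, max_size=10))  # scale in, held at minimum
print(desired_capacity(8, 90, 50, min_size=2, max_size=10))  # capped at maximum
```

Keeping `min_size` at two or more, spread across Availability Zones, is what gives the horizontal approach its availability advantage over a single larger instance.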
            

Compute Savings Plan: Most flexible. Applies to EC2, Fargate, and Lambda usage regardless of Region, instance family, or size. Discount of up to 66%.
EC2 Instance Savings Plan: Tied to one instance family in one Region. Higher discount (up to 72%).
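
The trade-off between the two plans can be worked through as simple arithmetic. The sketch below assumes a hypothetical on-demand rate of $0.10 per hour and applies the maximum published discounts (up to 66% for Compute Savings Plans, up to 72% for EC2 Instance Savings Plans). Real discounts vary with term length, payment option, and instance type, so treat the output as an upper bound on savings.

```python
HOURS_PER_MONTH = 730  # common approximation: 8,760 hours / 12 months


def monthly_cost(on_demand_hourly: float, discount: float,
                 hours: int = HOURS_PER_MONTH) -> float:
    """Monthly cost for one always-on instance at a given fractional discount."""
    if not 0.0 <= discount < 1.0:
        raise ValueError("discount must be in [0, 1)")
    return on_demand_hourly * (1.0 - discount) * hours


RATE = 0.10  # hypothetical on-demand price in USD per hour

on_demand = monthly_cost(RATE, 0.0)
compute_sp = monthly_cost(RATE, 0.66)    # best case, Compute Savings Plan
instance_sp = monthly_cost(RATE, 0.72)   # best case, EC2 Instance Savings Plan

for label, cost in [("On-Demand", on_demand),
                    ("Compute Savings Plan", compute_sp),
                    ("EC2 Instance Savings Plan", instance_sp)]:
    print(f"{label}: ${cost:.2f}/month")
```

The extra discount on the EC2 Instance plan is the price of flexibility: it only pays off if the workload stays on the same instance family in the same Region for the whole commitment.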