Amazon S3 Glacier: The Deep Archive Deep-Dive
Amazon S3 Glacier is a suite of storage classes designed for low-cost, long-term data archiving. In the SAA-C03 exam, Glacier is the primary answer for any scenario involving “compliance,” “backups,” or “long-term retention” where immediate access is not always required.
Core Concepts & Storage Classes
AWS has evolved Glacier from a standalone service into three distinct S3 Storage Classes. Understanding the retrieval times for each is critical for the exam.
- S3 Glacier Instant Retrieval: For long-lived data that is rarely accessed but requires millisecond retrieval when needed (for example, medical images or news media archives).
- S3 Glacier Flexible Retrieval: The classic “Glacier.” Data is accessed 1-2 times a year. Retrieval ranges from minutes to hours.
- S3 Glacier Deep Archive: The lowest-cost storage in AWS. Designed for data that might be accessed once every few years. Retrieval completes within 12 hours (Standard tier) or within 48 hours (Bulk tier).
Comparison Table: Glacier Classes
| Feature | Instant Retrieval | Flexible Retrieval | Deep Archive |
|---|---|---|---|
| Retrieval Time | Milliseconds | 1 min – 12 hours | 12 – 48 hours |
| Min Storage Duration | 90 Days | 90 Days | 180 Days |
| Cost (Storage) | Low | Very Low | Lowest |
| Cost (Retrieval) | High (per GB) | Moderate | Moderate |
Security & Compliance: S3 Glacier Vault Lock
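For SAA-C03, “Compliance” usually points to S3 Glacier Vault Lock. This allows you to deploy and enforce compliance controls for individual S3 Glacier vaults with a “Write Once Read Many” (WORM) policy. Once locked, the policy becomes immutable: it can no longer be changed or removed, so even the AWS account root user cannot delete the data until the retention period expires. Vault Lock applies to vaults in the original Glacier service; for objects stored in the S3 Glacier storage classes, the equivalent WORM control is S3 Object Lock.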
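As a rough illustration of what such a WORM policy can look like, the sketch below builds a Vault Lock policy document that denies `glacier:DeleteArchive` until an archive reaches a given age. The account ID, vault name, `Sid`, and the 365-day retention period are hypothetical values chosen for the example; the function name is not part of any AWS SDK.

```python
import json


def build_vault_lock_policy(vault_arn: str, retention_days: int) -> str:
    """Return a Vault Lock policy (JSON) that denies archive deletion
    until each archive is at least `retention_days` old."""
    policy = {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "deny-delete-before-retention",  # hypothetical label
                "Principal": "*",
                "Effect": "Deny",
                "Action": "glacier:DeleteArchive",
                "Resource": [vault_arn],
                "Condition": {
                    # Archive age is compared against the retention period.
                    "NumericLessThan": {
                        "glacier:ArchiveAgeInDays": str(retention_days)
                    }
                },
            }
        ],
    }
    return json.dumps(policy, indent=2)


# Hypothetical account ID and vault name, for illustration only.
EXAMPLE_ARN = "arn:aws:glacier:us-east-1:123456789012:vaults/example-vault"
print(build_vault_lock_policy(EXAMPLE_ARN, 365))
```

The policy only becomes immutable after the lock is completed, so a document like this would normally be reviewed carefully before that step.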
Decision Matrix / If–Then Guide
- If the requirement is “lowest possible cost” and a retrieval wait of 12 to 48 hours is acceptable ➜ S3 Glacier Deep Archive.
- If you need “rare access” but “immediate availability” ➜ S3 Glacier Instant Retrieval.
- If you need to meet “regulatory compliance” (WORM) ➜ S3 Glacier Vault Lock.
- If you need to automate moving data to Glacier ➜ S3 Lifecycle Policies.
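The storage-class part of this guide can be sketched as a small function. This is a simplification for study purposes, not an AWS API: the function name is hypothetical, and the thresholds are assumptions taken from the upper bounds quoted in this guide (5 minutes for Flexible Retrieval Expedited, 12 hours for Deep Archive Standard). The returned strings are the S3 storage class identifiers.

```python
def pick_glacier_class(max_wait_minutes: float) -> str:
    """Pick the cheapest Glacier storage class whose fastest retrieval
    tier still fits within the acceptable wait (in minutes)."""
    if max_wait_minutes >= 12 * 60:
        # A wait of 12 hours or more fits Deep Archive retrievals.
        return "DEEP_ARCHIVE"
    if max_wait_minutes >= 5:
        # Minutes to hours fits Flexible Retrieval (Expedited or slower).
        return "GLACIER"
    # Anything tighter needs millisecond access.
    return "GLACIER_IR"


for wait in (0, 60, 24 * 60):
    print(wait, "minutes ->", pick_glacier_class(wait))
```

Compliance (WORM) and automation (Lifecycle) are separate decisions that apply on top of whichever class is chosen.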
Exam Tips and Gotchas
- Minimum Billable Duration: If you delete an object from Deep Archive after 30 days, you are still billed for the remaining 150 days.
- Retrieval Tiers: Flexible Retrieval has three tiers: Expedited (1-5 mins), Standard (3-5 hours), and Bulk (5-12 hours). Note: Expedited is NOT available for Deep Archive.
- Lifecycle Transitions: You can transition from any S3 class to Glacier, but you cannot transition “upwards” (e.g., Glacier to S3 Standard) via policy; you must “restore” a temporary copy.
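- Object Size: S3 Glacier Instant Retrieval has a minimum billable object size of 128 KB. Flexible Retrieval and Deep Archive instead add roughly 40 KB of billed metadata overhead per object. In both cases, small files are better bundled into ZIP/TAR archives before uploading to save costs.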
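The minimum billable duration rule comes down to simple arithmetic. The sketch below uses the minimum storage durations from the comparison table; the function and dictionary names are hypothetical, and the keys are the S3 storage class identifiers.

```python
# Minimum storage duration in days, per the comparison table in this guide.
MIN_STORAGE_DAYS = {
    "GLACIER_IR": 90,     # Instant Retrieval
    "GLACIER": 90,        # Flexible Retrieval
    "DEEP_ARCHIVE": 180,  # Deep Archive
}


def remaining_billable_days(storage_class: str, days_stored: int) -> int:
    """Days still billed if an object is deleted after `days_stored` days."""
    minimum = MIN_STORAGE_DAYS[storage_class]
    return max(0, minimum - days_stored)


# Deleting from Deep Archive after 30 days leaves 150 days still billed.
print(remaining_billable_days("DEEP_ARCHIVE", 30))
```

Once an object has been stored for at least the minimum duration, the function returns 0 and no early-deletion charge applies.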
Topics Covered
Summary of key subtopics covered in this guide:
- Difference between Instant, Flexible, and Deep Archive storage classes.
- Retrieval timeframes (Expedited vs. Standard vs. Bulk).
- Cost optimization via Minimum Storage Duration awareness.
- Security through Vault Lock and WORM policies.
- Architectural patterns using S3 Lifecycle Management.
Amazon S3 Glacier Ecosystem
S3 Lifecycle
Automate the transition of objects to Glacier based on age. Define rules to expire (delete) data after a set period to minimize costs.
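A lifecycle rule of this kind might look like the sketch below, which builds the configuration dictionary that boto3's `put_bucket_lifecycle_configuration` accepts. The rule ID, prefix, bucket name, and day counts are assumptions chosen for illustration, and `apply_lifecycle` is a hypothetical helper that requires boto3 and valid AWS credentials.

```python
def build_lifecycle_configuration(prefix: str,
                                  flexible_after_days: int,
                                  deep_archive_after_days: int,
                                  expire_after_days: int) -> dict:
    """Build a lifecycle configuration that archives objects by age,
    then expires (deletes) them."""
    return {
        "Rules": [
            {
                "ID": "archive-then-expire",  # hypothetical rule name
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [
                    # Move to Flexible Retrieval first, then Deep Archive.
                    {"Days": flexible_after_days, "StorageClass": "GLACIER"},
                    {"Days": deep_archive_after_days,
                     "StorageClass": "DEEP_ARCHIVE"},
                ],
                "Expiration": {"Days": expire_after_days},
            }
        ]
    }


def apply_lifecycle(bucket: str, config: dict) -> None:
    """Send the configuration to S3 (not called in this sketch)."""
    import boto3  # third-party dependency, imported only when needed
    boto3.client("s3").put_bucket_lifecycle_configuration(
        Bucket=bucket, LifecycleConfiguration=config
    )


# Example: archive logs after 90 days, deep-archive after a year,
# and delete after roughly seven years.
config = build_lifecycle_configuration("logs/", 90, 365, 2555)
print(config)
```

Keeping the dictionary construction separate from the API call makes the rule easy to inspect before it is applied to a real bucket.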
Retrieval Tiers
- Expedited: 1-5 mins ($$)
- Standard: 3-5 hours ($)
- Bulk: 5-12 hours (Free/Low Cost)
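To show how a tier is chosen in practice, the sketch below builds the `RestoreRequest` dictionary used by boto3's `restore_object` and rejects Expedited for Deep Archive. The helper name, the tier table, and the 7-day availability window are assumptions for illustration.

```python
# Retrieval tiers allowed per storage class; Expedited is not
# available for Deep Archive.
ALLOWED_TIERS = {
    "GLACIER": {"Expedited", "Standard", "Bulk"},
    "DEEP_ARCHIVE": {"Standard", "Bulk"},
}


def build_restore_request(storage_class: str, tier: str,
                          days_available: int) -> dict:
    """Build a RestoreRequest for a temporary copy of an archived object."""
    allowed = ALLOWED_TIERS.get(storage_class)
    if allowed is None:
        raise ValueError(f"{storage_class} does not require a restore")
    if tier not in allowed:
        raise ValueError(f"{tier} is not available for {storage_class}")
    return {
        "Days": days_available,  # how long the temporary copy is kept
        "GlacierJobParameters": {"Tier": tier},
    }


request = build_restore_request("GLACIER", "Bulk", 7)
print(request)

# With boto3 installed and credentials configured, the request would be
# sent like this (bucket and key are hypothetical):
# boto3.client("s3").restore_object(
#     Bucket="example-bucket", Key="archive/report.zip",
#     RestoreRequest=request)
```

The restored copy is temporary; the object itself stays in its Glacier storage class.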
Storage Savings
Deep Archive is ~95% cheaper than S3 Standard. Ideal for magnetic tape replacement and regulatory data logs.
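The savings figure can be checked with a quick calculation. The per-GB monthly prices below are assumptions for illustration only (approximate list prices for one region at the time of writing); actual prices vary by region and change over time, so check the current AWS pricing page before relying on them.

```python
def savings_percent(baseline_price: float, archive_price: float) -> float:
    """Percentage saved by storing data at `archive_price` instead of
    `baseline_price` (both in the same unit, e.g. USD per GB-month)."""
    return (1 - archive_price / baseline_price) * 100


# Assumed example prices in USD per GB-month (illustrative, not authoritative).
S3_STANDARD_PRICE = 0.023
DEEP_ARCHIVE_PRICE = 0.00099

print(f"{savings_percent(S3_STANDARD_PRICE, DEEP_ARCHIVE_PRICE):.1f}% cheaper")
```

This compares storage cost only; retrieval fees and minimum storage duration charges are not included.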