Amazon S3 Glacier: The Deep Archive Deep-Dive

Amazon S3 Glacier is a suite of storage classes designed for low-cost, long-term data archiving. In the SAA-C03 exam, Glacier is usually the answer for scenarios involving “compliance,” “backups,” or “long-term retention” where immediate access is not always required.

The “Attic” Analogy: Think of S3 Standard as your kitchen pantry (items you use daily). S3 Glacier is like your attic or a professional storage unit across town. You don’t go there every day, and it takes time to drive there and retrieve boxes, but it is significantly cheaper per square foot than your kitchen.

Core Concepts & Storage Classes

AWS has evolved Glacier from a standalone service into three distinct S3 Storage Classes. Understanding the retrieval times for each is critical for the exam.

  • S3 Glacier Instant Retrieval: For long-lived data that is rarely accessed but requires millisecond retrieval when needed (e.g., medical images, news media archives).
  • S3 Glacier Flexible Retrieval: The classic “Glacier.” Suited to data accessed 1-2 times a year. Retrieval ranges from minutes to hours, depending on the retrieval tier.
  • S3 Glacier Deep Archive: The lowest-cost storage in AWS. Designed for data that might be accessed once every few years. Retrieval completes within 12 hours (Standard tier) or within 48 hours (Bulk tier).

Comparison Table: Glacier Classes

Feature              | Instant Retrieval | Flexible Retrieval | Deep Archive
---------------------|-------------------|--------------------|--------------
Retrieval Time       | Milliseconds      | 1 min – 12 hours   | 12 – 48 hours
Min Storage Duration | 90 days           | 90 days            | 180 days
Storage Cost         | Low               | Very low           | Lowest
Retrieval Cost       | High (per GB)     | Moderate           | Moderate

Security & Compliance: S3 Glacier Vault Lock

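For SAA-C03, “Compliance” usually points to S3 Glacier Vault Lock. This allows you to deploy and enforce compliance controls for individual S3 Glacier vaults with a “Write Once Read Many” (WORM) policy. Once locked, the policy becomes immutable: even the AWS Root account cannot delete the data until the retention period expires. Note that Vault Lock applies to vaults in the original Glacier service; for objects stored in the S3 Glacier storage classes, S3 Object Lock in compliance mode provides the equivalent WORM protection.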
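
To make the WORM idea concrete, here is a minimal sketch that builds a Vault Lock policy document denying archive deletion until archives reach a given age. The account ID, Region, vault name, and the 365-day retention period are placeholder assumptions for illustration; the `glacier:DeleteArchive` action and the `glacier:ArchiveAgeInDays` condition key follow the pattern AWS documents for Vault Lock policies.

```python
import json


def build_vault_lock_policy(vault_arn: str, retention_days: int) -> str:
    """Return a Vault Lock policy (JSON string) that denies deleting
    archives younger than retention_days."""
    policy = {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "deny-delete-before-retention",
                "Principal": "*",
                "Effect": "Deny",
                "Action": "glacier:DeleteArchive",
                "Resource": [vault_arn],
                "Condition": {
                    "NumericLessThan": {
                        "glacier:ArchiveAgeInDays": str(retention_days)
                    }
                },
            }
        ],
    }
    return json.dumps(policy, indent=2)


# Placeholder ARN: the account ID, Region, and vault name are hypothetical.
EXAMPLE_ARN = "arn:aws:glacier:us-east-1:111122223333:vaults/example-vault"
policy_json = build_vault_lock_policy(EXAMPLE_ARN, retention_days=365)
print(policy_json)
```

With boto3, a document like this would typically be submitted through the Glacier client's `initiate_vault_lock` call and then confirmed with `complete_vault_lock`. AWS leaves a 24-hour window between the two steps so the policy can be tested before it becomes permanent.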

Decision Matrix / If–Then Guide

  • If the requirement is “lowest possible cost” and a wait of 12 to 48 hours is acceptable ➜ S3 Glacier Deep Archive.
  • If you need “rare access” but “immediate availability” ➜ S3 Glacier Instant Retrieval.
  • If you need to meet “regulatory compliance” (WORM) ➜ S3 Glacier Vault Lock.
  • If you need to automate moving data to Glacier ➜ S3 Lifecycle Policies.
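
The if-then guide above can be sketched as a small function. This is only a study aid, not an AWS API: the function name and parameters are hypothetical, the 12-hour threshold is an assumption taken from the lower end of the Deep Archive retrieval window, and the Flexible Retrieval fallback is an assumption for cases the guide does not list.

```python
def pick_archive_option(needs_worm: bool, needs_instant_access: bool,
                        max_wait_hours: float) -> str:
    """Map the if-then guide onto a single recommendation string."""
    if needs_worm:
        # Regulatory compliance (WORM) requirement.
        return "S3 Glacier Vault Lock"
    if needs_instant_access:
        # Rarely accessed, but must be available in milliseconds.
        return "S3 Glacier Instant Retrieval"
    if max_wait_hours >= 12:
        # Lowest cost, and a long wait is acceptable (assumed threshold).
        return "S3 Glacier Deep Archive"
    # Assumed fallback: retrieval in minutes to hours.
    return "S3 Glacier Flexible Retrieval"


print(pick_archive_option(False, False, max_wait_hours=24))
print(pick_archive_option(False, True, max_wait_hours=0))
```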

Exam Tips and Gotchas

  • Minimum Billable Duration: If you delete an object from Deep Archive after 30 days, you are still billed for the remaining 150 days.
  • Retrieval Tiers: Flexible Retrieval has three tiers: Expedited (1-5 mins), Standard (3-5 hours), and Bulk (5-12 hours). Note: Expedited is NOT available for Deep Archive.
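  • Lifecycle Transitions: Lifecycle rules can move objects from warmer S3 classes down to Glacier, but they cannot transition “upwards” (e.g., Glacier to S3 Standard). To read archived data you must “restore” a temporary copy; to keep it in a warmer class permanently, copy the restored object.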
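  • Object Size: S3 Glacier Instant Retrieval has a minimum billable object size of 128 KB, while Flexible Retrieval and Deep Archive add about 40 KB of billable metadata overhead to every object. Either way, small files are better bundled into ZIP/TAR archives before uploading to save costs.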
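
The minimum billable duration rule is easy to check with a little arithmetic. The sketch below uses the minimum storage durations from the comparison table; the function names are illustrative, and the dictionary keys are the storage class identifiers S3 uses in its API.

```python
# Minimum storage durations in days, as listed in the comparison table.
MIN_STORAGE_DAYS = {
    "GLACIER_IR": 90,     # Instant Retrieval
    "GLACIER": 90,        # Flexible Retrieval
    "DEEP_ARCHIVE": 180,  # Deep Archive
}


def billed_days(storage_class: str, days_stored: int) -> int:
    """Days of storage charged when an object is deleted after days_stored."""
    return max(days_stored, MIN_STORAGE_DAYS[storage_class])


def early_delete_days(storage_class: str, days_stored: int) -> int:
    """Extra days charged because the object was deleted early."""
    return billed_days(storage_class, days_stored) - days_stored


print(early_delete_days("DEEP_ARCHIVE", 30))  # 150
print(early_delete_days("GLACIER", 120))      # 0
```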

Topics Covered

Summary of key subtopics covered in this guide:

  • Difference between Instant, Flexible, and Deep Archive storage classes.
  • Retrieval timeframes (Expedited vs. Standard vs. Bulk).
  • Cost optimization via Minimum Storage Duration awareness.
  • Security through Vault Lock and WORM policies.
  • Architectural patterns using S3 Lifecycle Management.

Amazon S3 Glacier Ecosystem

[Diagram labels: S3 Standard, Lifecycle Policy, Instant Retrieval (ms), Flexible Retrieval (minutes/hours), Deep Archive (12-48 h), Vault Lock]

SERVICE INTEGRATION

S3 Lifecycle

Automate the transition of objects to Glacier based on age. Define rules to expire (delete) data after a set period to minimize costs.
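
As a rough illustration, the sketch below builds a lifecycle configuration with two transitions and an expiration. The helper name, the `logs/` prefix, the bucket name, and the day counts (90, 365, 2555) are all hypothetical; the dictionary layout follows the structure accepted by the S3 `PutBucketLifecycleConfiguration` API.

```python
import json


def build_lifecycle_rule(prefix: str, glacier_after_days: int,
                         deep_archive_after_days: int,
                         expire_after_days: int) -> dict:
    """Build one lifecycle rule: Flexible Retrieval, then Deep Archive, then delete."""
    if not glacier_after_days < deep_archive_after_days < expire_after_days:
        raise ValueError("days must increase: transition, transition, expire")
    return {
        "ID": f"archive-{prefix.strip('/') or 'all'}",
        "Filter": {"Prefix": prefix},
        "Status": "Enabled",
        "Transitions": [
            {"Days": glacier_after_days, "StorageClass": "GLACIER"},
            {"Days": deep_archive_after_days, "StorageClass": "DEEP_ARCHIVE"},
        ],
        "Expiration": {"Days": expire_after_days},
    }


# Hypothetical values: archive after 90 days, deep archive after 1 year,
# delete after roughly 7 years.
lifecycle_config = {"Rules": [build_lifecycle_rule("logs/", 90, 365, 2555)]}
print(json.dumps(lifecycle_config, indent=2))

# With boto3 installed and credentials configured, this could be applied via:
#   boto3.client("s3").put_bucket_lifecycle_configuration(
#       Bucket="example-bucket", LifecycleConfiguration=lifecycle_config)
```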

PERFORMANCE

Retrieval Tiers

Expedited: 1-5 mins ($$)
Standard: 3-5 hours ($)
Bulk: 5-12 hours (Free/Low Cost)
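
A restore request names one of these tiers. The sketch below builds the request payload and rejects tier and storage class combinations that are not offered, such as Expedited for Deep Archive. The helper name, bucket, key, and 7-day copy lifetime are assumptions; the payload shape matches the `RestoreRequest` parameter of the S3 `RestoreObject` API.

```python
# Retrieval tiers offered per storage class (Deep Archive has no Expedited tier).
ALLOWED_TIERS = {
    "GLACIER": {"Expedited", "Standard", "Bulk"},  # Flexible Retrieval
    "DEEP_ARCHIVE": {"Standard", "Bulk"},
}


def build_restore_request(storage_class: str, tier: str, keep_days: int) -> dict:
    """Build the RestoreRequest payload for a temporary restored copy."""
    allowed = ALLOWED_TIERS.get(storage_class)
    if allowed is None:
        raise ValueError(f"no restore tiers defined for {storage_class}")
    if tier not in allowed:
        raise ValueError(f"{tier} is not available for {storage_class}")
    # Days is how long the temporary copy stays available after the restore.
    return {"Days": keep_days, "GlacierJobParameters": {"Tier": tier}}


request = build_restore_request("GLACIER", "Bulk", keep_days=7)
print(request)

# With boto3 installed and credentials configured, this could be sent via:
#   boto3.client("s3").restore_object(
#       Bucket="example-bucket", Key="example-key", RestoreRequest=request)
```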

COST OPTIMIZATION

Storage Savings

Deep Archive is ~95% cheaper than S3 Standard. Ideal for magnetic tape replacement and regulatory data logs.

Production Use Case: A healthcare provider stores patient X-rays in S3 Standard for 30 days. After 30 days, a Lifecycle Policy moves them to S3 Glacier Instant Retrieval for 7 years to meet legal requirements while allowing doctors to pull records instantly if a patient returns.
