Amazon Macie
Discovering & Protecting Sensitive Data (PII) in S3
The Ingredients: Data & Configuration
S3 Buckets
The primary storage location. Macie scans objects (CSV, JSON, PDF, Images) to identify potential security risks.
Managed Data Identifiers
Built-in machine learning patterns for PII (Names, SSNs, Credit Cards) and PHI (Medical records).
Custom Identifiers
Define your own Regex patterns to find proprietary data like Employee IDs or internal project codes.
The Analysis Process
Macie automatically lists all S3 buckets and evaluates public accessibility.
Sensitive data discovery jobs run on selected buckets using ML.
Findings are generated based on severity (Low, Medium, High).
Integrates with EventBridge to trigger Lambda or SNS alerts.