Skip to main content

AI & assistant-friendly summary

This section provides structured content for AI assistants and search engines. You can cite or summarize it when referencing this page.

Summary

import PricingHeroStats from '~/components/blog/PricingHeroStats. astro'; import PricingDimensionTable from '~/components/blog/PricingDimensionTable. astro'; import BillSurpriseCallout from '~/components/blog/BillSurpriseCallout

Key Facts

  • Macie bills two ways: bucket-level evaluation at $0
  • 10 per bucket per month, and sensitive-data discovery at $1
  • 00 per GB inspected
  • A 500 GB bucket scanned for PII costs $500 just for the discovery; multi-bucket organizations easily hit $10K+ per month
  • The discovery line is where most Macie bills land surprisingly high — $1 per GB inspected adds up fast on a multi-TB sensitive-data inventory

Entity Definitions

Lambda
Lambda is an AWS service discussed in this article.
S3
S3 is an AWS service discussed in this article.
EventBridge
EventBridge is an AWS service discussed in this article.
Athena
Athena is an AWS service discussed in this article.
compliance
compliance is a cloud computing concept discussed in this article.
HIPAA
HIPAA is a cloud computing concept discussed in this article.
GDPR
GDPR is a cloud computing concept discussed in this article.

Amazon Macie Pricing: Why Scanning a 500 GB Bucket Can Cost $500

Quick summary: Macie bills two ways: bucket-level evaluation at $0.10 per bucket per month, and sensitive-data discovery at $1.00 per GB inspected. A 500 GB bucket scanned for PII costs $500 just for the discovery; multi-bucket organizations easily hit $10K+ per month. Automated discovery is cheaper than full jobs but compounds across the bucket footprint.

Key Takeaways

  • Macie bills two ways: bucket-level evaluation at $0
  • 10 per bucket per month, and sensitive-data discovery at $1
  • 00 per GB inspected
  • A 500 GB bucket scanned for PII costs $500 just for the discovery; multi-bucket organizations easily hit $10K+ per month
  • The discovery line is where most Macie bills land surprisingly high — $1 per GB inspected adds up fast on a multi-TB sensitive-data inventory
Amazon Macie Pricing: Why Scanning a 500 GB Bucket Can Cost $500
Table of Contents

Amazon Macie is the AWS sensitive-data classification service for S3 — it inspects object content, identifies PII, PHI, credentials, and custom-defined patterns, and produces findings that drive compliance reporting and breach detection. The bill structure has two distinct dimensions: a flat per-bucket evaluation charge that scales with bucket count, and a per-GB discovery charge that scales with the actual data inspected. The discovery line is where most Macie bills land surprisingly high — $1 per GB inspected adds up fast on a multi-TB sensitive-data inventory.

This post is the bill story. For the operational architecture — finding triage, custom data identifiers, integration with Security Hub — see our Macie investigation guide. For the S3 security context, S3 security best practices covers the protective controls that Macie complements.

The Two Macie Billing Dimensions

Macie pricing breakdown — us-east-1, June 2026

Prices in us-east-1

Two main dimensions. The per-bucket line is flat; the per-GB discovery line is the variable that drives most bills.

Bucket-level evaluation

$50

Inventory + policy + encryption checks

Unit price
$0.10 / bucket / month
Example workload
500 buckets across accounts

Sensitive-data discovery — full

$500 (first scan)

50 GB free per account per month

Unit price
$1.00 / GB inspected (tiered)
Example workload
500 GB bucket scanned

Sensitive-data discovery — automated

~$500

Configurable sampling rate

Unit price
$1.00 / GB sampled
Example workload
5% sample of 10 TB

Custom data identifier evaluation

$0 extra

No per-identifier surcharge

Unit price
Included in per-GB rate
Example workload
Custom regex on PII

Findings publication

$0

Downstream services bill separately

Unit price
Free to EventBridge / Security Hub
Example workload
Standard integrations

Multi-account organization administration

Scales linearly

No org-level discount on usage

Unit price
Per-account billing
Example workload
Org with 100 accounts

The 50 GB free tier is per-account per-month — modest relief for small accounts, irrelevant at organization scale.

Why a 500 GB Bucket Costs $500 to Scan

The math is direct: 500 GB × $1/GB = $500 for one full discovery scan. If that scan runs monthly, the bill is $500/month per bucket — for a single 500 GB bucket. Across an organization with 50 buckets of similar size all under quarterly compliance scan, the discovery line lands at ~$8,000/month.

The cost driver is the actual data volume scanned, not the object count. A bucket with 1 million 1 KB files (1 GB total) costs $1 to scan; a bucket with 100 objects of 5 GB each (500 GB total) costs $500. The per-object overhead is minimal; the per-byte content-inspection cost dominates.

Automated Discovery: The Cheaper Broad-Coverage Path

Automated sensitive data discovery samples a configurable subset of objects per bucket continuously. The default sampling rate balances cost and coverage; you can tune it per-bucket if specific buckets need higher confidence.

The economics compared to full jobs:

Automated discovery vs full job — 10 TB bucket inventory

Prices in us-east-1

Same data, two scan strategies. Automated discovery covers all buckets at sampled depth; full jobs scan everything but cost the full data volume.

Full scan, monthly

$10,000/mo

Compliance-grade coverage

Unit price
$1/GB × 10 TB
Example workload
Complete object-level coverage

Automated discovery, 5% sample

$500/mo

95% cheaper; finds which buckets have sensitive data

Unit price
$1/GB × 500 GB
Example workload
Broad bucket-level signal

Layered: automated + targeted full scans

~$2,500/mo

Best balance of cost and coverage

Unit price
Mixed
Example workload
20% of inventory flagged → full scan

Quarterly full scan only on flagged buckets

Bursty, scopeable

Schedule for budget control

Unit price
Per-quarter cost
Example workload
Compliance reporting

The layered approach — automated on everything, full scans on flagged subsets — is the right pattern for most compliance programs.

Custom Data Identifiers: Free at the Per-GB Rate

Custom data identifiers (regex patterns + keyword proximity rules + character distribution checks for your organization-specific data shapes — internal IDs, account number formats, proprietary classifications) are included in the standard per-GB discovery rate. There is no per-identifier surcharge.

This makes Macie cost-effective for organizations with unusual data-classification requirements. Build custom identifiers for your specific patterns; the discovery cost remains the same as the built-in identifiers.

The waste pattern: not using custom identifiers and instead running ad-hoc S3 queries via Athena to find specific patterns. Athena scans all data anyway and provides less structured findings; Macie’s per-GB rate covers the same scanning with managed pattern matching plus the integration with Security Hub and EventBridge.

Multi-Region: The Hidden Multiplier

Macie is enabled per region. Data replicated across regions (via S3 Cross-Region Replication or by application-level replication) means the same bytes get scanned multiple times if Macie runs in each region.

The mitigation: scope Macie to the source region. Replicated buckets in secondary regions can rely on the source-region scan results for compliance reporting — CRR guarantees byte-identical replication. Configure Macie organization-wide with explicit per-region enablement that matches your data-residency posture; don’t enable globally by default.

Bucket-Level Evaluation: The Per-Bucket Compounding

The $0.10/bucket/month evaluation charge covers bucket-policy review, encryption status, public access checks, and replication configuration. It is small per bucket but compounds at scale: a multi-account organization with 1,000 buckets pays $100/month for evaluation alone.

The line is non-negotiable when Macie is enabled (it is the base “Macie is on” charge). The mitigation is bucket consolidation rather than individual bucket cleanup — fewer buckets, lower evaluation bill. For organizations with bucket sprawl (one bucket per microservice per environment per region), the consolidated-bucket model with prefixes for tenancy reduces the evaluation line proportionally.

When to Use Macie vs Alternatives

Macie for managed sensitive-data classification on S3; custom Lambda+Comprehend pipeline only for unusual requirements.

Use when

  • Compliance-driven sensitive-data inventory across multi-bucket S3 deployments
  • Buckets receiving user-uploaded content where PII detection drives data handling decisions
  • Organizations needing managed PII / PHI classification with built-in identifiers (GDPR, HIPAA, PCI compliance)
  • Custom data identifiers for organization-specific patterns — included in the per-GB rate
  • Layered scanning: automated discovery on all buckets, full jobs on flagged subsets

Avoid when

  • Quarterly full scans on every bucket in inventory — pure waste compared to layered approach
  • Macie enabled in regions where sensitive data does not actually land
  • Macie on buckets containing only build artifacts, container images, or non-sensitive logs
  • Replicated data scanned in both source and destination regions — scan source only
  • DIY S3+Lambda+Comprehend pipelines at any scale where Macie covers the use case — operational cost dwarfs Macie

The compliance and operational value of managed classification almost always exceeds the bill. The optimization is in scope, not in choosing alternatives.

A 30-Day Macie Bill Cleanup Plan

Week 1 — Job scope audit. Identify scheduled discovery jobs. For each job with broad bucket scope, narrow to compliance-tracked buckets only. Use automated discovery for broader visibility on non-compliance buckets.

Week 2 — Regional enablement. Map Macie regional enablement to actual sensitive-data residency. Disable Macie in regions where sensitive data does not land (build artifact regions, log-only regions).

Week 3 — Automated discovery tuning. Review automated discovery sampling rates per bucket. Increase sampling on buckets that have generated findings; decrease on buckets consistently clean.

Week 4 — Custom identifier review. Validate that custom data identifiers cover the organization-specific patterns. Add identifiers for internal classifications previously discovered via ad-hoc Athena queries.

What This Post Doesn’t Cover

  • Finding triage workflows — covered in our Macie investigation guide.
  • Detective integration for investigation — Detective bills independently; covered in our security operations content.
  • Comparison with third-party DLP tools (Symantec, McAfee MVISION) — different operational model, different pricing structure.
  • DataZone classification overlap — DataZone provides a different classification layer for data lineage; covered separately.

If You Only Do One Thing This Week

Switch from monthly full-scan jobs to automated discovery on most buckets, with scheduled full scans only on buckets that have actually generated findings in the past quarter. Run aws macie2 list-classification-jobs to find currently-scheduled jobs; for each, check the bucket scope and the schedule. Replace broad-scope monthly jobs with automated discovery (toggle in the Macie console under Settings → Sensitive data discovery → Automated). The change typically cuts the discovery line by 70–90% with no loss of compliance coverage.

For the operational triage of Macie findings, the Macie investigation guide covers the design side.

PP
Palaniappan P

AWS Cloud Architect & AI Expert

AWS-certified cloud architect and AI expert with deep expertise in cloud migrations, cost optimization, and generative AI on AWS.

AWS ArchitectureCloud MigrationGenAI on AWSCost OptimizationDevOps

Recommended Reading

Explore All Articles »
5 min

Amazon GuardDuty Pricing: Nine Data Sources, One Compounding Bill

GuardDuty bills across nine separate data sources — CloudTrail management events at $4/M tiered down, VPC Flow Logs at $1/GB tiered, EKS Runtime Monitoring per vCPU-hour, plus S3, DNS, Lambda, RDS, and Malware Protection. The 30-day free trial regularly hides the true production bill, and organization-wide auto-enable turns every new account into a billing line.