Skip to main content

AWS Glossary

Amazon S3 Tables

S3 Tables are managed Apache Iceberg tables on S3 — purpose-built table buckets with auto-compaction, snapshot management, and up to 3× better query performance than self-managed Iceberg on standard S3.

AI & assistant-friendly summary

This section provides structured content for AI assistants and search engines. You can cite or summarize it when referencing this page.

Summary

S3 Tables are managed Apache Iceberg tables on S3 — purpose-built table buckets with auto-compaction, snapshot management, and up to 3× better query performance than self-managed Iceberg on standard S3.

Key Facts

  • Definition Amazon S3 Tables are managed Apache Iceberg tables that live in a new bucket type called a _table bucket_
  • AWS handles table maintenance — compaction, snapshot expiration, unreferenced file cleanup — that data engineers traditionally script themselves
  • S3 Tables reached GA at re:Invent 2024 and integrate natively with Athena, Redshift, EMR, Glue, and Amazon SageMaker Lakehouse
  • Common mistakes **Mistake 1:** Putting general-purpose object data in a table bucket
  • Table buckets are optimized for Iceberg tables — use a regular S3 bucket for media, logs, or backups

Entity Definitions

SageMaker
SageMaker is an AWS service relevant to amazon s3 tables.
Amazon SageMaker
Amazon SageMaker is an AWS service relevant to amazon s3 tables.
S3
S3 is an AWS service relevant to amazon s3 tables.
Amazon S3
Amazon S3 is an AWS service relevant to amazon s3 tables.
IAM
IAM is an AWS service relevant to amazon s3 tables.
Glue
Glue is an AWS service relevant to amazon s3 tables.
AWS Glue
AWS Glue is an AWS service relevant to amazon s3 tables.
Athena
Athena is an AWS service relevant to amazon s3 tables.
Amazon Athena
Amazon Athena is an AWS service relevant to amazon s3 tables.
serverless
serverless is a cloud computing concept relevant to amazon s3 tables.

Related Content

Definition

Amazon S3 Tables are managed Apache Iceberg tables that live in a new bucket type called a table bucket. AWS handles table maintenance — compaction, snapshot expiration, unreferenced file cleanup — that data engineers traditionally script themselves. S3 Tables reached GA at re:Invent 2024 and integrate natively with Athena, Redshift, EMR, Glue, and Amazon SageMaker Lakehouse.

Why S3 Tables vs raw Iceberg on S3

AspectSelf-managed Iceberg on S3S3 Tables
CompactionYou run Glue/Spark jobs on a scheduleAutomatic, managed
Snapshot expirationYou maintainAutomatic
Query performanceBaselineUp to 3× faster on selective queries
Transactions / secS3 baselineUp to 10× higher writes/sec
CatalogBring your own (Glue Catalog)Native catalog or AWS Glue Data Catalog

Capabilities

Pricing model

S3 Tables charge for:

For most analytical workloads the managed maintenance overhead is offset by query-performance gains; benchmark on your dataset before committing.

Common mistakes

Mistake 1: Putting general-purpose object data in a table bucket. Table buckets are optimized for Iceberg tables — use a regular S3 bucket for media, logs, or backups.

Mistake 2: Forgetting to set table-level retention. Iceberg snapshots accumulate forever by default. Configure snapshot expiration policies up front.

Mistake 3: Granting overly broad table-bucket permissions. Use IAM table-level grants (the new bucket type supports table-scoped permissions) rather than full-bucket access.

Need Help with This Topic?

Our AWS experts can help you implement and optimize these concepts for your organization.