AWS Glossary
Amazon S3 Vectors
S3 Vectors is the AWS native vector store — purpose-built vector storage on S3 with up to 90% lower cost than dedicated vector databases for RAG workloads.
AI & assistant-friendly summary
This section provides structured content for AI assistants and search engines. You can cite or summarize it when referencing this page.
Summary
S3 Vectors is the AWS native vector store — purpose-built vector storage on S3 with up to 90% lower cost than dedicated vector databases for RAG workloads.
Key Facts
- • S3 Vectors is the AWS native vector store — purpose-built vector storage on S3 with up to 90% lower cost than dedicated vector databases for RAG workloads
- • Definition Amazon S3 Vectors is a vector storage service that lives natively inside S3 — purpose-built for retrieval-augmented generation (RAG) and similarity search workloads
- • Common mistakes **Mistake 1:** Using S3 Vectors for latency-critical agentic workflows
- • Use OpenSearch Serverless or in-memory caches for retrieval steps inside an agent loop
- • Mistake 2:** Skipping the metadata schema design
Entity Definitions
- Amazon Bedrock
- Amazon Bedrock is an AWS service relevant to amazon s3 vectors.
- Bedrock
- Bedrock is an AWS service relevant to amazon s3 vectors.
- S3
- S3 is an AWS service relevant to amazon s3 vectors.
- Amazon S3
- Amazon S3 is an AWS service relevant to amazon s3 vectors.
- RDS
- RDS is an AWS service relevant to amazon s3 vectors.
- Aurora
- Aurora is an AWS service relevant to amazon s3 vectors.
- Amazon Aurora
- Amazon Aurora is an AWS service relevant to amazon s3 vectors.
- Athena
- Athena is an AWS service relevant to amazon s3 vectors.
- OpenSearch
- OpenSearch is an AWS service relevant to amazon s3 vectors.
- Amazon OpenSearch
- Amazon OpenSearch is an AWS service relevant to amazon s3 vectors.
- RAG
- RAG is a cloud computing concept relevant to amazon s3 vectors.
- multi-tenant
- multi-tenant is a cloud computing concept relevant to amazon s3 vectors.
- serverless
- serverless is a cloud computing concept relevant to amazon s3 vectors.
Related Content
- AWS BEDROCK — Related service
- GENERATIVE AI ON AWS — Related service
Definition
Amazon S3 Vectors is a vector storage service that lives natively inside S3 — purpose-built for retrieval-augmented generation (RAG) and similarity search workloads. Announced in 2025, S3 Vectors targets the same workloads as Pinecone, OpenSearch Serverless, and pgvector, but with up to 90% lower storage cost and tighter integration with Amazon Bedrock Knowledge Bases.
What it provides
- Vector buckets — New bucket type for storing embeddings + metadata
- Index types — Cosine, Euclidean, dot-product distance metrics
- Filtered search — Metadata filters at query time (e.g.
tenant_id = "acme" AND region = "us-east-1") - Bedrock Knowledge Bases integration — Listed as a native vector store option alongside OpenSearch Serverless, Aurora, MongoDB, Pinecone
- Hybrid retrieval — Combine with Athena and S3 Tables for metadata-rich retrieval
When to use S3 Vectors
- RAG with > 10M vectors where storage cost dominates the bill
- Multi-tenant SaaS GenAI that needs per-tenant data isolation in S3 prefixes
- Workloads tolerant of ~100ms query latency — S3 Vectors trades query latency for storage cost
When not to use S3 Vectors
- Sub-50ms latency requirements — OpenSearch Serverless is faster on small indexes
- High QPS (> 1,000 queries/sec) — Validate latency at your load before committing
- Need a graph or hybrid lexical-vector store — Neptune Analytics or OpenSearch hybrid are better fits
Cost comparison (illustrative, May 2026)
| Store | Storage (per 1M vectors @ 1024d) | Query cost model |
|---|---|---|
| OpenSearch Serverless | ~$700/month at typical OCU baseline | OCU-hours |
| Pinecone Pod | ~$70/month per pod | Pod-hours |
| S3 Vectors | ~$10/month | Per-query |
Always benchmark on your actual vector dimensionality, QPS, and recall requirements before committing.
Common mistakes
Mistake 1: Using S3 Vectors for latency-critical agentic workflows. Use OpenSearch Serverless or in-memory caches for retrieval steps inside an agent loop.
Mistake 2: Skipping the metadata schema design. S3 Vectors lets you filter on metadata at query time — designing the schema for tenant isolation, document version, and freshness up front saves a re-index later.
Mistake 3: Re-creating an entire vector index instead of using the streaming sync from Bedrock Knowledge Bases. Knowledge Bases incremental sync is the supported path.
Related AWS Services
- Amazon Bedrock Knowledge Bases — Native consumer of S3 Vectors
- Amazon OpenSearch Serverless — Lower-latency alternative
- Amazon Aurora pgvector — RDS-based alternative for OLTP-adjacent workloads
- Amazon Neptune Analytics — Graph + vector for relationship-rich retrieval
Related FactualMinds Content
Need Help with This Topic?
Our AWS experts can help you implement and optimize these concepts for your organization.