Skip to main content

AWS Glossary

AWS Step Functions

Serverless workflow orchestration service for coordinating distributed applications and multi-step processes using visual state machines.

AI & assistant-friendly summary

This section provides structured content for AI assistants and search engines. You can cite or summarize it when referencing this page.

Summary

Serverless workflow orchestration service for coordinating distributed applications and multi-step processes using visual state machines.

Key Facts

  • Serverless workflow orchestration service for coordinating distributed applications and multi-step processes using visual state machines
  • Definition AWS Step Functions is a serverless workflow orchestration service that lets you coordinate multiple AWS services into visual state machines
  • It is used for order processing, data pipelines, ML training jobs, microservice coordination, and increasingly as the orchestration layer for AI agents
  • How Step Functions Works A Step Functions workflow is a **state machine** defined in Amazon States Language (JSON/YAML)
  • ServiceException"], "IntervalSeconds": 2, "MaxAttempts": 3, "BackoffRate": 2 }], "Catch": [{ "ErrorEquals": ["States

Entity Definitions

AWS Bedrock
AWS Bedrock is an AWS service relevant to aws step functions.
Amazon Bedrock
Amazon Bedrock is an AWS service relevant to aws step functions.
Bedrock
Bedrock is an AWS service relevant to aws step functions.
SageMaker
SageMaker is an AWS service relevant to aws step functions.
Lambda
Lambda is an AWS service relevant to aws step functions.
AWS Lambda
AWS Lambda is an AWS service relevant to aws step functions.
S3
S3 is an AWS service relevant to aws step functions.
DynamoDB
DynamoDB is an AWS service relevant to aws step functions.
CloudWatch
CloudWatch is an AWS service relevant to aws step functions.
ECS
ECS is an AWS service relevant to aws step functions.
Step Functions
Step Functions is an AWS service relevant to aws step functions.
EventBridge
EventBridge is an AWS service relevant to aws step functions.
Amazon EventBridge
Amazon EventBridge is an AWS service relevant to aws step functions.
SQS
SQS is an AWS service relevant to aws step functions.
SNS
SNS is an AWS service relevant to aws step functions.

Related Content

Definition

AWS Step Functions is a serverless workflow orchestration service that lets you coordinate multiple AWS services into visual state machines. Instead of writing complex retry logic, error handling, and state management into your Lambda functions, Step Functions handles orchestration — your code focuses on business logic, and Step Functions manages the workflow. It is used for order processing, data pipelines, ML training jobs, microservice coordination, and increasingly as the orchestration layer for AI agents.

How Step Functions Works

A Step Functions workflow is a state machine defined in Amazon States Language (JSON/YAML). Each state represents a step in your workflow:

Workflow Types

Standard Workflows

Express Workflows

SDK Integrations (Optimized Integrations)

Step Functions integrates directly with 220+ AWS services without writing Lambda code:

AI Agent Orchestration

Step Functions is increasingly used as the orchestration layer for multi-step AI agent workflows:

Error Handling

Step Functions provides declarative error handling:

"Retry": [{
  "ErrorEquals": ["Lambda.ServiceException"],
  "IntervalSeconds": 2,
  "MaxAttempts": 3,
  "BackoffRate": 2
}],
"Catch": [{
  "ErrorEquals": ["States.ALL"],
  "Next": "HandleError"
}]

Step Functions vs Lambda Durable Functions

Both solve long-running workflow orchestration, but differ in approach:

AspectStep FunctionsLambda Durable Functions
Workflow definitionVisual state machine (ASL)Code-centric (imperative)
VisibilityVisual workflow consoleCloudWatch logs
Ecosystem220+ SDK integrationsLambda-native
Team fitLow-code / ops teamsDeveloper-centric
Max duration1 year1 year

Common Mistakes

Mistake 1: Putting all logic inside Lambda functions when Step Functions handles it. Error handling, retries, parallel execution, and wait states should live in the state machine definition — not in Lambda code.

Mistake 2: Using Standard Workflows for high-volume, short-duration workloads. Standard Workflows charge per state transition (expensive at scale); Express Workflows are 1,000x cheaper for high-throughput scenarios.

Mistake 3: Not using waitForTaskToken for human approval steps. Without it, you need a polling loop; waitForTaskToken pauses the workflow indefinitely until your approval system sends the token back.

Need Help with This Topic?

Our AWS experts can help you implement and optimize these concepts for your organization.