AWS AI services: how to think about them

AWS AI services have expanded faster than almost any other part of the platform. Writing a comprehensive feature list would be out of date within weeks. This post takes a different approach: covering the four tiers of AWS AI and how to decide which one fits your problem. The tiers do not change even when the individual services do.

The four tiers

Pre-built AI services are managed services that solve a specific, well-defined problem without requiring any machine learning expertise. You call an API, you get an answer.

Bedrock provides access to foundation models via API. You are not running or training a model. You are choosing from a catalogue of pre-trained models and using them to build applications.

SageMaker is a full ML platform for teams that need to train, fine-tune, or deploy their own models. Requires ML expertise to use effectively.

Amazon Q is an AI assistant designed to be connected to your own data and tools. It exists in several forms depending on the use case.

Pre-built AI services

These are the right starting point for the majority of use cases. The problem is already solved, the model is already trained, and you pay per API call. No infrastructure to manage.

Textract

Document extraction. Where traditional OCR fell apart on structured forms and tables, Textract understands context. Feed it a PDF or image of an invoice, a mortgage application, or a patient form and it returns the structured data. Common in financial services, healthcare, and legal workflows.

Transcribe

Speech to text. Supports real-time and batch transcription, speaker identification, custom vocabularies for domain-specific terminology, and PII redaction built in. The redaction capability matters for any regulated workload where transcripts cannot contain card numbers or personal identifiers.

Comprehend

Natural language processing. Entity recognition, sentiment analysis, key phrase extraction, and PII detection on text. Useful downstream of Transcribe for extracting insight from transcripts, or directly on any text corpus you want to analyse at scale.

Rekognition

Image and video analysis. Face detection and comparison, content moderation, object and scene detection, text in images. Content moderation use cases often reach for this first because it handles the heavy lifting of flagging unsafe content before human review.

Polly

Text to speech. Realistic voice synthesis with multiple voices and languages. Useful when you need to generate audio at scale rather than maintaining audio files manually.

Translate

Language translation. Supports real-time and batch translation across a large number of language pairs. Integrates cleanly with the rest of the pre-built services.

Bedrock

Bedrock is the right tier when the pre-built services do not cover your use case and you want to build on top of a foundation model without managing one yourself.

You choose from a catalogue of models including Amazon’s own Nova and Titan families, Anthropic’s Claude, Meta’s Llama, Mistral, and others. Different models suit different tasks, languages, and cost points. You can switch models without changing your application architecture.

Bedrock is not just model access. The platform includes:

Knowledge Bases connect a foundation model to your own data via retrieval-augmented generation (RAG). The model answers questions grounded in your documents rather than its training data alone. This is the pattern to reach for when you need an AI that knows about your business, not just the internet.

Agents enable multi-step reasoning where the model can call APIs and take actions to complete a task. You define the tools it can use; the model figures out the sequence of steps.

Guardrails provide content filtering and PII redaction applied consistently across your Bedrock usage. Useful for any customer-facing application where you need to control what the model will and will not respond to.

SageMaker

SageMaker is the right tier when Bedrock does not give you what you need: either because the available foundation models are not suitable for your problem, you need to fine-tune a model on proprietary data, or you need full control over the training and deployment pipeline.

It is a significant platform in its own right. Expect to invest time in it and have ML engineering capability available. For teams that need it, it covers everything from data labelling through training, evaluation, and production deployment.

For most AWS application teams, SageMaker is not the right starting point. Start with the pre-built services or Bedrock and escalate to SageMaker only when those tiers cannot meet your requirements.

Amazon Q

Amazon Q comes in several forms that serve different purposes.

Q Developer is a code assistant integrated into IDEs and the CLI. It generates code, explains existing code, and can handle larger tasks like upgrading Java applications or transforming code between frameworks. It is AWS-aware, so it understands CloudFormation, CDK, and AWS service patterns.

Q Business connects to your internal data sources, documents, and tools to answer employee questions in natural language. Think of it as a search and Q&A layer across your organisation’s knowledge. It respects the access controls of the underlying data sources.

Q in QuickSight adds natural language querying to your BI dashboards. You describe the chart or analysis you want rather than building it manually.

Which tier should I use?

Problem	Tier
Document extraction, transcription, image analysis, NLP, translation	Pre-built services
Building an application on top of an LLM	Bedrock
Fine-tuning or training your own model	SageMaker
AI assistant connected to your own data	Q Business
Code assistance in the IDE	Q Developer

Start at the top of that list and only go further down when the tier above cannot meet the requirement. The pre-built services are the fastest path to production, the lowest operational burden, and the lowest risk. Foundation models via Bedrock are the right choice for open-ended language tasks. SageMaker is powerful but carries real complexity cost.

The services within each tier will continue to evolve. The tiers themselves are stable.