hidekazu-konishi.com

Amazon Bedrock Models as of 2024 - An Analysis of the Comprehensive Model Catalog

First Published: 2024-12-26
Last Updated: 2025-03-12

AWS announces significant new technologies and services every year at re:Invent, and AWS re:Invent 2024 featured various new service and feature announcements.
In this article, I will focus on the announcements related to Amazon Bedrock, summarize the overall framework of Amazon Bedrock announcements at AWS re:Invent 2024, and consider an overview of Amazon Bedrock models as of the end of 2024.

Summary of Overall Amazon Bedrock Announcements at AWS re:Invent 2024

First, I have categorized the Amazon Bedrock announcements from AWS re:Invent 2024 into six categories from my perspective.
Within this framework, items marked with (*) are announcements related to Amazon Bedrock models.

Enhancement of RAG (Retrieval Augmented Generation) functionality
- (*) Optimization of search with Rerank model and API introduction
- Improvement of user experience through streaming output
- Implementation of custom connectors and streaming ingestion
- Integration with GraphRAG and Kendra GenAI index
Expansion of multimodal support
- (*) Provision of various models through Nova series (text, image, video, audio)
- Multimodal data ingestion for knowledge base
- Toxicity detection function for images
Model optimization and performance improvement
- Introduction of latency optimization options
- Implementation of model distillation functionality
- Support for prompt caching
- Prompt routing functionality
Ecosystem expansion
- (*) Introduction of Bedrock marketplace
- (*) Addition of new model providers
- (*) Integration of third-party models
Strengthening of quality control and evaluation functions
- Model evaluation using LLM-as-a-judge
- Evaluation function for knowledge base
- Functional expansion of guardrails
- Introduction of automatic inference checks
Promotion of automation and efficiency
- Introduction of data automation functionality
- Support for structured data queries
- Multi-agent collaboration functionality

Looking back at the overall Amazon Bedrock announcements at AWS re:Invent 2024 from this perspective, it appears that the following policies have been put forward regarding Amazon Bedrock:

Improvement of reliability and safety assuming enterprise use
Improvement of performance and efficiency of the entire Bedrock utilization system
Improvement of user and developer experience and lowering of implementation barriers
Support for a wider range of use cases

Amazon Bedrock, which became General Availability (GA) on 2023-09-28, has shifted its update content from providing a general generative AI foundation to functional enhancements and service expansions to meet various needs after one year.

Number of Amazon Bedrock Models (as of 2024-12-16)

Next, I'll summarize the number of models available in Amazon Bedrock.
In the Amazon Bedrock model catalog, models that can be used when enabled in Amazon Bedrock Model Access have come to be called Serverless models to distinguish them from models available from the Amazon Bedrock Marketplace.
The following calculation formula represents the number of models available in Amazon Bedrock in N. Virginia [us-east-1] and Oregon [us-west-2] as of 2024-12-16.

Serverless (us-east-1): |E| = 46
Serverless (us-west-2): |W| = 44
Serverless (us-east-1 ∪ us-west-2): |E ∪ W| = 52
Bedrock Marketplace: |M| = 122
Total: |E ∪ W| + |M| = 174

In other words, as of 2024-12-16, there are 52 types of Serverless models when merged, 122 types in the Bedrock Marketplace, and a total of 174 types of models available in N. Virginia [us-east-1] and Oregon [us-west-2].

List of Amazon Bedrock Serverless Models: By Region (as of 2024-12-16)

Next, I'd like to look at the list of 52 types of Serverless models mentioned in the calculation formula earlier, merged for N. Virginia [us-east-1] and Oregon [us-west-2] as of 2024-12-16, by region.

Here's how to read the table:
* List of models by provider and model type (merged for N. Virginia [us-east-1] and Oregon [us-west-2])
* Italics (red) are models available only in N. Virginia [us-east-1]
* Bold (blue) are models available only in Oregon [us-west-2]

Provider	Modality	Model Name
AI21 Labs	Text	Jurassic-2[Ultra, Mid], Jamba 1.5[Mini, Large], Jamba-Instruct
Amazon	Text	Nova Micro, Titan Text G1[Lite, Express, Premier], Rerank 1.0
Amazon	Text & Vision	Nova[Pro, Lite]
Amazon	Image	Nova Canvas, Titan Image Generator G1[v1, v2]
Amazon	Video	Nova Reel
Amazon	Embeddings	Titan Multimodal Embeddings G1, Titan Text Embeddings V2, Titan Embeddings G1 – Text
Anthropic	Text	Claude 3.5[Haiku], Claude[v2.1, v2.0], Claude Instant[v1.2]
Anthropic	Text & Vision	Claude 3.5[Sonnet v2, Sonnet], Claude 3[Haiku, Sonnet, Opus]
Cohere	Text	Command R+, Command R, Command, Command Light, Rerank 3.5
Cohere	Embeddings	Embed Multilingual, Embed English
Meta	Text	Llama 3.2[1B Instruct, 3B Instruct], Llama 3.1[8B Instruct, 70B Instruct, 405B Instruct], Llama 3[8B Instruct, 70B Instruct]
Meta	Text & Vision	Llama 3.2[90B Vision Instruct, 11B Vision Instruct]
Mistral AI	Text	Mistral Large 2, Mistral Large, Mistral Small, Mixtral 8x7B Instruct, Mistral 7B Instruct
Stability AI	Image	SD3 Large 1.0, Stable Image Core 1.0, Stable Image Ultra 1.0, SDXL(1.0)

Based on this table, it seems likely that models that consume more capacity are placed only in Oregon [us-west-2], while more challenging models are placed only in N. Virginia [us-east-1].

List of Amazon Bedrock Serverless Models: By Provision Period (as of 2024-12-16)

Next, let's look at the list of Serverless models merged for N. Virginia [us-east-1] and Oregon [us-west-2] from a different perspective, categorized by provision period.

Here's how to read the table:
* List of models by provider and model type (merged for N. Virginia [us-east-1] and Oregon [us-west-2])
* Italics (green) are models that became available from January 2024 to before AWS re:Invent 2024
* Bold (brown)) are models that became available at AWS re:Invent 2024

Provider	Modality	Model Name
AI21 Labs	Text	Jurassic-2[Ultra, Mid], Jamba 1.5[Mini, Large], Jamba-Instruct
Amazon	Text	Nova Micro, Titan Text G1[Lite, Express, Premier], Rerank 1.0
Amazon	Text & Vision	Nova[Pro, Lite]
Amazon	Image	Nova Canvas, Titan Image Generator G1[v1, v2]
Amazon	Video	Nova Reel
Amazon	Embeddings	Titan Multimodal Embeddings G1, Titan Text Embeddings V2, Titan Embeddings G1 – Text
Anthropic	Text	Claude 3.5[Haiku], Claude[v2.1, v2.0], Claude Instant[v1.2]
Anthropic	Text & Vision	Claude 3.5[Sonnet v2, Sonnet], Claude 3[Haiku, Sonnet, Opus]
Cohere	Text	Command R+, Command R, Command, Command Light, Rerank 3.5
Cohere	Embeddings	Embed Multilingual, Embed English
Meta	Text	Llama 3.2[1B Instruct, 3B Instruct], Llama 3.1[8B Instruct, 70B Instruct, 405B Instruct], Llama 3[8B Instruct, 70B Instruct]
Meta	Text & Vision	Llama 3.2[90B Vision Instruct, 11B Vision Instruct]
Mistral AI	Text	Mistral Large 2, Mistral Large, Mistral Small, Mixtral 8x7B Instruct, Mistral 7B Instruct
Stability AI	Image	SD3 Large 1.0, Stable Image Core 1.0, Stable Image Ultra 1.0, SDXL(1.0)

As you can see from this table, most of the models as of 2024-12-16 became available in 2024.
This is natural as Amazon Bedrock became General Availability (GA) on 2023-09-28, but I can also see a trend where older versions of models provided at the initial GA are gradually phased out as new versions appear.
From this, it is important to identify which models the newly announced models correspond to as successors, and to understand the functions and features of new models, confirm their substitutability, and prepare for migration as early as possible, assuming the replacement of older model versions.

Serverless Models Announced at AWS re:Invent 2024 (Available)

Next, I'll introduce the new Serverless models announced as available at AWS re:Invent 2024.
Although not listed in this table, among the models announced as coming soon at AWS re:Invent 2024, which will be introduced in the next heading, Stable Diffusion 3.5 Large became available on 2024-12-19.

Here's how to read the table:
* List of models by provider and model type (merged for N. Virginia [us-east-1] and Oregon [us-west-2])
* Italics (red) are models available only in N. Virginia [us-east-1]
* Bold (blue) are models available only in Oregon [us-west-2]

Provider	Modality	Model Name	Overview
Cohere	Text	Rerank 3.5	A model to improve search accuracy in RAG applications. It takes user queries and retrieved document sets as input, re-ranks them based on relevance, and prioritizes the selection of optimal documents as model input to improve the quality of generated responses.
Amazon	Text	Rerank 1.0	A model to improve search accuracy in RAG applications. It takes user queries and retrieved document sets as input, re-ranks them based on relevance, and prioritizes the selection of optimal documents as model input to improve the quality of generated responses.
Amazon	Text	Nova Micro	A text-only model capable of high-speed processing with minimal latency. Optimized for basic tasks such as summarization, translation, classification, dialogue, and coding with a context length of 128K tokens. Supports fine-tuning.
Amazon	Text & Vision	Nova Lite	A low-cost multimodal model capable of high-speed processing. Generates text from image, video, and text inputs. Can handle inputs up to 300K tokens and analyze multiple images and videos up to 30 minutes. Supports fine-tuning.
Amazon	Text & Vision	Nova Pro	A high-performance multimodal model with excellent balance of accuracy, speed, and cost. Supports inputs up to 300K tokens and achieves top-level performance in visual question answering and video understanding. Capable of executing complex workflows through API and tool integration. Supports fine-tuning.
Amazon	Image	Nova Canvas	A state-of-the-art model capable of high-quality image generation. Features precise control over style and content, inpainting (partial modification), outpainting (image extension), and background removal editing functions. Achieves high performance in evaluating image generation fidelity.
Amazon	Video	Nova Reel	A state-of-the-art model capable of generating professional-quality videos. Enables video generation from text or images, with control over visual style and pacing. Demonstrates excellent performance in video quality and consistency.

Until now, Amazon as a model provider has offered the Amazon Titan series, but in addition to that, the Amazon Nova series has become available, providing text generation from multimodal inputs, image generation with editing functions, and video generation with control over visual style and pacing.
Also, Rerank models (Amazon Rerank, Cohere Rerank) to improve search accuracy in RAG applications have been provided and become available from Amazon and Cohere providers.

Serverless Models Announced at AWS re:Invent 2024 (Coming Soon)

Next, I'll introduce the Serverless models announced as coming soon at AWS re:Invent 2024.
Among these, Stable Diffusion 3.5 Large became available on 2024-12-19.

Provider	Modality	Model Name	Overview
Amazon	Text & Vision	Nova Premier	The top-tier multimodal model for complex inference tasks. Also optimal as a teacher model for knowledge distillation of custom models (knowledge transfer of probability distribution and latent representation of intermediate layers from large-scale models to small-scale models). Scheduled for release in early 2025.
poolside	Text	malibu	A model specialized for complex software engineering challenges such as advanced tasks including code generation, test creation, refactoring, and documentation creation. By collaborating with an assistant, it can be used directly within the developer's IDE, is fine-tuned based on the knowledge base, and has the flexibility to meet organization-specific needs.
poolside	Text	point	A model specialized in rapid code completion that accurately predicts developer needs using advanced context awareness. By collaborating with an assistant, it can be used directly within the developer's IDE, is fine-tuned based on the knowledge base, and has the flexibility to meet organization-specific needs.
Stability AI	Image	Stable Diffusion 3.5 Large	* Became available on 2024-12-19. The latest high-performance image generation model provided by Stability AI. Capable of generating high-quality and beautiful images from text. Streamlines the creation of concept art, visual effects, and detailed product images.
Luma AI	Video	Ray 2	The latest video generation model that can generate high-quality videos from text or image prompts in about 10 seconds. Achieves smooth motion, advanced filming techniques, and dynamic camera work, capable of creating videos up to 1 minute long.

Amazon Bedrock Marketplace Model List (as of 2024-12-16)

Finally, I'll introduce the models in the Amazon Bedrock Marketplace.
As mentioned at the beginning, there were 122 types of models in the Amazon Bedrock Marketplace as of 2024-12-16, but the majority of these, 83 types, were models from HuggingFace, an open-source community.
In addition, various providers offer models with unique features, such as industry-specific models for medical and financial sectors.

Provider	Category	Model Name
HuggingFace	Text Generation, Text Summarization, Automatic Speech Recognition, etc.	Number of models: 83 types. Main series: BART, Bloom, DBRX, Dolly, EleutherAI GPT, Falcon, Flan-T5, Gemma, Mistral, MPT, Phi, Yi, Zephyr, etc.
Arcee AI	Text Generation	Arcee[Lite, Nova, SuperNova], Llama Spark, Llama 3.1 SuperNova Lite
Camb.ai	Text To Audio	MARS6
EvolutionaryScale, PBC	Multimodal Generation	ESM3-open
Gretel	Text Generation	Gretel Navigator Tabular
IBM Data and AI	Text Generation	IBM Granite[8B Code Instruct - 128K, 3B Code Instruct - 128K, 34B Code Instruct - 8K, 20B Code Instruct - 8K], Granite 3.0[8B Instruct, 2B Instruct]
John Snow Labs	Text Summarization	Medical LLM[Small, Medium]
John Snow Labs	Translation	Medical Text Translation (EN-ES)
Karakuri, Inc.	Text Generation	KARAKURI LM 8x7b instruct
LG CNS	Text Generation	EXAONE_v3.0 7.8B Instruct
Liquidai	Text Generation	Liquid LFM[40B (L40S), 40B (H100), 40B (A100)]
NCSoft	Text Generation	Llama-3-Varco-Offsetbias-8B, VARCO LLM KO/EN-13B-IST
NVIDIA	Text Generation	NVIDIA Nemotron-4 15B NIM Microservice
Preferred Networks, Inc.	Text Generation	PLaMo API
Stability AI	Text To Image	Stable Diffusion 3.5 Large
Stockmark Inc.	Text Generation	Stockmark-LLM-13b
Upstage	Text Generation	Solar[Pro, Pro – Quant], Solar Mini[Chat, Chat – Quant, Chat ja, Chat ja – Quant]
Widn.AI	Translation	Widn Tower Sugarloaf, Widn Tower Anthill, Widn Llama3-Tower Vesuvius
Writer	Text Generation	Writer Palmyra-Med-70B-32K, Writer Palmyra-Fin-70B-32K

References:
Tech Blog with curated related content
Comprehensive Overview of Amazon Bedrock Models 2024(Japanese Presentation) | Speaker Deck
Comprehensive Overview of Amazon Bedrock Models 2024(Japanese Presentation) | Docswell

Summary

In this article, I summarized the overall framework of Amazon Bedrock announcements at AWS re:Invent 2024 and considered an overview of Amazon Bedrock models as of the end of 2024.

For Amazon Bedrock as a whole at AWS re:Invent 2024, I felt that it strongly emphasized directions such as improving reliability and safety assuming enterprise use, enhancing performance and efficiency of the entire Bedrock utilization system, improving user and developer experience and lowering implementation barriers, and supporting a wider range of use cases.

Focusing on the Amazon Bedrock model updates at AWS re:Invent 2024, the major changes were the addition of the Amazon Nova series strengthening Amazon's proprietary AI models including multimodal capabilities, the addition of Rerank models (Amazon Rerank, Cohere Rerank) enabling improved RAG search accuracy, and the emergence of Amazon Bedrock Marketplace making even more diverse models available.

Also, examining Amazon Bedrock models as of the end of 2024 from various angles revealed that the cycle of feature additions and model additions/removals in Amazon Bedrock is rapid, and it's important to consider this in keeping up and building systems.
Especially for systems in the operational phase, I strongly felt the need to constantly try out and understand the latest models, assuming the replacement of currently used models.

On the other hand, with the introduction of Amazon Bedrock Marketplace in addition to Amazon Bedrock's Serverless models, numerous industry-specific models addressing particular challenges and models specialized in specific technical domains are now available, allowing users to select models that are more optimized for their problem-solving needs.

I plan to continue keeping up with updates to Amazon Bedrock and AI models, and to try them out regularly to understand their functions, features, and use cases.

Written by Hidekazu Konishi