hidekazu-konishi.com

Amazon Bedrock Models as of 2024 - An Analysis of the Comprehensive Model Catalog

First Published:
Last Updated:

AWS announces significant new technologies and services every year at re:Invent, and AWS re:Invent 2024 featured various new service and feature announcements.
In this article, I will focus on the announcements related to Amazon Bedrock, summarize the overall framework of Amazon Bedrock announcements at AWS re:Invent 2024, and consider an overview of Amazon Bedrock models as of the end of 2024.

Summary of Overall Amazon Bedrock Announcements at AWS re:Invent 2024

First, I have categorized the Amazon Bedrock announcements from AWS re:Invent 2024 into six categories from my perspective.
Within this framework, items marked with (*) are announcements related to Amazon Bedrock models.

  • Enhancement of RAG (Retrieval Augmented Generation) functionality
    • (*) Optimization of search with Rerank model and API introduction
    • Improvement of user experience through streaming output
    • Implementation of custom connectors and streaming ingestion
    • Integration with GraphRAG and Kendra GenAI index
  • Expansion of multimodal support
    • (*) Provision of various models through Nova series (text, image, video, audio)
    • Multimodal data ingestion for knowledge base
    • Toxicity detection function for images
  • Model optimization and performance improvement
    • Introduction of latency optimization options
    • Implementation of model distillation functionality
    • Support for prompt caching
    • Prompt routing functionality
  • Ecosystem expansion
    • (*) Introduction of Bedrock marketplace
    • (*) Addition of new model providers
    • (*) Integration of third-party models
  • Strengthening of quality control and evaluation functions
    • Model evaluation using LLM-as-a-judge
    • Evaluation function for knowledge base
    • Functional expansion of guardrails
    • Introduction of automatic inference checks
  • Promotion of automation and efficiency
    • Introduction of data automation functionality
    • Support for structured data queries
    • Multi-agent collaboration functionality
Looking back at the overall Amazon Bedrock announcements at AWS re:Invent 2024 from this perspective, it appears that the following policies have been put forward regarding Amazon Bedrock:

  • Improvement of reliability and safety assuming enterprise use
  • Improvement of performance and efficiency of the entire Bedrock utilization system
  • Improvement of user and developer experience and lowering of implementation barriers
  • Support for a wider range of use cases
Amazon Bedrock, which became General Availability (GA) on 2023-09-28, has shifted its update content from providing a general generative AI foundation to functional enhancements and service expansions to meet various needs after one year.

Number of Amazon Bedrock Models (as of 2024-12-16)

Next, I'll summarize the number of models available in Amazon Bedrock.
In the Amazon Bedrock model catalog, models that can be used when enabled in Amazon Bedrock Model Access have come to be called Serverless models to distinguish them from models available from the Amazon Bedrock Marketplace.
The following calculation formula represents the number of models available in Amazon Bedrock in N. Virginia [us-east-1] and Oregon [us-west-2] as of 2024-12-16.

Serverless (us-east-1): |E| = 46
Serverless (us-west-2): |W| = 44
Serverless (us-east-1 ∪ us-west-2): |E ∪ W| = 52
Bedrock Marketplace: |M| = 122
Total: |E ∪ W| + |M| = 174

In other words, as of 2024-12-16, there are 52 types of Serverless models when merged, 122 types in the Bedrock Marketplace, and a total of 174 types of models available in N. Virginia [us-east-1] and Oregon [us-west-2].

List of Amazon Bedrock Serverless Models: By Region (as of 2024-12-16)

Next, I'd like to look at the list of 52 types of Serverless models mentioned in the calculation formula earlier, merged for N. Virginia [us-east-1] and Oregon [us-west-2] as of 2024-12-16, by region.

Here's how to read the table:
* List of models by provider and model type (merged for N. Virginia [us-east-1] and Oregon [us-west-2])
* Italics (red) are models available only in N. Virginia [us-east-1]
* Bold (blue) are models available only in Oregon [us-west-2]
Provider Modality Model Name
AI21 Labs Text Jurassic-2[Ultra, Mid], Jamba 1.5[Mini, Large], Jamba-Instruct
Amazon Text Nova Micro, Titan Text G1[Lite, Express, Premier], Rerank 1.0
Amazon Text & Vision Nova[Pro, Lite]
Amazon Image Nova Canvas, Titan Image Generator G1[v1, v2]
Amazon Video Nova Reel
Amazon Embeddings Titan Multimodal Embeddings G1, Titan Text Embeddings V2, Titan Embeddings G1 – Text
Anthropic Text Claude 3.5[Haiku], Claude[v2.1, v2.0], Claude Instant[v1.2]
Anthropic Text & Vision Claude 3.5[Sonnet v2, Sonnet], Claude 3[Haiku, Sonnet, Opus]
Cohere Text Command R+, Command R, Command, Command Light, Rerank 3.5
Cohere Embeddings Embed Multilingual, Embed English
Meta Text Llama 3.2[1B Instruct, 3B Instruct],
Llama 3.1[8B Instruct, 70B Instruct, 405B Instruct], Llama 3[8B Instruct, 70B Instruct]
Meta Text & Vision Llama 3.2[90B Vision Instruct, 11B Vision Instruct]
Mistral AI Text Mistral Large 2, Mistral Large, Mistral Small, Mixtral 8x7B Instruct, Mistral 7B Instruct
Stability AI Image SD3 Large 1.0, Stable Image Core 1.0, Stable Image Ultra 1.0, SDXL(1.0)
Based on this table, it seems likely that models that consume more capacity are placed only in Oregon [us-west-2], while more challenging models are placed only in N. Virginia [us-east-1].

List of Amazon Bedrock Serverless Models: By Provision Period (as of 2024-12-16)

Next, let's look at the list of Serverless models merged for N. Virginia [us-east-1] and Oregon [us-west-2] from a different perspective, categorized by provision period.

Here's how to read the table:
* List of models by provider and model type (merged for N. Virginia [us-east-1] and Oregon [us-west-2])
* Italics (green) are models that became available from January 2024 to before AWS re:Invent 2024
* Bold (brown)) are models that became available at AWS re:Invent 2024
Provider Modality Model Name
AI21 Labs Text Jurassic-2[Ultra, Mid], Jamba 1.5[Mini, Large], Jamba-Instruct
Amazon Text Nova Micro, Titan Text G1[Lite, Express, Premier], Rerank 1.0
Amazon Text & Vision Nova[Pro, Lite]
Amazon Image Nova Canvas, Titan Image Generator G1[v1, v2]
Amazon Video Nova Reel
Amazon Embeddings Titan Multimodal Embeddings G1, Titan Text Embeddings V2, Titan Embeddings G1 – Text
Anthropic Text Claude 3.5[Haiku], Claude[v2.1, v2.0], Claude Instant[v1.2]
Anthropic Text & Vision Claude 3.5[Sonnet v2, Sonnet], Claude 3[Haiku, Sonnet, Opus]
Cohere Text Command R+, Command R, Command, Command Light, Rerank 3.5
Cohere Embeddings Embed Multilingual, Embed English
Meta Text Llama 3.2[1B Instruct, 3B Instruct],
Llama 3.1[8B Instruct, 70B Instruct, 405B Instruct], Llama 3[8B Instruct, 70B Instruct]
Meta Text & Vision Llama 3.2[90B Vision Instruct, 11B Vision Instruct]
Mistral AI Text Mistral Large 2, Mistral Large, Mistral Small, Mixtral 8x7B Instruct, Mistral 7B Instruct
Stability AI Image SD3 Large 1.0, Stable Image Core 1.0, Stable Image Ultra 1.0, SDXL(1.0)
As you can see from this table, most of the models as of 2024-12-16 became available in 2024.
This is natural as Amazon Bedrock became General Availability (GA) on 2023-09-28, but I can also see a trend where older versions of models provided at the initial GA are gradually phased out as new versions appear.
From this, it is important to identify which models the newly announced models correspond to as successors, and to understand the functions and features of new models, confirm their substitutability, and prepare for migration as early as possible, assuming the replacement of older model versions.

Serverless Models Announced at AWS re:Invent 2024 (Available)

Next, I'll introduce the new Serverless models announced as available at AWS re:Invent 2024.
Although not listed in this table, among the models announced as coming soon at AWS re:Invent 2024, which will be introduced in the next heading, Stable Diffusion 3.5 Large became available on 2024-12-19.

Here's how to read the table:
* List of models by provider and model type (merged for N. Virginia [us-east-1] and Oregon [us-west-2])
* Italics (red) are models available only in N. Virginia [us-east-1]
* Bold (blue) are models available only in Oregon [us-west-2]
Provider Modality Model Name Overview
Cohere Text Rerank 3.5 A model to improve search accuracy in RAG applications. It takes user queries and retrieved document sets as input, re-ranks them based on relevance, and prioritizes the selection of optimal documents as model input to improve the quality of generated responses.
Amazon Text Rerank 1.0 A model to improve search accuracy in RAG applications. It takes user queries and retrieved document sets as input, re-ranks them based on relevance, and prioritizes the selection of optimal documents as model input to improve the quality of generated responses.
Amazon Text Nova Micro A text-only model capable of high-speed processing with minimal latency. Optimized for basic tasks such as summarization, translation, classification, dialogue, and coding with a context length of 128K tokens. Supports fine-tuning.
Amazon Text & Vision Nova Lite A low-cost multimodal model capable of high-speed processing. Generates text from image, video, and text inputs. Can handle inputs up to 300K tokens and analyze multiple images and videos up to 30 minutes. Supports fine-tuning.
Amazon Text & Vision Nova Pro A high-performance multimodal model with excellent balance of accuracy, speed, and cost. Supports inputs up to 300K tokens and achieves top-level performance in visual question answering and video understanding. Capable of executing complex workflows through API and tool integration. Supports fine-tuning.
Amazon Image Nova Canvas A state-of-the-art model capable of high-quality image generation. Features precise control over style and content, inpainting (partial modification), outpainting (image extension), and background removal editing functions. Achieves high performance in evaluating image generation fidelity.
Amazon Video Nova Reel A state-of-the-art model capable of generating professional-quality videos. Enables video generation from text or images, with control over visual style and pacing. Demonstrates excellent performance in video quality and consistency.

Until now, Amazon as a model provider has offered the Amazon Titan series, but in addition to that, the Amazon Nova series has become available, providing text generation from multimodal inputs, image generation with editing functions, and video generation with control over visual style and pacing.
Also, Rerank models (Amazon Rerank, Cohere Rerank) to improve search accuracy in RAG applications have been provided and become available from Amazon and Cohere providers.

Serverless Models Announced at AWS re:Invent 2024 (Coming Soon)

Next, I'll introduce the Serverless models announced as coming soon at AWS re:Invent 2024.
Among these, Stable Diffusion 3.5 Large became available on 2024-12-19.

Provider Modality Model Name Overview
Amazon Text & Vision Nova Premier The top-tier multimodal model for complex inference tasks. Also optimal as a teacher model for knowledge distillation of custom models (knowledge transfer of probability distribution and latent representation of intermediate layers from large-scale models to small-scale models). Scheduled for release in early 2025.
poolside Text malibu A model specialized for complex software engineering challenges such as advanced tasks including code generation, test creation, refactoring, and documentation creation. By collaborating with an assistant, it can be used directly within the developer's IDE, is fine-tuned based on the knowledge base, and has the flexibility to meet organization-specific needs.
poolside Text point A model specialized in rapid code completion that accurately predicts developer needs using advanced context awareness. By collaborating with an assistant, it can be used directly within the developer's IDE, is fine-tuned based on the knowledge base, and has the flexibility to meet organization-specific needs.
Stability AI Image Stable Diffusion 3.5 Large * Became available on 2024-12-19.
The latest high-performance image generation model provided by Stability AI. Capable of generating high-quality and beautiful images from text. Streamlines the creation of concept art, visual effects, and detailed product images.
Luma AI Video Ray 2 The latest video generation model that can generate high-quality videos from text or image prompts in about 10 seconds. Achieves smooth motion, advanced filming techniques, and dynamic camera work, capable of creating videos up to 1 minute long.

Amazon Bedrock Marketplace Model List (as of 2024-12-16)

Finally, I'll introduce the models in the Amazon Bedrock Marketplace.
As mentioned at the beginning, there were 122 types of models in the Amazon Bedrock Marketplace as of 2024-12-16, but the majority of these, 83 types, were models from HuggingFace, an open-source community.
In addition, various providers offer models with unique features, such as industry-specific models for medical and financial sectors.

Provider Category Model Name
HuggingFace Text Generation, Text Summarization, Automatic Speech Recognition, etc. Number of models: 83 types. Main series: BART, Bloom, DBRX, Dolly, EleutherAI GPT, Falcon, Flan-T5, Gemma, Mistral, MPT, Phi, Yi, Zephyr, etc.
Arcee AI Text Generation Arcee[Lite, Nova, SuperNova], Llama Spark, Llama 3.1 SuperNova Lite
Camb.ai Text To Audio MARS6
EvolutionaryScale, PBC Multimodal Generation ESM3-open
Gretel Text Generation Gretel Navigator Tabular
IBM Data and AI Text Generation IBM Granite[8B Code Instruct - 128K, 3B Code Instruct - 128K, 34B Code Instruct - 8K, 20B Code Instruct - 8K], Granite 3.0[8B Instruct, 2B Instruct]
John Snow Labs Text Summarization Medical LLM[Small, Medium]
John Snow Labs Translation Medical Text Translation (EN-ES)
Karakuri, Inc. Text Generation KARAKURI LM 8x7b instruct
LG CNS Text Generation EXAONE_v3.0 7.8B Instruct
Liquidai Text Generation Liquid LFM[40B (L40S), 40B (H100), 40B (A100)]
NCSoft Text Generation Llama-3-Varco-Offsetbias-8B, VARCO LLM KO/EN-13B-IST
NVIDIA Text Generation NVIDIA Nemotron-4 15B NIM Microservice
Preferred Networks, Inc. Text Generation PLaMo API
Stability AI Text To Image Stable Diffusion 3.5 Large
Stockmark Inc. Text Generation Stockmark-LLM-13b
Upstage Text Generation Solar[Pro, Pro – Quant], Solar Mini[Chat, Chat – Quant, Chat ja, Chat ja – Quant]
Widn.AI Translation Widn Tower Sugarloaf, Widn Tower Anthill, Widn Llama3-Tower Vesuvius
Writer Text Generation Writer Palmyra-Med-70B-32K, Writer Palmyra-Fin-70B-32K

References:
Tech Blog with curated related content
Comprehensive Overview of Amazon Bedrock Models 2024(Japanese Presentation) | Speaker Deck
Comprehensive Overview of Amazon Bedrock Models 2024(Japanese Presentation) | Docswell

Summary

In this article, I summarized the overall framework of Amazon Bedrock announcements at AWS re:Invent 2024 and considered an overview of Amazon Bedrock models as of the end of 2024.

For Amazon Bedrock as a whole at AWS re:Invent 2024, I felt that it strongly emphasized directions such as improving reliability and safety assuming enterprise use, enhancing performance and efficiency of the entire Bedrock utilization system, improving user and developer experience and lowering implementation barriers, and supporting a wider range of use cases.

Focusing on the Amazon Bedrock model updates at AWS re:Invent 2024, the major changes were the addition of the Amazon Nova series strengthening Amazon's proprietary AI models including multimodal capabilities, the addition of Rerank models (Amazon Rerank, Cohere Rerank) enabling improved RAG search accuracy, and the emergence of Amazon Bedrock Marketplace making even more diverse models available.

Also, examining Amazon Bedrock models as of the end of 2024 from various angles revealed that the cycle of feature additions and model additions/removals in Amazon Bedrock is rapid, and it's important to consider this in keeping up and building systems.
Especially for systems in the operational phase, I strongly felt the need to constantly try out and understand the latest models, assuming the replacement of currently used models.

I plan to continue keeping up with updates to Amazon Bedrock and AI models, and to try them out regularly to understand their functions, features, and use cases.

Written by Hidekazu Konishi


Copyright © Hidekazu Konishi ( hidekazu-konishi.com ) All Rights Reserved.