SageMaker Archives - Page 3 of 4

Monitor embedding drift for LLMs deployed from Amazon SageMaker JumpStart

One of the most useful application patterns for generative AI workloads is Retrieval Augmented Generation (RAG). In the RAG pattern,

Data is the foundation to capturing the maximum value from AI technology and solving business problems quickly. To unlock the

In the first part of this three-part series, we presented a solution that demonstrates how you can automate detecting document

With the advent of generative AI, today’s foundation models (FMs), such as the large language models (LLMs) Claude 2 and

When deploying a large language model (LLM), machine learning (ML) practitioners typically care about two measurements for model serving performance:

In this post, we demonstrate how to use neural architecture search (NAS) based structural pruning to compress a fine-tuned BERT

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia

Geospatial data is data about specific locations on the earth’s surface. It can represent a geographical area as a whole

OpenAI Whisper is an advanced automatic speech recognition (ASR) model with an MIT license. ASR technology finds utility in transcription

This post is co-written with Jayadeep Pabbisetty, Sr. Specialist Data Engineering at Merck, and Prabakaran Mathaiyan, Sr. ML Engineer at