Tag: model

Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning

In this post, we demonstrate how to use neural architecture search (NAS) based structural pruning to compress a fine-tuned BERT

Imagine you’re facing the following challenge: you want to develop a large language model (LLM) that can proficiently respond to

OpenAI Whisper is an advanced automatic speech recognition (ASR) model with an MIT license. ASR technology finds utility in transcription

Enterprises have access to massive amounts of data, much of which is difficult to discover because the data is unstructured.

This post is co-written with Jayadeep Pabbisetty, Sr. Specialist Data Engineering at Merck, and Prabakaran Mathaiyan, Sr. ML Engineer at

Large language model (LLM) training has surged in popularity over the last year with the release of several popular models

During a chemical reaction, molecules gain energy until they reach what’s known as the transition state — a point of

Research. Published: 14 November 2023. Authors: Remi Lam on behalf of the GraphCast team. Our state-of-the-art model delivers 10-day weather

Research. Published: 28 July 2023. Authors: Yevgen Chebotar, Tianhe Yu. Robotic Transformer 2 (RT-2) is a novel vision-language-action (VLA) model

In this post, we introduce Koala, a chatbot trained by fine-tuning Meta’s LLaMA on dialogue data gathered from the web.