Hi MattGohmann, thanks for reaching out! To clarify, max_tokens is the maximum number of tokens the model is allowed to generate for each response message. In other words, max_tokens caps the output tokens, and it is set within AI21's container to max_tokens = 4096. The 256K context window, by contrast, is the maximum number of input tokens, which is why the model is well suited for long-context RAG applications: Understanding Large Language Models Context Windows | Appen, Context Window (LLMs) — Klu
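To make the distinction concrete, here's a minimal sketch of setting max_tokens on a call to a Jamba serverless deployment using the azure-ai-inference Python SDK. The endpoint URL and key are placeholders, and your deployment details may differ:

```python
# pip install azure-ai-inference
from azure.ai.inference import ChatCompletionsClient
from azure.ai.inference.models import SystemMessage, UserMessage
from azure.core.credentials import AzureKeyCredential

# Placeholder endpoint and key for a Jamba serverless deployment.
client = ChatCompletionsClient(
    endpoint="https://<your-deployment>.<region>.models.ai.azure.com",
    credential=AzureKeyCredential("<your-api-key>"),
)

# The prompt (including any RAG context) can use up to the 256K-token
# context window; max_tokens caps only the generated output, up to the
# container's limit of 4096.
response = client.complete(
    messages=[
        SystemMessage(content="You are a helpful assistant."),
        UserMessage(content="Summarize the following long document..."),
    ],
    max_tokens=4096,  # maximum number of output tokens per response
)

print(response.choices[0].message.content)
```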
This might not be entirely clear in our docs or AI21's, so I've flagged the necessary changes on both sites - thanks for inspiring this change!
For more details, see: How to deploy AI21's Jamba family models with Azure AI Studio - Azure AI Studio | Microsoft Learn, Jamba 1.5 (ai21.com)