\n

Introduction

\n

Phi-3 is a family of small language models (SLMs) developed by Microsoft that delivers exceptional performance and cost-effectiveness. In this tutorial, you will learn how to fine-tune the Phi-3 model and integrate it with Prompt flow. By leveraging Azure Machine Learning, and Prompt flow you will establish a workflow for deploying and utilizing custom AI models. This tutorial is divided into three series:

\n

Series 1: Set up Azure resources and Prepare for fine-tuning

\n

\n
Create Azure Machine Learning workspace: Set up an Azure Machine Learning workspace, which serves as the hub for managing machine learning experiments and models.
\n
\n
Request GPU quotas: Request GPU quotas in your Azure subscription to ensure sufficient resources for model fine-tuning.
\n
\n
Add role assignment: Set up a User Assigned Managed Identity (UAI) and assign it necessary permissions (Contributor, Storage Blob Data Reader, AcrPull) to access resources like storage accounts and container registries.
\n
\n
Set up the project: Create a local environment, set up a virtual environment, install required packages, and create a script (download_dataset.py) to download the dataset (ULTRACHAT_200k) required for fine-tuning.
\n

\n

Series 2: Fine-tune and Deploy the Phi-3 model

\n

\n
Define fine-tuning process: Add code to the fine_tune.py file to define the fine-tuning process, including data loading, preprocessing, and training configurations.
\n
\n
Fine-tune the Phi-3 model: Add code to and run the setup_ml.py file to set up the compute environment, define the fine-tuning job, and submit it to Azure Machine Learning.
\n
\n
Deploy the Fine-tuned model: Once fine-tuning is complete, Add code to the deploy_model.py file to register the fine-tuned model in Azure Machine Learning, create an online endpoint, and deploy the model for real-time inference.
\n

\n

Series 3: Integrate the custom Phi-3 model with Prompt flow

\n

\n
Build Prompt flow: Add code to the flow.dag.yml file to build a flow.
\n
\n
Integrate with Prompt flow: Add code to integrate_with_promptflow file to integrate the custom Phi-3 model with Prompt flow.
\n

\n

Here is an overview of this tutorial.

\n

Note\n

Microsoft has released the Phi-3.5 models, featuring enhanced multi-language support, improved vision capabilities, and advanced Intelligence Mixture of Experts (MOEs). Although this tutorial primarily focuses on Phi-3, you can apply the same steps to fine-tune and integrate the Phi-3.5 model for even better performance. A tip on how to modify the fine_tune.py script to switch to the Phi-3.5 model is included below at Fine-tune the Phi-3 model section.

\n

For more detailed information and to explore additional resources about Phi-3 and Phi-3.5, please visit the Phi-3CookBook.

\n

Prerequisites

\n

Series 1: Set up Azure resources and Prepare for fine-tuning

\n

Create Azure Machine Learning Workspace

\n

In this exercise, you will:

\n

Create an Azure Machine Learning Workspace.

\n

Create an Azure Machine Learning Workspace

\n

\n
Type azure machine learning in the search bar at the top of the portal page and select Azure Machine Learning from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select + Create from the navigation menu.
\n
\n
Select New workspace from the navigation menu.
\n

\n
\n

\n\n

\n
\n
\n
Perform the following tasks:
\n
- Select your Azure Subscription.
- Select the Resource group to use (create a new one if needed).
- Enter Workspace Name. It must be a unique value.
- Select the Region you'd like to use.
- Select the Storage account to use (create a new one if needed).
- Select the Key vault to use (create a new one if needed).
- Select the Application insights to use (create a new one if needed).
- Select the Container registry to use (create a new one if needed).
\n

\n
\n

\n\n

\n
\n
Tip\n
When you create or use a Storage account in Azure Machine Learning, a container named \"azureml\" is automatically created within the Storage account. This container is used for storing model artifacts, training outputs, and other data generated during the machine learning process. In this tutorial, you will use the \"azureml\" container to manage and store all the necessary files and outputs related to our machine learning workflows.
\n
\n
\n

\n
\n
\n
Select Review + Create.
\n
\n
Select Create.
\n

\n

Request GPU quotas in Azure Subscription

\n

In this tutorial, you will learn how to fine-tune and deploy a Phi-3 model, using GPUs. For fine-tuning, you will use the Standard_NC24ads_A100_v4 GPU, which requires a quota request. For deployment, you will use the Standard_E4s_v3 CPU, which does not require a quota request.

\n

Note\n

Only Pay-As-You-Go subscriptions (the standard subscription type) are eligible for GPU allocation; benefit subscriptions are not currently supported.

\n

For those using benefit subscriptions (such as Visual Studio Enterprise Subscription) or those looking to quickly test the fine-tuning and deployment process, this tutorial also provides guidance for fine-tuning with a minimal dataset using a CPU. However, it is important to note that fine-tuning results are significantly better when using a GPU with larger datasets.

\n

In this exercise, you will:

\n

Request GPU Quotas in your Azure Subscription

\n

Request GPU Quotas in Azure Subscription

\n

\n
Visit Azure ML Studio.
\n
\n
Perform the following tasks to request Standard NCADSA100v4 Family quota:
\n
- Select Quota from the left side tab.
- \n
  Select the Virtual machine family to use. For example, select Standard NCADSA100v4 Family Cluster Dedicated vCPUs, which includes the Standard_NC24ads_A100_v4 GPU.
  \n
- \n
  Select the Request quota from the navigation menu.
  \n
  
  \n
  \n
  
  \n\n
  
  \n
  \n
- \n
  Inside the Request quota page, enter the New cores limit you'd like to use. For example, 24.
  \n
- \n
  Inside the Request quota page, select Submit to request the GPU quota.
  \n
\n

\n

Note\n

You can select the appropriate GPU or CPU for your needs by referring to Sizes for Virtual Machines in Azure document.

\n

Add role assignment

\n

To fine-tune and deploy your models, you must first ceate a User Assigned Managed Identity (UAI) and assign it the appropriate permissions. This UAI will be used for authentication during deployment, so it is critical to grant it access to the storage accounts, container registry, and resource group.

\n

In this exercise, you will:

\n

Create User Assigned Managed Identity(UAI).
Add Contributor role assignment to Managed Identity.
Add Storage Blob Data Reader role assignment to Managed Identity.
Add AcrPull role assignment to Managed Identity.

\n

Create User Assigned Managed Identity(UAI)

\n

\n
Type managed identities in the search bar at the top of the portal page and select Managed Identities from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select + Create.
\n

\n
\n

\n\n

\n
\n
\n
Perform the following tasks to navigate to Add role assignment page:
\n
- Select your Azure Subscription.
- Select the Resource group to use (create a new one if needed).
- Select the Region you'd like to use.
- Enter the Name. It must be a unique value.
\n

\n
\n

\n\n

\n
\n
\n
Select Review + create.
\n
\n
Select + Create.
\n

\n

Add Contributor role assignment to Managed Identity

\n

\n
Navigate to the Managed Identity resource that you created.
\n
\n
Select Azure role assignments from the left side tab.
\n
\n
Select +Add role assignment from the navigation menu.
\n
\n
Inside Add role assignment page, Perform the following tasks:
\n
- Select the Scope to Resource group.
- Select your Azure Subscription.
- Select the Resource group to use.
- Select the Role to Contributor.
\n

\n
\n

\n\n

\n
\n
\n
Select Save.
\n

\n

Add Storage Blob Data Reader role assignment to Managed Identity

\n

\n
Type azure storage accounts in the search bar at the top of the portal page and select Storage accounts from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select the storage account that associated with the Azure Machine Learning workspace. For example, finetunephistorage.
\n
\n
Perform the following tasks to navigate to Add role assignment page:
\n
- Navigate to the Azure Storage account that you created.
- Select Access Control (IAM) from the left side tab.
- Select + Add from the navigation menu.
- Select Add role assignment from the navigation menu.
\n

\n
\n

\n
\n

\n
\n
\n
Inside Add role assignment page, Perform the following tasks:
\n
- \n
  Inside the Role page, type Storage Blob Data Reader in the search bar and select Storage Blob Data Reader from the options that appear.
  \n
  
  \n
  \n
  
  \n
  \n
  
  \n
- \n
  Inside the Role page, select Next.
  \n
- \n
  Inside the Members page, select Assign access to Managed identity.
  \n
- \n
  Inside the Members page, select + Select members.
  \n
- \n
  Inside Select managed identities page, select your Azure Subscription.
  \n
- \n
  Inside Select managed identities page, select the Managed identity to Manage Identity.
  \n
- \n
  Inside Select managed identities page, select the Manage Identity that you created. For example, finetunephi-managedidentity.
  \n
- \n
  Inside Select managed identities page, select Select.
  \n
  
  \n
  \n
  
  \n
  \n
  
  \n
- \n
  Select Review + assign.
  \n
\n

\n

Add AcrPull role assignment to Managed Identity

\n

\n
Type container registries in the search bar at the top of the portal page and select Container registries from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select the container registry that associated with the Azure Machine Learning workspace. For example, finetunephicontainerregistries
\n
\n
Perform the following tasks to navigate to Add role assignment page:
\n
- Select Access Control (IAM) from the left side tab.
- Select + Add from the navigation menu.
- Select Add role assignment from the navigation menu.
\n
\n
Inside Add role assignment page, Perform the following tasks:
\n
- Inside the Role page, Type AcrPull in the search bar and select AcrPull from the options that appear.
- Inside the Role page, select Next.
- Inside the Members page, select Assign access to Managed identity.
- Inside the Members page, select + Select members.
- Inside Select managed identities page, select your Azure Subscription.
- Inside Select managed identities page, select the Managed identity to Manage Identity.
- Inside Select managed identities page, select the Manage Identity that you created. For example, finetunephi-managedidentity.
- Inside Select managed identities page, select Select.
- Select Review + assign.
\n

\n

Set up the project and install the libraries

\n

Now, you will create a folder to work in and set up a virtual environment to develop a program.

\n

In this exercise, you will

\n

Create a folder to work inside it.
Create a virtual environment.
Install the required packages.

\n

Create a folder to work inside it

\n

\n
Open a terminal window and type the following command to create a folder named finetune-phi in the default path.

\n
```
mkdir finetune-phi\n
```
\n
\n
Type the following command inside your terminal to navigate to the finetune-phi folder you created.

\n
```
cd finetune-phi\n
```
\n

\n

Create a virtual environment

\n

\n
Type the following command inside your terminal to create a virtual environment named .venv.

\n
```
python -m venv .venv\n
```
\n
\n
Type the following command inside your terminal to activate the virtual environment.

\n
```
.venv\\Scripts\\activate.bat\n
```
\n

\n

Note\n

If it worked, you should see (.venv) before the command prompt.

\n

Install the required packages

\n

Type the following commands inside your terminal to install the required packages.

\n

pip install datasets==2.19.1\npip install transformers==4.41.1\npip install azure-ai-ml==1.16.0\npip install torch==2.3.1\npip install trl==0.9.4\npip install promptflow==1.12.0

\n

Set up project files in Visual Studio Code

\n

In this exercise, you will create the essential files for our project. These files include scripts for downloading the dataset, setting up the Azure Machine Learning environment, fine-tuning the Phi-3 model, and deploying the fine-tuned model. You will also create a conda.yml file to set up the fine-tuning environment.

\n

In this exercise, you will:

\n

Create a download_dataset.py file to download the dataset.
Create a setup_ml.py file to set up the Azure Machine Learning environment.
Create a fine_tune.py file in the finetuning_dir folder to fine-tune the Phi-3 model using the dataset.
Create a conda.yml file to setup fine-tuning environment.
Create a deploy_model.py file to deploy the fine-tuned model.
Create a integrate_with_promptflow.py file, to integrate the fine-tuned model and execute the model using Prompt flow.
Create a flow.dag.yml file, to set up the workflow structure for Prompt flow.
Create a config.py file to enter Azure information.

\n

Note\n

Complete folder structure:

\n

└── YourUserName\n.    └── finetune-phi\n.        ├── finetuning_dir\n.        │      └── fine_tune.py\n.        ├── conda.yml\n.        ├── config.py\n.        ├── deploy_model.py\n.        ├── download_dataset.py\n.        ├── flow.dag.yml\n.        ├── integrate_with_promptflow.py\n.        └── setup_ml.py\n

\n

Create Project Files

\n

\n
Open Visual Studio Code.
\n
\n
Select File from the menu bar.
\n
\n
Select Open Folder.
\n
\n
Select the finetune-phi folder that you created, which is located at C:\\Users\\yourUserName\\finetune-phi.
\n

\n
\n

\n\n

\n
\n
\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named download_dataset.py.
\n
\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named setup_ml.py.
\n
\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named deploy_model.py.
\n

\n
\n

\n\n

\n
\n
\n
In the left pane of Visual Studio Code, right-click and select New Folder to create a new forder named finetuning_dir.
\n
\n
In the finetuning_dir folder, create a new file named fine_tune.py.

\n

\n

Create and Configure conda.yml file

\n

\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named conda.yml.
\n

\n

Add the following code to the conda.yml file to set up the fine-tuning environment for the Phi-3 model.

\n

name: phi-3-training-env\nchannels:\n  - defaults\n  - conda-forge\ndependencies:\n  - python=3.10\n  - pip\n  - numpy<2.0\n  - pip:\n      - torch==2.4.0\n      - torchvision==0.19.0\n      - trl==0.8.6\n      - transformers==4.41\n      - datasets==2.21.0\n      - azureml-core==1.57.0\n      - azure-storage-blob==12.19.0\n      - azure-ai-ml==1.16\n      - azure-identity==1.17.1\n      - accelerate==0.33.0\n      - mlflow==2.15.1\n      - azureml-mlflow==1.57.0

\n

Create and Configure config.py file

\n

\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named config.py.
\n

\n

Add the following code to the config.py file to include your Azure information.

\n

# Azure settings\nAZURE_SUBSCRIPTION_ID = \"your_subscription_id\"\nAZURE_RESOURCE_GROUP_NAME = \"your_resource_group_name\" # \"TestGroup\"\n\n# Azure Machine Learning settings\nAZURE_ML_WORKSPACE_NAME = \"your_workspace_name\" # \"finetunephi-workspace\"\n\n# Azure Managed Identity settings\nAZURE_MANAGED_IDENTITY_CLIENT_ID = \"your_azure_managed_identity_client_id\"\nAZURE_MANAGED_IDENTITY_NAME = \"your_azure_managed_identity_name\" # \"finetunephi-mangedidentity\"\nAZURE_MANAGED_IDENTITY_RESOURCE_ID = f\"/subscriptions/{AZURE_SUBSCRIPTION_ID}/resourceGroups/{AZURE_RESOURCE_GROUP_NAME}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{AZURE_MANAGED_IDENTITY_NAME}\"\n\n# Dataset file paths\nTRAIN_DATA_PATH = \"data/train_data.jsonl\"\nTEST_DATA_PATH = \"data/test_data.jsonl\"\n\n# Fine-tuned model settings\nAZURE_MODEL_NAME = \"your_fine_tuned_model_name\" # \"finetune-phi-model\"\nAZURE_ENDPOINT_NAME = \"your_fine_tuned_model_endpoint_name\" # \"finetune-phi-endpoint\"\nAZURE_DEPLOYMENT_NAME = \"your_fine_tuned_model_deployment_name\" # \"finetune-phi-deployment\"\n\nAZURE_ML_API_KEY = \"your_fine_tuned_model_api_key\"\nAZURE_ML_ENDPOINT = \"your_fine_tuned_model_endpoint_uri\" # \"https://{your-endpoint-name}.{your-region}.inference.ml.azure.com/score\"

\n

Add Azure Environment Variables

\n

\n
Perform the following tasks to add the Azure Subscription ID:
\n
- Type subscriptions in the search bar at the top of the portal page and select Subscriptions from the options that appear.
  
  \n
  \n
  
  \n\n
  
  \n
  \n
- Select the Azure Subscription you are currently using.
- Copy and paste your Subscription ID into the config.py file.
\n

\n
\n
Perform the following tasks to add the Azure Workspace Name:
\n
- Navigate to the Azure Machine Learning resource that you created.
- Copy and paste your account name into the config.py file.
\n

\n
\n
Perform the following tasks to add the Azure Resource Group Name:
\n
- Navigate to the Azure Machine Learning resource that you created.
- Copy and paste your Azure Resource Group Name into the config.py file.
\n

\n
\n
Perform the following tasks to add the Azure Managed Identity name
\n
- Navigate to the Managed Identities resource that you created.
- Copy and paste your Azure Managed Identity name into the config.py file.
\n

\n

\n

Prepare Dataset for Fine-tuning

\n

In this exercise, you will run the download_dataset.py file to download the ultrachat_200k datasets to your local environment. You will then use this datasets to fine-tune the Phi-3 model in Azure Machine Learning.

\n

In this exercise, you will:

\n

Add code to the download_dataset.py file to download the datasets.
Run the download_dataset.py file to download datasets to your local environment.

\n

Download your dataset using download_dataset.py

\n

\n
Open the download_dataset.py file in Visual Studio Code.
\n

\n

Add the following code into download_dataset.py.

\n

import json\nimport os\nfrom datasets import load_dataset\nfrom config import (\n    TRAIN_DATA_PATH,\n    TEST_DATA_PATH)\n\ndef load_and_split_dataset(dataset_name, config_name, split_ratio):\n    \"\"\"\n    Load and split a dataset.\n    \"\"\"\n    # Load the dataset with the specified name, configuration, and split ratio\n    dataset = load_dataset(dataset_name, config_name, split=split_ratio)\n    print(f\"Original dataset size: {len(dataset)}\")\n    \n    # Split the dataset into train and test sets (80% train, 20% test)\n    split_dataset = dataset.train_test_split(test_size=0.2)\n    print(f\"Train dataset size: {len(split_dataset['train'])}\")\n    print(f\"Test dataset size: {len(split_dataset['test'])}\")\n    \n    return split_dataset\n\ndef save_dataset_to_jsonl(dataset, filepath):\n    \"\"\"\n    Save a dataset to a JSONL file.\n    \"\"\"\n    # Create the directory if it does not exist\n    os.makedirs(os.path.dirname(filepath), exist_ok=True)\n    \n    # Open the file in write mode\n    with open(filepath, 'w', encoding='utf-8') as f:\n        # Iterate over each record in the dataset\n        for record in dataset:\n            # Dump the record as a JSON object and write it to the file\n            json.dump(record, f)\n            # Write a newline character to separate records\n            f.write('\\n')\n    \n    print(f\"Dataset saved to {filepath}\")\n\ndef main():\n    \"\"\"\n    Main function to load, split, and save the dataset.\n    \"\"\"\n    # Load and split the ULTRACHAT_200k dataset with a specific configuration and split ratio\n    dataset = load_and_split_dataset(\"HuggingFaceH4/ultrachat_200k\", 'default', 'train_sft[:1%]')\n    \n    # Extract the train and test datasets from the split\n    train_dataset = dataset['train']\n    test_dataset = dataset['test']\n\n    # Save the train dataset to a JSONL file\n    save_dataset_to_jsonl(train_dataset, TRAIN_DATA_PATH)\n    \n    # Save the test dataset to a separate JSONL file\n    save_dataset_to_jsonl(test_dataset, TEST_DATA_PATH)\n\nif __name__ == \"__main__\":\n    main()\n

\n

Tip\n

Guidance for fine-tuning with a minimal dataset using a CPU

\n

If you want to use a CPU for fine-tuning, this approach is ideal for those with benefit subscriptions (such as Visual Studio Enterprise Subscription) or to quickly test the fine-tuning and deployment process.

\n

Replace dataset = load_and_split_dataset(\"HuggingFaceH4/ultrachat_200k\", 'default', 'train_sft[:1%]') with dataset = load_and_split_dataset(\"HuggingFaceH4/ultrachat_200k\", 'default', 'train_sft[:10]')

\n

\n
Type the following command inside your terminal to run the script and download the dataset to your local environment.

\n
```
python download_dataset.py\n
```
\n
\n
Verify that the datasets were saved successfully to your local finetune-phi/data directory.

\n

\n

Note\n

Note on dataset size and fine-tuning time

\n

In this tutorial, you use only 1% of the dataset (train_sft[:1%]). This significantly reduces the amount of data, speeding up both the upload and fine-tuning processes. You can adjust the percentage to find the right balance between training time and model performance. Using a smaller subset of the dataset reduces the time required for fine-tuning, making the process more manageable for a tutorial.

\n

Series 2: Fine-tune and Deploy the Phi-3 model

\n

Fine-tune the Phi-3 model

\n

In this exercise, you will fine-tune the Phi-3 model using the provided dataset. First, you will define the fine-tuning process in the fine_tune.py file. Then, you will configure the Azure Machine Learning environment and initiate the fine-tuning process by running the setup_ml.py file. This script ensures that the fine-tuning occurs within the Azure Machine Learning environment.

\n

By running setup_ml.py, you will run the fine-tuning process in the Azure Machine Learning environment.

\n

In this exercise, you will:

\n

Set up Azure CLI to authenticate environment
Add code to the fine_tune.py file to fine-tune the model.
Add code to and run the setup_ml.py file to initiate the fine-tuning process in Azure Machine Learning.
Run the setup_ml.py file to fine-tune the Phi-3 model using Azure Machine Learning.

\n

Set up Azure CLI

\n

You need to set up Azure CLI to authenticate your environment. Azure CLI allows you to manage Azure resources directly from the command line and provides the credentials necessary for Azure Machine Learning to access these resources. To get started install Azure CLI

\n

\n
Open a terminal window and type the following command to log in to your Azure account.

\n
```
az login\n
```
\n
\n
Select your Azure account to use.
\n
\n
Select your Azure subscription to use.
\n

\n
\n

\n\n

\n
\n

\n

Tip\n

Having trouble signing in to Azure? Try using a device code

\n

Open a terminal window and type the following command to log in to your Azure account.

\n
```
az login --use-device-code
```
\n

\n

Visit the website displayed in the terminal window and enter the provided code on that site.

\n

Inside the website, select Next.

\n

\n
Inside the website, select the account to use in this tutorial

\n

\n
Inside the website, select continue to complete login.
After successful login, go back to your terminal and select your Azure subscription to use.

\n

\n

\n

Add code to the fine_tune.py file

\n

\n
Navigate to the finetuning_dir folder and Open the fine_tune.py file in Visual Studio Code.
\n

\n

Add the following code into fine_tune.py.

\n

import argparse\nimport sys\nimport logging\nimport os\nfrom datasets import load_dataset\nimport torch\nimport mlflow\nfrom transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments\nfrom trl import SFTTrainer\n\n# To avoid the INVALID_PARAMETER_VALUE error in MLflow, disable MLflow integration\nos.environ[\"DISABLE_MLFLOW_INTEGRATION\"] = \"True\"\n\n# Logging setup\nlogging.basicConfig(\n    format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n    datefmt=\"%Y-%m-%d %H:%M:%S\",\n    handlers=[logging.StreamHandler(sys.stdout)],\n    level=logging.WARNING\n)\nlogger = logging.getLogger(__name__)\n\ndef initialize_model_and_tokenizer(model_name, model_kwargs):\n    \"\"\"\n    Initialize the model and tokenizer with the given pretrained model name and arguments.\n    \"\"\"\n    model = AutoModelForCausalLM.from_pretrained(model_name, **model_kwargs)\n    tokenizer = AutoTokenizer.from_pretrained(model_name)\n    tokenizer.model_max_length = 2048\n    tokenizer.pad_token = tokenizer.unk_token\n    tokenizer.pad_token_id = tokenizer.convert_tokens_to_ids(tokenizer.pad_token)\n    tokenizer.padding_side = 'right'\n    return model, tokenizer\n\ndef apply_chat_template(example, tokenizer):\n    \"\"\"\n    Apply a chat template to tokenize messages in the example.\n    \"\"\"\n    messages = example[\"messages\"]\n    if messages[0][\"role\"] != \"system\":\n        messages.insert(0, {\"role\": \"system\", \"content\": \"\"})\n    example[\"text\"] = tokenizer.apply_chat_template(\n        messages, tokenize=False, add_generation_prompt=False\n    )\n    return example\n\ndef load_and_preprocess_data(train_filepath, test_filepath, tokenizer):\n    \"\"\"\n    Load and preprocess the dataset.\n    \"\"\"\n    train_dataset = load_dataset('json', data_files=train_filepath, split='train')\n    test_dataset = load_dataset('json', data_files=test_filepath, split='train')\n    column_names = list(train_dataset.features)\n\n    train_dataset = train_dataset.map(\n        apply_chat_template,\n        fn_kwargs={\"tokenizer\": tokenizer},\n        num_proc=10,\n        remove_columns=column_names,\n        desc=\"Applying chat template to train dataset\",\n    )\n\n    test_dataset = test_dataset.map(\n        apply_chat_template,\n        fn_kwargs={\"tokenizer\": tokenizer},\n        num_proc=10,\n        remove_columns=column_names,\n        desc=\"Applying chat template to test dataset\",\n    )\n\n    return train_dataset, test_dataset\n\ndef train_and_evaluate_model(train_dataset, test_dataset, model, tokenizer, output_dir):\n    \"\"\"\n    Train and evaluate the model.\n    \"\"\"\n    training_args = TrainingArguments(\n        bf16=True,\n        do_eval=True,\n        output_dir=output_dir,\n        eval_strategy=\"epoch\",\n        learning_rate=5.0e-06,\n        logging_steps=20,\n        lr_scheduler_type=\"cosine\",\n        num_train_epochs=3,\n        overwrite_output_dir=True,\n        per_device_eval_batch_size=4,\n        per_device_train_batch_size=4,\n        remove_unused_columns=True,\n        save_steps=500,\n        seed=0,\n        gradient_checkpointing=True,\n        gradient_accumulation_steps=1,\n        warmup_ratio=0.2,\n    )\n\n    trainer = SFTTrainer(\n        model=model,\n        args=training_args,\n        train_dataset=train_dataset,\n        eval_dataset=test_dataset,\n        max_seq_length=2048,\n        dataset_text_field=\"text\",\n        tokenizer=tokenizer,\n        packing=True\n    )\n\n    train_result = trainer.train()\n    trainer.log_metrics(\"train\", train_result.metrics)\n\n    mlflow.transformers.log_model(\n        transformers_model={\"model\": trainer.model, \"tokenizer\": tokenizer},\n        artifact_path=output_dir,\n    )\n\n    tokenizer.padding_side = 'left'\n    eval_metrics = trainer.evaluate()\n    eval_metrics[\"eval_samples\"] = len(test_dataset)\n    trainer.log_metrics(\"eval\", eval_metrics)\n\ndef main(train_file, eval_file, model_output_dir):\n    \"\"\"\n    Main function to fine-tune the model.\n    \"\"\"\n    model_kwargs = {\n        \"use_cache\": False,\n        \"trust_remote_code\": True,\n        \"torch_dtype\": torch.bfloat16,\n        \"device_map\": None,\n        \"attn_implementation\": \"eager\"\n    }\n    \n    pretrained_model_name = \"microsoft/Phi-3.5-mini-instruct\"\n    # pretrained_model_name = \"microsoft/Phi-3-mini-4k-instruct\"\n\n    with mlflow.start_run():\n        model, tokenizer = initialize_model_and_tokenizer(pretrained_model_name, model_kwargs)\n        train_dataset, test_dataset = load_and_preprocess_data(train_file, eval_file, tokenizer)\n        train_and_evaluate_model(train_dataset, test_dataset, model, tokenizer, model_output_dir)\n\nif __name__ == \"__main__\":\n    parser = argparse.ArgumentParser()\n    parser.add_argument(\"--train-file\", type=str, required=True, help=\"Path to the training data\")\n    parser.add_argument(\"--eval-file\", type=str, required=True, help=\"Path to the evaluation data\")\n    parser.add_argument(\"--model_output_dir\", type=str, required=True, help=\"Directory to save the fine-tuned model\")\n    args = parser.parse_args()\n    main(args.train_file, args.eval_file, args.model_output_dir)\n

\n

\n
Save and close the fine_tune.py file.
\n

\n

Tip\n

You can fine-tune Phi-3.5 model

\n

In fine_tune.py file, you can change the pretrained_model_name from \"microsoft/Phi-3-mini-4k-instruct\" to any model you want to fine-tune. For example, if you change it to \"microsoft/Phi-3.5-mini-instruct\", you'll be using the Phi-3.5-mini-instruct model for fine-tuning. To find and use the model name you prefer, visit Hugging Face, search for the model you're interested in, and then copy and paste its name into the pretrained_model_name field in your script.

\n

Add code to the setup_ml.py file

\n

\n
Open the setup_ml.py file in Visual Studio Code.
\n

\n

Add the following code into setup_ml.py.

\n

import logging\nfrom azure.ai.ml import MLClient, command, Input\nfrom azure.ai.ml.entities import Environment, AmlCompute\nfrom azure.identity import AzureCliCredential\nfrom config import (\n    AZURE_SUBSCRIPTION_ID,\n    AZURE_RESOURCE_GROUP_NAME,\n    AZURE_ML_WORKSPACE_NAME,\n    TRAIN_DATA_PATH,\n    TEST_DATA_PATH\n)\n\n# Constants\n\n# Uncomment the following lines to use a CPU instance for training\n# COMPUTE_INSTANCE_TYPE = \"Standard_E16s_v3\" # cpu\n# COMPUTE_NAME = \"cpu-e16s-v3\"\n# DOCKER_IMAGE_NAME = \"mcr.microsoft.com/azureml/openmpi4.1.0-ubuntu20.04:latest\"\n\n# Uncomment the following lines to use a GPU instance for training\nCOMPUTE_INSTANCE_TYPE = \"Standard_NC24ads_A100_v4\"\nCOMPUTE_NAME = \"gpu-nc24s-a100-v4\"\nDOCKER_IMAGE_NAME = \"mcr.microsoft.com/azureml/curated/acft-hf-nlp-gpu:59\"\n\nCONDA_FILE = \"conda.yml\"\nLOCATION = \"eastus2\" # Replace with the location of your compute cluster\nFINETUNING_DIR = \"./finetuning_dir\" # Path to the fine-tuning script\nTRAINING_ENV_NAME = \"phi-3-training-environment\" # Name of the training environment\nMODEL_OUTPUT_DIR = \"./model_output\" # Path to the model output directory in azure ml\n\n# Logging setup to track the process\nlogger = logging.getLogger(__name__)\nlogging.basicConfig(\n    format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n    datefmt=\"%Y-%m-%d %H:%M:%S\",\n    level=logging.WARNING\n)\n\ndef get_ml_client():\n    \"\"\"\n    Initialize the ML Client using Azure CLI credentials.\n    \"\"\"\n    credential = AzureCliCredential()\n    return MLClient(credential, AZURE_SUBSCRIPTION_ID, AZURE_RESOURCE_GROUP_NAME, AZURE_ML_WORKSPACE_NAME)\n\ndef create_or_get_environment(ml_client):\n    \"\"\"\n    Create or update the training environment in Azure ML.\n    \"\"\"\n    env = Environment(\n        image=DOCKER_IMAGE_NAME,  # Docker image for the environment\n        conda_file=CONDA_FILE,  # Conda environment file\n        name=TRAINING_ENV_NAME,  # Name of the environment\n    )\n    return ml_client.environments.create_or_update(env)\n\ndef create_or_get_compute_cluster(ml_client, compute_name, COMPUTE_INSTANCE_TYPE, location):\n    \"\"\"\n    Create or update the compute cluster in Azure ML.\n    \"\"\"\n    try:\n        compute_cluster = ml_client.compute.get(compute_name)\n        logger.info(f\"Compute cluster '{compute_name}' already exists. Reusing it for the current run.\")\n    except Exception:\n        logger.info(f\"Compute cluster '{compute_name}' does not exist. Creating a new one with size {COMPUTE_INSTANCE_TYPE}.\")\n        compute_cluster = AmlCompute(\n            name=compute_name,\n            size=COMPUTE_INSTANCE_TYPE,\n            location=location,\n            tier=\"Dedicated\",  # Tier of the compute cluster\n            min_instances=0,  # Minimum number of instances\n            max_instances=1  # Maximum number of instances\n        )\n        ml_client.compute.begin_create_or_update(compute_cluster).wait()  # Wait for the cluster to be created\n    return compute_cluster\n\ndef create_fine_tuning_job(env, compute_name):\n    \"\"\"\n    Set up the fine-tuning job in Azure ML.\n    \"\"\"\n    return command(\n        code=FINETUNING_DIR,  # Path to fine_tune.py\n        command=(\n            \"python fine_tune.py \"\n            \"--train-file ${{inputs.train_file}} \"\n            \"--eval-file ${{inputs.eval_file}} \"\n            \"--model_output_dir ${{inputs.model_output}}\"\n        ),\n        environment=env,  # Training environment\n        compute=compute_name,  # Compute cluster to use\n        inputs={\n            \"train_file\": Input(type=\"uri_file\", path=TRAIN_DATA_PATH),  # Path to the training data file\n            \"eval_file\": Input(type=\"uri_file\", path=TEST_DATA_PATH),  # Path to the evaluation data file\n            \"model_output\": MODEL_OUTPUT_DIR\n        }\n    )\n\ndef main():\n    \"\"\"\n    Main function to set up and run the fine-tuning job in Azure ML.\n    \"\"\"\n    # Initialize ML Client\n    ml_client = get_ml_client()\n\n    # Create Environment\n    env = create_or_get_environment(ml_client)\n    \n    # Create or get existing compute cluster\n    create_or_get_compute_cluster(ml_client, COMPUTE_NAME, COMPUTE_INSTANCE_TYPE, LOCATION)\n\n    # Create and Submit Fine-Tuning Job\n    job = create_fine_tuning_job(env, COMPUTE_NAME)\n    returned_job = ml_client.jobs.create_or_update(job)  # Submit the job\n    ml_client.jobs.stream(returned_job.name)  # Stream the job logs\n    \n    # Capture the job name\n    job_name = returned_job.name\n    print(f\"Job name: {job_name}\")\n\nif __name__ == \"__main__\":\n    main()\n

\n

Replace COMPUTE_INSTANCE_TYPE, COMPUTE_NAME, and LOCATION with your specific details.

\n

# Uncomment the following lines to use a GPU instance for training\nCOMPUTE_INSTANCE_TYPE = \"Standard_NC24ads_A100_v4\"\nCOMPUTE_NAME = \"gpu-nc24s-a100-v4\"\n...\nLOCATION = \"eastus2\" # Replace with the location of your compute cluster\n

\n

Tip\n

Guidance for fine-tuning with a minimal dataset using a CPU

\n

If you want to use a CPU for fine-tuning, this approach is ideal for those with benefit subscriptions (such as Visual Studio Enterprise Subscription) or to quickly test the fine-tuning and deployment process.

\n

Open the setup_ml file.
Replace COMPUTE_INSTANCE_TYPE, COMPUTE_NAME, and DOCKER_IMAGE_NAME with the following. If you do not have access to Standard_E16s_v3, you can use an equivalent CPU instance or request a new quota.
Replace LOCATION with your specific details.

\n

# Uncomment the following lines to use a CPU instance for training\nCOMPUTE_INSTANCE_TYPE = \"Standard_E16s_v3\" # cpu\nCOMPUTE_NAME = \"cpu-e16s-v3\"\nDOCKER_IMAGE_NAME = \"mcr.microsoft.com/azureml/openmpi4.1.0-ubuntu20.04:latest\"\nLOCATION = \"eastus2\" # Replace with the location of your compute cluster

\n

\n
Type the following command to run the setup_ml.py script and start the fine-tuning process in Azure Machine Learning.

\n
```
python setup_ml.py\n
```
\n
\n
In this exercise, you successfully fine-tuned the Phi-3 model using Azure Machine Learning. By running the setup_ml.py script, you have set up the Azure Machine Learning environment and initiated the fine-tuning process defined in fine_tune.py file. Please note that the fine-tuning process can take a considerable amount of time. After running the python setup_ml.py command, you need to wait for the process to complete. You can monitor the status of the fine-tuning job by following the link provided in the terminal to the Azure Machine Learning portal. In the next series, you will deploy the fine-tuned model and integrate it with Prompt flow.

\n\n

\n

\n

Deploy the fine-tuned model

\n

To integrate the fine-tuned Phi-3 model with Prompt Flow, you need to deploy the model to make it accessible for real-time inference. This process involves registering the model, creating an online endpoint, and deploying the model.

\n

In this exercise, you will:

\n

Set the model name, endpoint name, and deployment name for deployment.
Register the fine-tuned model in the Azure Machine Learning workspace.
Create an online endpoint.
Deploy the registered fine-tuned Phi-3 model.

\n

Set the model name, endpoint name, and deployment name for deployment

\n

\n
Open config.py file.
\n
\n
Replace AZURE_MODEL_NAME = \"your_fine_tuned_model_name\" with the desired name for your model.
\n
\n
Replace AZURE_ENDPOINT_NAME = \"your_fine_tuned_model_endpoint_name\" with the desired name for your endpoint.
\n
\n
Replace AZURE_DEPLOYMENT_NAME = \"your_fine_tuned_model_deployment_name\" with the desired name for your deployment.
\n

\n

Deploy the fine-tuned model

\n

Running the deploy_model.py file automates the entire deployment process. It registers the model, creates an endpoint, and executes the deployment based on the settings specified in the config.py file, which includes the model name, endpoint name, and deployment name.

\n

\n
Open the deploy_model.py file in Visual Studio Code.
\n

\n

Add the following code into deploy_model.py.

\n

import logging\nfrom azure.identity import AzureCliCredential\nfrom azure.ai.ml import MLClient\nfrom azure.ai.ml.entities import Model, ProbeSettings, ManagedOnlineEndpoint, ManagedOnlineDeployment, IdentityConfiguration, ManagedIdentityConfiguration, OnlineRequestSettings\nfrom azure.ai.ml.constants import AssetTypes\n\n# Configuration imports\nfrom config import (\n    AZURE_SUBSCRIPTION_ID,\n    AZURE_RESOURCE_GROUP_NAME,\n    AZURE_ML_WORKSPACE_NAME,\n    AZURE_MANAGED_IDENTITY_RESOURCE_ID,\n    AZURE_MANAGED_IDENTITY_CLIENT_ID,\n    AZURE_MODEL_NAME,\n    AZURE_ENDPOINT_NAME,\n    AZURE_DEPLOYMENT_NAME\n)\n\n# Constants\nJOB_NAME = \"your-job-name\"\nCOMPUTE_INSTANCE_TYPE = \"Standard_E4s_v3\"\n\ndeployment_env_vars = {\n    \"SUBSCRIPTION_ID\": AZURE_SUBSCRIPTION_ID,\n    \"RESOURCE_GROUP_NAME\": AZURE_RESOURCE_GROUP_NAME,\n    \"UAI_CLIENT_ID\": AZURE_MANAGED_IDENTITY_CLIENT_ID,\n}\n\n# Logging setup\nlogging.basicConfig(\n    format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n    datefmt=\"%Y-%m-%d %H:%M:%S\",\n    level=logging.DEBUG\n)\nlogger = logging.getLogger(__name__)\n\ndef get_ml_client():\n    \"\"\"Initialize and return the ML Client.\"\"\"\n    credential = AzureCliCredential()\n    return MLClient(credential, AZURE_SUBSCRIPTION_ID, AZURE_RESOURCE_GROUP_NAME, AZURE_ML_WORKSPACE_NAME)\n\ndef register_model(ml_client, model_name, job_name):\n    \"\"\"Register a new model.\"\"\"\n    model_path = f\"azureml://jobs/{job_name}/outputs/artifacts/paths/model_output\"\n    logger.info(f\"Registering model {model_name} from job {job_name} at path {model_path}.\")\n    run_model = Model(\n        path=model_path,\n        name=model_name,\n        description=\"Model created from run.\",\n        type=AssetTypes.MLFLOW_MODEL,\n    )\n    model = ml_client.models.create_or_update(run_model)\n    logger.info(f\"Registered model ID: {model.id}\")\n    return model\n\ndef delete_existing_endpoint(ml_client, endpoint_name):\n    \"\"\"Delete existing endpoint if it exists.\"\"\"\n    try:\n        endpoint_result = ml_client.online_endpoints.get(name=endpoint_name)\n        logger.info(f\"Deleting existing endpoint {endpoint_name}.\")\n        ml_client.online_endpoints.begin_delete(name=endpoint_name).result()\n        logger.info(f\"Deleted existing endpoint {endpoint_name}.\")\n    except Exception as e:\n        logger.info(f\"No existing endpoint {endpoint_name} found to delete: {e}\")\n\ndef create_or_update_endpoint(ml_client, endpoint_name, description=\"\"):\n    \"\"\"Create or update an endpoint.\"\"\"\n    delete_existing_endpoint(ml_client, endpoint_name)\n    logger.info(f\"Creating new endpoint {endpoint_name}.\")\n    endpoint = ManagedOnlineEndpoint(\n        name=endpoint_name,\n        description=description,\n        identity=IdentityConfiguration(\n            type=\"user_assigned\",\n            user_assigned_identities=[ManagedIdentityConfiguration(resource_id=AZURE_MANAGED_IDENTITY_RESOURCE_ID)]\n        )\n    )\n    endpoint_result = ml_client.online_endpoints.begin_create_or_update(endpoint).result()\n    logger.info(f\"Created new endpoint {endpoint_name}.\")\n    return endpoint_result\n\ndef create_or_update_deployment(ml_client, endpoint_name, deployment_name, model):\n    \"\"\"Create or update a deployment.\"\"\"\n\n    logger.info(f\"Creating deployment {deployment_name} for endpoint {endpoint_name}.\")\n    deployment = ManagedOnlineDeployment(\n        name=deployment_name,\n        endpoint_name=endpoint_name,\n        model=model.id,\n        instance_type=COMPUTE_INSTANCE_TYPE,\n        instance_count=1,\n        environment_variables=deployment_env_vars,\n        request_settings=OnlineRequestSettings(\n            max_concurrent_requests_per_instance=3,\n            request_timeout_ms=180000,\n            max_queue_wait_ms=120000\n        ),\n        liveness_probe=ProbeSettings(\n            failure_threshold=30,\n            success_threshold=1,\n            period=100,\n            initial_delay=500,\n        ),\n        readiness_probe=ProbeSettings(\n            failure_threshold=30,\n            success_threshold=1,\n            period=100,\n            initial_delay=500,\n        ),\n    )\n    deployment_result = ml_client.online_deployments.begin_create_or_update(deployment).result()\n    logger.info(f\"Created deployment {deployment.name} for endpoint {endpoint_name}.\")\n    return deployment_result\n\ndef set_traffic_to_deployment(ml_client, endpoint_name, deployment_name):\n    \"\"\"Set traffic to the specified deployment.\"\"\"\n    try:\n        # Fetch the current endpoint details\n        endpoint = ml_client.online_endpoints.get(name=endpoint_name)\n        \n        # Log the current traffic allocation for debugging\n        logger.info(f\"Current traffic allocation: {endpoint.traffic}\")\n        \n        # Set the traffic allocation for the deployment\n        endpoint.traffic = {deployment_name: 100}\n        \n        # Update the endpoint with the new traffic allocation\n        endpoint_poller = ml_client.online_endpoints.begin_create_or_update(endpoint)\n        updated_endpoint = endpoint_poller.result()\n        \n        # Log the updated traffic allocation for debugging\n        logger.info(f\"Updated traffic allocation: {updated_endpoint.traffic}\")\n        logger.info(f\"Set traffic to deployment {deployment_name} at endpoint {endpoint_name}.\")\n        return updated_endpoint\n    except Exception as e:\n        # Log any errors that occur during the process\n        logger.error(f\"Failed to set traffic to deployment: {e}\")\n        raise\n\n\ndef main():\n    ml_client = get_ml_client()\n\n    registered_model = register_model(ml_client, AZURE_MODEL_NAME, JOB_NAME)\n    logger.info(f\"Registered model ID: {registered_model.id}\")\n\n    endpoint = create_or_update_endpoint(ml_client, AZURE_ENDPOINT_NAME, \"Endpoint for finetuned Phi-3 model\")\n    logger.info(f\"Endpoint {AZURE_ENDPOINT_NAME} is ready.\")\n\n    try:\n        deployment = create_or_update_deployment(ml_client, AZURE_ENDPOINT_NAME, AZURE_DEPLOYMENT_NAME, registered_model)\n        logger.info(f\"Deployment {AZURE_DEPLOYMENT_NAME} is created for endpoint {AZURE_ENDPOINT_NAME}.\")\n\n        set_traffic_to_deployment(ml_client, AZURE_ENDPOINT_NAME, AZURE_DEPLOYMENT_NAME)\n        logger.info(f\"Traffic is set to deployment {AZURE_DEPLOYMENT_NAME} at endpoint {AZURE_ENDPOINT_NAME}.\")\n    except Exception as e:\n        logger.error(f\"Failed to create or update deployment: {e}\")\n\nif __name__ == \"__main__\":\n    main()\n

\n

\n
Perform the following tasks to get the JOB_NAME:
\n
- Navigate to Azure Machine Learning resource that you created.
- Select Studio web URL to open the Azure Machine Learning workspace.
- Select Jobs from the left side tab.
- Select the experiment for fine-tuning. For example, finetunephi.
- Select the job that you created.
- Copy and paste your job Name into the JOB_NAME = \"your-job-name\" in deploy_model.py file.
\n
\n
Replace COMPUTE_INSTANCE_TYPE with your specific details.
\n
\n
Type the following command to run the deploy_model.py script and start the deployment process in Azure Machine Learning.

\n
```
python deploy_model.py
```
\n

\n

Warning\n

To avoid additional charges to your account, make sure to delete the created endpoint in the Azure Machine Learning workspace.

\n

Check deployment status in Azure Machine Learning Workspace

\n

\n
Visit Azure ML Studio.
\n
\n
Navigate to Azure Machine Learning workspace that you created.
\n
\n
Select Studio web URL to open the Azure Machine Learning workspace.
\n
Select Endpoints from the left side tab.\n

\n
\n

\n\n

\n
\n
\n
Select endpoint that you created.
\n

\n
\n

\n\n

\n
\n
\n
On this page, you can manage the endpoints created during the deployment process.

\n

\n

Series 3: Integrate the custom Phi-3 model with Prompt flow

\n

Integrate the custom Phi-3 model with Prompt Flow

\n

After successfully deploying your fine-tuned model, you can now integrate it with Prompt Flow to use your model in real-time applications, enabling a variety of interactive tasks with your custom Phi-3 model.

\n

In this exercise, you will:

\n

Set api key and endpoint uri of the fine-tuned Phi-3 model.
Add code to the flow.dag.yml file.
Add code to the integrate_with_promptflow.py file.
Test your custom Phi-3 model on Prompt flow.

\n

Set api key and endpoint uri of the fine-tuned Phi-3 model

\n

\n
Navigate to the Azure Machine learning workspace that you created.
\n
\n
Select Endpoints from the left side tab.
\n

\n
\n

\n\n

\n
\n
\n
Select endpoint that you created.
\n

\n
\n

\n\n

\n
\n
\n
Select Consume from the navigation menu.
\n
\n
Copy and paste your REST endpoint into the config.py file, replacing AZURE_ML_ENDPOINT = \"your_fine_tuned_model_endpoint_uri\" with your REST endpoint.
\n
\n
Copy and paste your Primary key into the config.py file, replacing AZURE_ML_API_KEY = \"your_fine_tuned_model_api_key\" with your Primary key.
\n

\n
\n

\n\n

\n
\n

\n

Add code to the flow.dag.yml file

\n

\n
Open the flow.dag.yml file in Visual Studio Code.
\n

\n

Add the following code into flow.dag.yml.

\n

inputs:\n  input_data:\n    type: string\n    default: \"Who founded Microsoft?\"\n\noutputs:\n  answer:\n    type: string\n    reference: ${integrate_with_promptflow.output}\n\nnodes:\n- name: integrate_with_promptflow\n  type: python\n  source:\n    type: code\n    path: integrate_with_promptflow.py\n  inputs:\n    input_data: ${inputs.input_data}\n

\n

Add code to the integrate_with_promptflow.py file

\n

\n
Open the integrate_with_promptflow.py file in Visual Studio Code.
\n

\n

Add the following code into integrate_with_promptflow.py.

\n

import logging\nimport requests\nfrom promptflow.core import tool\nimport asyncio\nimport platform\nfrom config import (\n    AZURE_ML_ENDPOINT,\n    AZURE_ML_API_KEY\n)\n\n# Logging setup\nlogging.basicConfig(\n    format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n    datefmt=\"%Y-%m-%d %H:%M:%S\",\n    level=logging.DEBUG\n)\nlogger = logging.getLogger(__name__)\n\ndef query_azml_endpoint(input_data: list, endpoint_url: str, api_key: str) -> str:\n    \"\"\"\n    Send a request to the Azure ML endpoint with the given input data.\n    \"\"\"\n    headers = {\n        \"Content-Type\": \"application/json\",\n        \"Authorization\": f\"Bearer {api_key}\"\n    }\n    data = {\n        \"input_data\": [input_data],\n        \"params\": {\n            \"temperature\": 0.7,\n            \"max_new_tokens\": 128,\n            \"do_sample\": True,\n            \"return_full_text\": True\n        }\n    }\n    try:\n        response = requests.post(endpoint_url, json=data, headers=headers)\n        response.raise_for_status()\n        result = response.json()[0]\n        logger.info(\"Successfully received response from Azure ML Endpoint.\")\n        return result\n    except requests.exceptions.RequestException as e:\n        logger.error(f\"Error querying Azure ML Endpoint: {e}\")\n        raise\n\ndef setup_asyncio_policy():\n    \"\"\"\n    Setup asyncio event loop policy for Windows.\n    \"\"\"\n    if platform.system() == 'Windows':\n        asyncio.set_event_loop_policy(asyncio.WindowsSelectorEventLoopPolicy())\n        logger.info(\"Set Windows asyncio event loop policy.\")\n\n@tool\ndef my_python_tool(input_data: str) -> str:\n    \"\"\"\n    Tool function to process input data and query the Azure ML endpoint.\n    \"\"\"\n    setup_asyncio_policy()\n    return query_azml_endpoint(input_data, AZURE_ML_ENDPOINT, AZURE_ML_API_KEY)\n

\n

\n
Type the following command to run the integrate_with_promptflow script and start Prompt flow.

\n
```
pf flow serve --source ./ --port 8080 --host localhost\n
```
\n
\n
Here's an example of the results: Now you can chat with your custom Phi-3 model. It is recommended to ask questions based on the data used for fine-tuning.
\n

\n
\n

\n

\n

\n
\n

\n

Congratulations!

\n

You've completed this tutorial

\n

Congratulations! You have successfully completed the tutorial on fine-tuning and integrating custom Phi-3 models with Prompt flow. This tutorial introduced the simplest method of fine-tuning, avoiding additional techniques such as LoRA or QLoRA, and using MLflow to streamline the fine-tuning and deployment process. Advanced techniques and detailed explanations will be covered in the next series.

\n

\n\n

\n

Clean Up Azure Resources

\n

Cleanup your Azure resources to avoid additional charges to your account. Go to the Azure portal and delete the following resources:

\n

The Azure Machine Learning resource.
The Azure Machine Learning model endpoint.

\n

Source Code for the Tutorial

\n

You can find the complete source code for this tutorial in the following repository:

\n

skytin1004/Fine-Tune-and-Integrate-Custom-Phi-3-Models-with-Prompt-Flow

\n

Reference

\n

microsoft/Phi-3CookBook
Azure/azure-llm-fine-tuning

\n

Introduction

\n

Phi-3 is a family of small language models (SLMs) developed by Microsoft that delivers exceptional performance and cost-effectiveness. In this tutorial, you will learn how to fine-tune the Phi-3 model and integrate it with Prompt flow. By leveraging Azure Machine Learning, and Prompt flow you will establish a workflow for deploying and utilizing custom AI models. This tutorial is divided into three series:

\n

Series 1: Set up Azure resources and Prepare for fine-tuning

\n

\n
Create Azure Machine Learning workspace: Set up an Azure Machine Learning workspace, which serves as the hub for managing machine learning experiments and models.
\n
\n
Request GPU quotas: Request GPU quotas in your Azure subscription to ensure sufficient resources for model fine-tuning.
\n
\n
Add role assignment: Set up a User Assigned Managed Identity (UAI) and assign it necessary permissions (Contributor, Storage Blob Data Reader, AcrPull) to access resources like storage accounts and container registries.
\n
\n
Set up the project: Create a local environment, set up a virtual environment, install required packages, and create a script (download_dataset.py) to download the dataset (ULTRACHAT_200k) required for fine-tuning.
\n

\n

Series 2: Fine-tune and Deploy the Phi-3 model

\n

\n
Define fine-tuning process: Add code to the fine_tune.py file to define the fine-tuning process, including data loading, preprocessing, and training configurations.
\n
\n
Fine-tune the Phi-3 model: Add code to and run the setup_ml.py file to set up the compute environment, define the fine-tuning job, and submit it to Azure Machine Learning.
\n
\n
Deploy the Fine-tuned model: Once fine-tuning is complete, Add code to the deploy_model.py file to register the fine-tuned model in Azure Machine Learning, create an online endpoint, and deploy the model for real-time inference.
\n

\n

Series 3: Integrate the custom Phi-3 model with Prompt flow

\n

\n
Build Prompt flow: Add code to the flow.dag.yml file to build a flow.
\n
\n
Integrate with Prompt flow: Add code to integrate_with_promptflow file to integrate the custom Phi-3 model with Prompt flow.
\n

\n

Here is an overview of this tutorial.

\n

Note\n

Microsoft has released the Phi-3.5 models, featuring enhanced multi-language support, improved vision capabilities, and advanced Intelligence Mixture of Experts (MOEs). Although this tutorial primarily focuses on Phi-3, you can apply the same steps to fine-tune and integrate the Phi-3.5 model for even better performance. A tip on how to modify the fine_tune.py script to switch to the Phi-3.5 model is included below at Fine-tune the Phi-3 model section.

\n

For more detailed information and to explore additional resources about Phi-3 and Phi-3.5, please visit the Phi-3CookBook.

\n

Prerequisites

\n

Series 1: Set up Azure resources and Prepare for fine-tuning

\n

Create Azure Machine Learning Workspace

\n

In this exercise, you will:

\n

Create an Azure Machine Learning Workspace.

\n

Create an Azure Machine Learning Workspace

\n

\n
Type azure machine learning in the search bar at the top of the portal page and select Azure Machine Learning from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select + Create from the navigation menu.
\n
\n
Select New workspace from the navigation menu.
\n

\n
\n

\n\n

\n
\n
\n
Perform the following tasks:
\n
- Select your Azure Subscription.
- Select the Resource group to use (create a new one if needed).
- Enter Workspace Name. It must be a unique value.
- Select the Region you'd like to use.
- Select the Storage account to use (create a new one if needed).
- Select the Key vault to use (create a new one if needed).
- Select the Application insights to use (create a new one if needed).
- Select the Container registry to use (create a new one if needed).
\n

\n
\n

\n\n

\n
\n
Tip\n
When you create or use a Storage account in Azure Machine Learning, a container named \"azureml\" is automatically created within the Storage account. This container is used for storing model artifacts, training outputs, and other data generated during the machine learning process. In this tutorial, you will use the \"azureml\" container to manage and store all the necessary files and outputs related to our machine learning workflows.
\n
\n
\n

\n
\n
\n
Select Review + Create.
\n
\n
Select Create.
\n

\n

Request GPU quotas in Azure Subscription

\n

In this tutorial, you will learn how to fine-tune and deploy a Phi-3 model, using GPUs. For fine-tuning, you will use the Standard_NC24ads_A100_v4 GPU, which requires a quota request. For deployment, you will use the Standard_E4s_v3 CPU, which does not require a quota request.

\n

Note\n

Only Pay-As-You-Go subscriptions (the standard subscription type) are eligible for GPU allocation; benefit subscriptions are not currently supported.

\n

For those using benefit subscriptions (such as Visual Studio Enterprise Subscription) or those looking to quickly test the fine-tuning and deployment process, this tutorial also provides guidance for fine-tuning with a minimal dataset using a CPU. However, it is important to note that fine-tuning results are significantly better when using a GPU with larger datasets.

\n

In this exercise, you will:

\n

Request GPU Quotas in your Azure Subscription

\n

Request GPU Quotas in Azure Subscription

\n

\n
Visit Azure ML Studio.
\n
\n
Perform the following tasks to request Standard NCADSA100v4 Family quota:
\n
- Select Quota from the left side tab.
- \n
  Select the Virtual machine family to use. For example, select Standard NCADSA100v4 Family Cluster Dedicated vCPUs, which includes the Standard_NC24ads_A100_v4 GPU.
  \n
- \n
  Select the Request quota from the navigation menu.
  \n
  
  \n
  \n
  
  \n\n
  
  \n
  \n
- \n
  Inside the Request quota page, enter the New cores limit you'd like to use. For example, 24.
  \n
- \n
  Inside the Request quota page, select Submit to request the GPU quota.
  \n
\n

\n

Note\n

You can select the appropriate GPU or CPU for your needs by referring to Sizes for Virtual Machines in Azure document.

\n

Add role assignment

\n

To fine-tune and deploy your models, you must first ceate a User Assigned Managed Identity (UAI) and assign it the appropriate permissions. This UAI will be used for authentication during deployment, so it is critical to grant it access to the storage accounts, container registry, and resource group.

\n

In this exercise, you will:

\n

Create User Assigned Managed Identity(UAI).
Add Contributor role assignment to Managed Identity.
Add Storage Blob Data Reader role assignment to Managed Identity.
Add AcrPull role assignment to Managed Identity.

\n

Create User Assigned Managed Identity(UAI)

\n

\n
Type managed identities in the search bar at the top of the portal page and select Managed Identities from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select + Create.
\n

\n
\n

\n\n

\n
\n
\n
Perform the following tasks to navigate to Add role assignment page:
\n
- Select your Azure Subscription.
- Select the Resource group to use (create a new one if needed).
- Select the Region you'd like to use.
- Enter the Name. It must be a unique value.
\n

\n
\n

\n\n

\n
\n
\n
Select Review + create.
\n
\n
Select + Create.
\n

\n

Add Contributor role assignment to Managed Identity

\n

\n
Navigate to the Managed Identity resource that you created.
\n
\n
Select Azure role assignments from the left side tab.
\n
\n
Select +Add role assignment from the navigation menu.
\n
\n
Inside Add role assignment page, Perform the following tasks:
\n
- Select the Scope to Resource group.
- Select your Azure Subscription.
- Select the Resource group to use.
- Select the Role to Contributor.
\n

\n
\n

\n\n

\n
\n
\n
Select Save.
\n

\n

Add Storage Blob Data Reader role assignment to Managed Identity

\n

\n
Type azure storage accounts in the search bar at the top of the portal page and select Storage accounts from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select the storage account that associated with the Azure Machine Learning workspace. For example, finetunephistorage.
\n
\n
Perform the following tasks to navigate to Add role assignment page:
\n
- Navigate to the Azure Storage account that you created.
- Select Access Control (IAM) from the left side tab.
- Select + Add from the navigation menu.
- Select Add role assignment from the navigation menu.
\n

\n
\n

\n
\n

\n
\n
\n
Inside Add role assignment page, Perform the following tasks:
\n
- \n
  Inside the Role page, type Storage Blob Data Reader in the search bar and select Storage Blob Data Reader from the options that appear.
  \n
  
  \n
  \n
  
  \n
  \n
  
  \n
- \n
  Inside the Role page, select Next.
  \n
- \n
  Inside the Members page, select Assign access to Managed identity.
  \n
- \n
  Inside the Members page, select + Select members.
  \n
- \n
  Inside Select managed identities page, select your Azure Subscription.
  \n
- \n
  Inside Select managed identities page, select the Managed identity to Manage Identity.
  \n
- \n
  Inside Select managed identities page, select the Manage Identity that you created. For example, finetunephi-managedidentity.
  \n
- \n
  Inside Select managed identities page, select Select.
  \n
  
  \n
  \n
  
  \n
  \n
  
  \n
- \n
  Select Review + assign.
  \n
\n

\n

Add AcrPull role assignment to Managed Identity

\n

\n
Type container registries in the search bar at the top of the portal page and select Container registries from the options that appear.
\n

\n
\n

\n\n

\n
\n
\n
Select the container registry that associated with the Azure Machine Learning workspace. For example, finetunephicontainerregistries
\n
\n
Perform the following tasks to navigate to Add role assignment page:
\n
- Select Access Control (IAM) from the left side tab.
- Select + Add from the navigation menu.
- Select Add role assignment from the navigation menu.
\n
\n
Inside Add role assignment page, Perform the following tasks:
\n
- Inside the Role page, Type AcrPull in the search bar and select AcrPull from the options that appear.
- Inside the Role page, select Next.
- Inside the Members page, select Assign access to Managed identity.
- Inside the Members page, select + Select members.
- Inside Select managed identities page, select your Azure Subscription.
- Inside Select managed identities page, select the Managed identity to Manage Identity.
- Inside Select managed identities page, select the Manage Identity that you created. For example, finetunephi-managedidentity.
- Inside Select managed identities page, select Select.
- Select Review + assign.
\n

\n

Set up the project and install the libraries

\n

Now, you will create a folder to work in and set up a virtual environment to develop a program.

\n

In this exercise, you will

\n

Create a folder to work inside it.
Create a virtual environment.
Install the required packages.

\n

Create a folder to work inside it

\n

\n
Open a terminal window and type the following command to create a folder named finetune-phi in the default path.

\n
```
mkdir finetune-phi\n
```
\n
\n
Type the following command inside your terminal to navigate to the finetune-phi folder you created.

\n
```
cd finetune-phi\n
```
\n

\n

Create a virtual environment

\n

\n
Type the following command inside your terminal to create a virtual environment named .venv.

\n
```
python -m venv .venv\n
```
\n
\n
Type the following command inside your terminal to activate the virtual environment.

\n
```
.venv\\Scripts\\activate.bat\n
```
\n

\n

Note\n

If it worked, you should see (.venv) before the command prompt.

\n

Install the required packages

\n

Type the following commands inside your terminal to install the required packages.

\n

pip install datasets==2.19.1\npip install transformers==4.41.1\npip install azure-ai-ml==1.16.0\npip install torch==2.3.1\npip install trl==0.9.4\npip install promptflow==1.12.0

\n

Set up project files in Visual Studio Code

\n

In this exercise, you will create the essential files for our project. These files include scripts for downloading the dataset, setting up the Azure Machine Learning environment, fine-tuning the Phi-3 model, and deploying the fine-tuned model. You will also create a conda.yml file to set up the fine-tuning environment.

\n

In this exercise, you will:

\n

Create a download_dataset.py file to download the dataset.
Create a setup_ml.py file to set up the Azure Machine Learning environment.
Create a fine_tune.py file in the finetuning_dir folder to fine-tune the Phi-3 model using the dataset.
Create a conda.yml file to setup fine-tuning environment.
Create a deploy_model.py file to deploy the fine-tuned model.
Create a integrate_with_promptflow.py file, to integrate the fine-tuned model and execute the model using Prompt flow.
Create a flow.dag.yml file, to set up the workflow structure for Prompt flow.
Create a config.py file to enter Azure information.

\n

Note\n

Complete folder structure:

\n

└── YourUserName\n.    └── finetune-phi\n.        ├── finetuning_dir\n.        │      └── fine_tune.py\n.        ├── conda.yml\n.        ├── config.py\n.        ├── deploy_model.py\n.        ├── download_dataset.py\n.        ├── flow.dag.yml\n.        ├── integrate_with_promptflow.py\n.        └── setup_ml.py\n

\n

Create Project Files

\n

\n
Open Visual Studio Code.
\n
\n
Select File from the menu bar.
\n
\n
Select Open Folder.
\n
\n
Select the finetune-phi folder that you created, which is located at C:\\Users\\yourUserName\\finetune-phi.
\n

\n
\n

\n\n

\n
\n
\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named download_dataset.py.
\n
\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named setup_ml.py.
\n
\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named deploy_model.py.
\n

\n
\n

\n\n

\n
\n
\n
In the left pane of Visual Studio Code, right-click and select New Folder to create a new forder named finetuning_dir.
\n
\n
In the finetuning_dir folder, create a new file named fine_tune.py.

\n

\n

Create and Configure conda.yml file

\n

\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named conda.yml.
\n
\n
Add the following code to the conda.yml file to set up the fine-tuning environment for the Phi-3 model.

\nname: phi-3-training-env\nchannels:\n - defaults\n - conda-forge\ndependencies:\n - python=3.10\n - pip\n - numpy<2.0\n - pip:\n - torch==2.4.0\n - torchvision==0.19.0\n - trl==0.8.6\n - transformers==4.41\n - datasets==2.21.0\n - azureml-core==1.57.0\n - azure-storage-blob==12.19.0\n - azure-ai-ml==1.16\n - azure-identity==1.17.1\n - accelerate==0.33.0\n - mlflow==2.15.1\n - azureml-mlflow==1.57.0\n

\n

\n

Create and Configure config.py file

\n

\n
In the left pane of Visual Studio Code, right-click and select New File to create a new file named config.py.
\n
\n
Add the following code to the config.py file to include your Azure information.

\n# Azure settings\nAZURE_SUBSCRIPTION_ID = \"your_subscription_id\"\nAZURE_RESOURCE_GROUP_NAME = \"your_resource_group_name\" # \"TestGroup\"\n\n# Azure Machine Learning settings\nAZURE_ML_WORKSPACE_NAME = \"your_workspace_name\" # \"finetunephi-workspace\"\n\n# Azure Managed Identity settings\nAZURE_MANAGED_IDENTITY_CLIENT_ID = \"your_azure_managed_identity_client_id\"\nAZURE_MANAGED_IDENTITY_NAME = \"your_azure_managed_identity_name\" # \"finetunephi-mangedidentity\"\nAZURE_MANAGED_IDENTITY_RESOURCE_ID = f\"/subscriptions/{AZURE_SUBSCRIPTION_ID}/resourceGroups/{AZURE_RESOURCE_GROUP_NAME}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{AZURE_MANAGED_IDENTITY_NAME}\"\n\n# Dataset file paths\nTRAIN_DATA_PATH = \"data/train_data.jsonl\"\nTEST_DATA_PATH = \"data/test_data.jsonl\"\n\n# Fine-tuned model settings\nAZURE_MODEL_NAME = \"your_fine_tuned_model_name\" # \"finetune-phi-model\"\nAZURE_ENDPOINT_NAME = \"your_fine_tuned_model_endpoint_name\" # \"finetune-phi-endpoint\"\nAZURE_DEPLOYMENT_NAME = \"your_fine_tuned_model_deployment_name\" # \"finetune-phi-deployment\"\n\nAZURE_ML_API_KEY = \"your_fine_tuned_model_api_key\"\nAZURE_ML_ENDPOINT = \"your_fine_tuned_model_endpoint_uri\" # \"https://{your-endpoint-name}.{your-region}.inference.ml.azure.com/score\"

\n

Add Azure Environment Variables

\n

\n
Perform the following tasks to add the Azure Subscription ID:
\n
- Type subscriptions in the search bar at the top of the portal page and select Subscriptions from the options that appear.
  
  \n
  \n
  
  \n\n
  
  \n
  \n
- Select the Azure Subscription you are currently using.
- Copy and paste your Subscription ID into the config.py file.
\n

\n
\n
Perform the following tasks to add the Azure Workspace Name:
\n
- Navigate to the Azure Machine Learning resource that you created.
- Copy and paste your account name into the config.py file.
\n

\n
\n
Perform the following tasks to add the Azure Resource Group Name:
\n
- Navigate to the Azure Machine Learning resource that you created.
- Copy and paste your Azure Resource Group Name into the config.py file.
\n

\n
\n
Perform the following tasks to add the Azure Managed Identity name
\n
- Navigate to the Managed Identities resource that you created.
- Copy and paste your Azure Managed Identity name into the config.py file.
\n

\n

\n

Prepare Dataset for Fine-tuning

\n

In this exercise, you will run the download_dataset.py file to download the ultrachat_200k datasets to your local environment. You will then use this datasets to fine-tune the Phi-3 model in Azure Machine Learning.

\n

In this exercise, you will:

\n

Add code to the download_dataset.py file to download the datasets.
Run the download_dataset.py file to download datasets to your local environment.

\n

Download your dataset using download_dataset.py

\n

\n
Open the download_dataset.py file in Visual Studio Code.
\n
\n
Add the following code into download_dataset.py.

\nimport json\nimport os\nfrom datasets import load_dataset\nfrom config import (\n TRAIN_DATA_PATH,\n TEST_DATA_PATH)\n\ndef load_and_split_dataset(dataset_name, config_name, split_ratio):\n \"\"\"\n Load and split a dataset.\n \"\"\"\n # Load the dataset with the specified name, configuration, and split ratio\n dataset = load_dataset(dataset_name, config_name, split=split_ratio)\n print(f\"Original dataset size: {len(dataset)}\")\n \n # Split the dataset into train and test sets (80% train, 20% test)\n split_dataset = dataset.train_test_split(test_size=0.2)\n print(f\"Train dataset size: {len(split_dataset['train'])}\")\n print(f\"Test dataset size: {len(split_dataset['test'])}\")\n \n return split_dataset\n\ndef save_dataset_to_jsonl(dataset, filepath):\n \"\"\"\n Save a dataset to a JSONL file.\n \"\"\"\n # Create the directory if it does not exist\n os.makedirs(os.path.dirname(filepath), exist_ok=True)\n \n # Open the file in write mode\n with open(filepath, 'w', encoding='utf-8') as f:\n # Iterate over each record in the dataset\n for record in dataset:\n # Dump the record as a JSON object and write it to the file\n json.dump(record, f)\n # Write a newline character to separate records\n f.write('\\n')\n \n print(f\"Dataset saved to {filepath}\")\n\ndef main():\n \"\"\"\n Main function to load, split, and save the dataset.\n \"\"\"\n # Load and split the ULTRACHAT_200k dataset with a specific configuration and split ratio\n dataset = load_and_split_dataset(\"HuggingFaceH4/ultrachat_200k\", 'default', 'train_sft[:1%]')\n \n # Extract the train and test datasets from the split\n train_dataset = dataset['train']\n test_dataset = dataset['test']\n\n # Save the train dataset to a JSONL file\n save_dataset_to_jsonl(train_dataset, TRAIN_DATA_PATH)\n \n # Save the test dataset to a separate JSONL file\n save_dataset_to_jsonl(test_dataset, TEST_DATA_PATH)\n\nif __name__ == \"__main__\":\n main()\n\n

\n
\n
Tip\n
Guidance for fine-tuning with a minimal dataset using a CPU
\n
If you want to use a CPU for fine-tuning, this approach is ideal for those with benefit subscriptions (such as Visual Studio Enterprise Subscription) or to quickly test the fine-tuning and deployment process.
\n
Replace dataset = load_and_split_dataset(\"HuggingFaceH4/ultrachat_200k\", 'default', 'train_sft[:1%]') with dataset = load_and_split_dataset(\"HuggingFaceH4/ultrachat_200k\", 'default', 'train_sft[:10]')
\n
\n
\n

\n
\n
Type the following command inside your terminal to run the script and download the dataset to your local environment.

\n
```
python download_dataset.py\n
```
\n
\n
Verify that the datasets were saved successfully to your local finetune-phi/data directory.

\n

\n

Note\n

Note on dataset size and fine-tuning time

\n

In this tutorial, you use only 1% of the dataset (train_sft[:1%]). This significantly reduces the amount of data, speeding up both the upload and fine-tuning processes. You can adjust the percentage to find the right balance between training time and model performance. Using a smaller subset of the dataset reduces the time required for fine-tuning, making the process more manageable for a tutorial.

\n

Series 2: Fine-tune and Deploy the Phi-3 model

\n

Fine-tune the Phi-3 model

\n

In this exercise, you will fine-tune the Phi-3 model using the provided dataset. First, you will define the fine-tuning process in the fine_tune.py file. Then, you will configure the Azure Machine Learning environment and initiate the fine-tuning process by running the setup_ml.py file. This script ensures that the fine-tuning occurs within the Azure Machine Learning environment.

\n

By running setup_ml.py, you will run the fine-tuning process in the Azure Machine Learning environment.

\n

In this exercise, you will:

\n

Set up Azure CLI to authenticate environment
Add code to the fine_tune.py file to fine-tune the model.
Add code to and run the setup_ml.py file to initiate the fine-tuning process in Azure Machine Learning.
Run the setup_ml.py file to fine-tune the Phi-3 model using Azure Machine Learning.

\n

Set up Azure CLI

\n

You need to set up Azure CLI to authenticate your environment. Azure CLI allows you to manage Azure resources directly from the command line and provides the credentials necessary for Azure Machine Learning to access these resources. To get started install Azure CLI

\n

\n
Open a terminal window and type the following command to log in to your Azure account.

\n
```
az login\n
```
\n
\n
Select your Azure account to use.
\n
\n
Select your Azure subscription to use.
\n

\n
\n

\n\n

\n
\n

\n

Tip\n

Having trouble signing in to Azure? Try using a device code

\n

Open a terminal window and type the following command to log in to your Azure account.

\n
```
az login --use-device-code
```
\n

\n

Visit the website displayed in the terminal window and enter the provided code on that site.

\n

Inside the website, select Next.

\n

\n
Inside the website, select the account to use in this tutorial

\n

\n
Inside the website, select continue to complete login.
After successful login, go back to your terminal and select your Azure subscription to use.

\n

\n

\n

Add code to the fine_tune.py file

\n

\n
Navigate to the finetuning_dir folder and Open the fine_tune.py file in Visual Studio Code.
\n
\n
Add the following code into fine_tune.py.

\nimport argparse\nimport sys\nimport logging\nimport os\nfrom datasets import load_dataset\nimport torch\nimport mlflow\nfrom transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments\nfrom trl import SFTTrainer\n\n# To avoid the INVALID_PARAMETER_VALUE error in MLflow, disable MLflow integration\nos.environ[\"DISABLE_MLFLOW_INTEGRATION\"] = \"True\"\n\n# Logging setup\nlogging.basicConfig(\n format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n datefmt=\"%Y-%m-%d %H:%M:%S\",\n handlers=[logging.StreamHandler(sys.stdout)],\n level=logging.WARNING\n)\nlogger = logging.getLogger(__name__)\n\ndef initialize_model_and_tokenizer(model_name, model_kwargs):\n \"\"\"\n Initialize the model and tokenizer with the given pretrained model name and arguments.\n \"\"\"\n model = AutoModelForCausalLM.from_pretrained(model_name, **model_kwargs)\n tokenizer = AutoTokenizer.from_pretrained(model_name)\n tokenizer.model_max_length = 2048\n tokenizer.pad_token = tokenizer.unk_token\n tokenizer.pad_token_id = tokenizer.convert_tokens_to_ids(tokenizer.pad_token)\n tokenizer.padding_side = 'right'\n return model, tokenizer\n\ndef apply_chat_template(example, tokenizer):\n \"\"\"\n Apply a chat template to tokenize messages in the example.\n \"\"\"\n messages = example[\"messages\"]\n if messages[0][\"role\"] != \"system\":\n messages.insert(0, {\"role\": \"system\", \"content\": \"\"})\n example[\"text\"] = tokenizer.apply_chat_template(\n messages, tokenize=False, add_generation_prompt=False\n )\n return example\n\ndef load_and_preprocess_data(train_filepath, test_filepath, tokenizer):\n \"\"\"\n Load and preprocess the dataset.\n \"\"\"\n train_dataset = load_dataset('json', data_files=train_filepath, split='train')\n test_dataset = load_dataset('json', data_files=test_filepath, split='train')\n column_names = list(train_dataset.features)\n\n train_dataset = train_dataset.map(\n apply_chat_template,\n fn_kwargs={\"tokenizer\": tokenizer},\n num_proc=10,\n remove_columns=column_names,\n desc=\"Applying chat template to train dataset\",\n )\n\n test_dataset = test_dataset.map(\n apply_chat_template,\n fn_kwargs={\"tokenizer\": tokenizer},\n num_proc=10,\n remove_columns=column_names,\n desc=\"Applying chat template to test dataset\",\n )\n\n return train_dataset, test_dataset\n\ndef train_and_evaluate_model(train_dataset, test_dataset, model, tokenizer, output_dir):\n \"\"\"\n Train and evaluate the model.\n \"\"\"\n training_args = TrainingArguments(\n bf16=True,\n do_eval=True,\n output_dir=output_dir,\n eval_strategy=\"epoch\",\n learning_rate=5.0e-06,\n logging_steps=20,\n lr_scheduler_type=\"cosine\",\n num_train_epochs=3,\n overwrite_output_dir=True,\n per_device_eval_batch_size=4,\n per_device_train_batch_size=4,\n remove_unused_columns=True,\n save_steps=500,\n seed=0,\n gradient_checkpointing=True,\n gradient_accumulation_steps=1,\n warmup_ratio=0.2,\n )\n\n trainer = SFTTrainer(\n model=model,\n args=training_args,\n train_dataset=train_dataset,\n eval_dataset=test_dataset,\n max_seq_length=2048,\n dataset_text_field=\"text\",\n tokenizer=tokenizer,\n packing=True\n )\n\n train_result = trainer.train()\n trainer.log_metrics(\"train\", train_result.metrics)\n\n mlflow.transformers.log_model(\n transformers_model={\"model\": trainer.model, \"tokenizer\": tokenizer},\n artifact_path=output_dir,\n )\n\n tokenizer.padding_side = 'left'\n eval_metrics = trainer.evaluate()\n eval_metrics[\"eval_samples\"] = len(test_dataset)\n trainer.log_metrics(\"eval\", eval_metrics)\n\ndef main(train_file, eval_file, model_output_dir):\n \"\"\"\n Main function to fine-tune the model.\n \"\"\"\n model_kwargs = {\n \"use_cache\": False,\n \"trust_remote_code\": True,\n \"torch_dtype\": torch.bfloat16,\n \"device_map\": None,\n \"attn_implementation\": \"eager\"\n }\n \n pretrained_model_name = \"microsoft/Phi-3.5-mini-instruct\"\n # pretrained_model_name = \"microsoft/Phi-3-mini-4k-instruct\"\n\n with mlflow.start_run():\n model, tokenizer = initialize_model_and_tokenizer(pretrained_model_name, model_kwargs)\n train_dataset, test_dataset = load_and_preprocess_data(train_file, eval_file, tokenizer)\n train_and_evaluate_model(train_dataset, test_dataset, model, tokenizer, model_output_dir)\n\nif __name__ == \"__main__\":\n parser = argparse.ArgumentParser()\n parser.add_argument(\"--train-file\", type=str, required=True, help=\"Path to the training data\")\n parser.add_argument(\"--eval-file\", type=str, required=True, help=\"Path to the evaluation data\")\n parser.add_argument(\"--model_output_dir\", type=str, required=True, help=\"Directory to save the fine-tuned model\")\n args = parser.parse_args()\n main(args.train_file, args.eval_file, args.model_output_dir)\n\n

\n
\n
Save and close the fine_tune.py file.
\n

\n

Tip\n

You can fine-tune Phi-3.5 model

\n

In fine_tune.py file, you can change the pretrained_model_name from \"microsoft/Phi-3-mini-4k-instruct\" to any model you want to fine-tune. For example, if you change it to \"microsoft/Phi-3.5-mini-instruct\", you'll be using the Phi-3.5-mini-instruct model for fine-tuning. To find and use the model name you prefer, visit Hugging Face, search for the model you're interested in, and then copy and paste its name into the pretrained_model_name field in your script.

\n

Add code to the setup_ml.py file

\n

\n
Open the setup_ml.py file in Visual Studio Code.
\n
\n
Add the following code into setup_ml.py.

\nimport logging\nfrom azure.ai.ml import MLClient, command, Input\nfrom azure.ai.ml.entities import Environment, AmlCompute\nfrom azure.identity import AzureCliCredential\nfrom config import (\n AZURE_SUBSCRIPTION_ID,\n AZURE_RESOURCE_GROUP_NAME,\n AZURE_ML_WORKSPACE_NAME,\n TRAIN_DATA_PATH,\n TEST_DATA_PATH\n)\n\n# Constants\n\n# Uncomment the following lines to use a CPU instance for training\n# COMPUTE_INSTANCE_TYPE = \"Standard_E16s_v3\" # cpu\n# COMPUTE_NAME = \"cpu-e16s-v3\"\n# DOCKER_IMAGE_NAME = \"mcr.microsoft.com/azureml/openmpi4.1.0-ubuntu20.04:latest\"\n\n# Uncomment the following lines to use a GPU instance for training\nCOMPUTE_INSTANCE_TYPE = \"Standard_NC24ads_A100_v4\"\nCOMPUTE_NAME = \"gpu-nc24s-a100-v4\"\nDOCKER_IMAGE_NAME = \"mcr.microsoft.com/azureml/curated/acft-hf-nlp-gpu:59\"\n\nCONDA_FILE = \"conda.yml\"\nLOCATION = \"eastus2\" # Replace with the location of your compute cluster\nFINETUNING_DIR = \"./finetuning_dir\" # Path to the fine-tuning script\nTRAINING_ENV_NAME = \"phi-3-training-environment\" # Name of the training environment\nMODEL_OUTPUT_DIR = \"./model_output\" # Path to the model output directory in azure ml\n\n# Logging setup to track the process\nlogger = logging.getLogger(__name__)\nlogging.basicConfig(\n format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n datefmt=\"%Y-%m-%d %H:%M:%S\",\n level=logging.WARNING\n)\n\ndef get_ml_client():\n \"\"\"\n Initialize the ML Client using Azure CLI credentials.\n \"\"\"\n credential = AzureCliCredential()\n return MLClient(credential, AZURE_SUBSCRIPTION_ID, AZURE_RESOURCE_GROUP_NAME, AZURE_ML_WORKSPACE_NAME)\n\ndef create_or_get_environment(ml_client):\n \"\"\"\n Create or update the training environment in Azure ML.\n \"\"\"\n env = Environment(\n image=DOCKER_IMAGE_NAME, # Docker image for the environment\n conda_file=CONDA_FILE, # Conda environment file\n name=TRAINING_ENV_NAME, # Name of the environment\n )\n return ml_client.environments.create_or_update(env)\n\ndef create_or_get_compute_cluster(ml_client, compute_name, COMPUTE_INSTANCE_TYPE, location):\n \"\"\"\n Create or update the compute cluster in Azure ML.\n \"\"\"\n try:\n compute_cluster = ml_client.compute.get(compute_name)\n logger.info(f\"Compute cluster '{compute_name}' already exists. Reusing it for the current run.\")\n except Exception:\n logger.info(f\"Compute cluster '{compute_name}' does not exist. Creating a new one with size {COMPUTE_INSTANCE_TYPE}.\")\n compute_cluster = AmlCompute(\n name=compute_name,\n size=COMPUTE_INSTANCE_TYPE,\n location=location,\n tier=\"Dedicated\", # Tier of the compute cluster\n min_instances=0, # Minimum number of instances\n max_instances=1 # Maximum number of instances\n )\n ml_client.compute.begin_create_or_update(compute_cluster).wait() # Wait for the cluster to be created\n return compute_cluster\n\ndef create_fine_tuning_job(env, compute_name):\n \"\"\"\n Set up the fine-tuning job in Azure ML.\n \"\"\"\n return command(\n code=FINETUNING_DIR, # Path to fine_tune.py\n command=(\n \"python fine_tune.py \"\n \"--train-file ${{inputs.train_file}} \"\n \"--eval-file ${{inputs.eval_file}} \"\n \"--model_output_dir ${{inputs.model_output}}\"\n ),\n environment=env, # Training environment\n compute=compute_name, # Compute cluster to use\n inputs={\n \"train_file\": Input(type=\"uri_file\", path=TRAIN_DATA_PATH), # Path to the training data file\n \"eval_file\": Input(type=\"uri_file\", path=TEST_DATA_PATH), # Path to the evaluation data file\n \"model_output\": MODEL_OUTPUT_DIR\n }\n )\n\ndef main():\n \"\"\"\n Main function to set up and run the fine-tuning job in Azure ML.\n \"\"\"\n # Initialize ML Client\n ml_client = get_ml_client()\n\n # Create Environment\n env = create_or_get_environment(ml_client)\n \n # Create or get existing compute cluster\n create_or_get_compute_cluster(ml_client, COMPUTE_NAME, COMPUTE_INSTANCE_TYPE, LOCATION)\n\n # Create and Submit Fine-Tuning Job\n job = create_fine_tuning_job(env, COMPUTE_NAME)\n returned_job = ml_client.jobs.create_or_update(job) # Submit the job\n ml_client.jobs.stream(returned_job.name) # Stream the job logs\n \n # Capture the job name\n job_name = returned_job.name\n print(f\"Job name: {job_name}\")\n\nif __name__ == \"__main__\":\n main()\n\n

\n

\n

Replace COMPUTE_INSTANCE_TYPE, COMPUTE_NAME, and LOCATION with your specific details.

\n

# Uncomment the following lines to use a GPU instance for training\nCOMPUTE_INSTANCE_TYPE = \"Standard_NC24ads_A100_v4\"\nCOMPUTE_NAME = \"gpu-nc24s-a100-v4\"\n...\nLOCATION = \"eastus2\" # Replace with the location of your compute cluster\n

\n

Tip\n

Guidance for fine-tuning with a minimal dataset using a CPU

\n

If you want to use a CPU for fine-tuning, this approach is ideal for those with benefit subscriptions (such as Visual Studio Enterprise Subscription) or to quickly test the fine-tuning and deployment process.

\n

Open the setup_ml file.
Replace COMPUTE_INSTANCE_TYPE, COMPUTE_NAME, and DOCKER_IMAGE_NAME with the following. If you do not have access to Standard_E16s_v3, you can use an equivalent CPU instance or request a new quota.
Replace LOCATION with your specific details.

\n

# Uncomment the following lines to use a CPU instance for training\nCOMPUTE_INSTANCE_TYPE = \"Standard_E16s_v3\" # cpu\nCOMPUTE_NAME = \"cpu-e16s-v3\"\nDOCKER_IMAGE_NAME = \"mcr.microsoft.com/azureml/openmpi4.1.0-ubuntu20.04:latest\"\nLOCATION = \"eastus2\" # Replace with the location of your compute cluster

\n

\n
Type the following command to run the setup_ml.py script and start the fine-tuning process in Azure Machine Learning.

\n
```
python setup_ml.py\n
```
\n
\n
In this exercise, you successfully fine-tuned the Phi-3 model using Azure Machine Learning. By running the setup_ml.py script, you have set up the Azure Machine Learning environment and initiated the fine-tuning process defined in fine_tune.py file. Please note that the fine-tuning process can take a considerable amount of time. After running the python setup_ml.py command, you need to wait for the process to complete. You can monitor the status of the fine-tuning job by following the link provided in the terminal to the Azure Machine Learning portal. In the next series, you will deploy the fine-tuned model and integrate it with Prompt flow.

\n\n

\n

\n

Deploy the fine-tuned model

\n

To integrate the fine-tuned Phi-3 model with Prompt Flow, you need to deploy the model to make it accessible for real-time inference. This process involves registering the model, creating an online endpoint, and deploying the model.

\n

In this exercise, you will:

\n

Set the model name, endpoint name, and deployment name for deployment.
Register the fine-tuned model in the Azure Machine Learning workspace.
Create an online endpoint.
Deploy the registered fine-tuned Phi-3 model.

\n

Set the model name, endpoint name, and deployment name for deployment

\n

\n
Open config.py file.
\n
\n
Replace AZURE_MODEL_NAME = \"your_fine_tuned_model_name\" with the desired name for your model.
\n
\n
Replace AZURE_ENDPOINT_NAME = \"your_fine_tuned_model_endpoint_name\" with the desired name for your endpoint.
\n
\n
Replace AZURE_DEPLOYMENT_NAME = \"your_fine_tuned_model_deployment_name\" with the desired name for your deployment.
\n

\n

Deploy the fine-tuned model

\n

Running the deploy_model.py file automates the entire deployment process. It registers the model, creates an endpoint, and executes the deployment based on the settings specified in the config.py file, which includes the model name, endpoint name, and deployment name.

\n

\n
Open the deploy_model.py file in Visual Studio Code.
\n
\n
Add the following code into deploy_model.py.

\nimport logging\nfrom azure.identity import AzureCliCredential\nfrom azure.ai.ml import MLClient\nfrom azure.ai.ml.entities import Model, ProbeSettings, ManagedOnlineEndpoint, ManagedOnlineDeployment, IdentityConfiguration, ManagedIdentityConfiguration, OnlineRequestSettings\nfrom azure.ai.ml.constants import AssetTypes\n\n# Configuration imports\nfrom config import (\n AZURE_SUBSCRIPTION_ID,\n AZURE_RESOURCE_GROUP_NAME,\n AZURE_ML_WORKSPACE_NAME,\n AZURE_MANAGED_IDENTITY_RESOURCE_ID,\n AZURE_MANAGED_IDENTITY_CLIENT_ID,\n AZURE_MODEL_NAME,\n AZURE_ENDPOINT_NAME,\n AZURE_DEPLOYMENT_NAME\n)\n\n# Constants\nJOB_NAME = \"your-job-name\"\nCOMPUTE_INSTANCE_TYPE = \"Standard_E4s_v3\"\n\ndeployment_env_vars = {\n \"SUBSCRIPTION_ID\": AZURE_SUBSCRIPTION_ID,\n \"RESOURCE_GROUP_NAME\": AZURE_RESOURCE_GROUP_NAME,\n \"UAI_CLIENT_ID\": AZURE_MANAGED_IDENTITY_CLIENT_ID,\n}\n\n# Logging setup\nlogging.basicConfig(\n format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n datefmt=\"%Y-%m-%d %H:%M:%S\",\n level=logging.DEBUG\n)\nlogger = logging.getLogger(__name__)\n\ndef get_ml_client():\n \"\"\"Initialize and return the ML Client.\"\"\"\n credential = AzureCliCredential()\n return MLClient(credential, AZURE_SUBSCRIPTION_ID, AZURE_RESOURCE_GROUP_NAME, AZURE_ML_WORKSPACE_NAME)\n\ndef register_model(ml_client, model_name, job_name):\n \"\"\"Register a new model.\"\"\"\n model_path = f\"azureml://jobs/{job_name}/outputs/artifacts/paths/model_output\"\n logger.info(f\"Registering model {model_name} from job {job_name} at path {model_path}.\")\n run_model = Model(\n path=model_path,\n name=model_name,\n description=\"Model created from run.\",\n type=AssetTypes.MLFLOW_MODEL,\n )\n model = ml_client.models.create_or_update(run_model)\n logger.info(f\"Registered model ID: {model.id}\")\n return model\n\ndef delete_existing_endpoint(ml_client, endpoint_name):\n \"\"\"Delete existing endpoint if it exists.\"\"\"\n try:\n endpoint_result = ml_client.online_endpoints.get(name=endpoint_name)\n logger.info(f\"Deleting existing endpoint {endpoint_name}.\")\n ml_client.online_endpoints.begin_delete(name=endpoint_name).result()\n logger.info(f\"Deleted existing endpoint {endpoint_name}.\")\n except Exception as e:\n logger.info(f\"No existing endpoint {endpoint_name} found to delete: {e}\")\n\ndef create_or_update_endpoint(ml_client, endpoint_name, description=\"\"):\n \"\"\"Create or update an endpoint.\"\"\"\n delete_existing_endpoint(ml_client, endpoint_name)\n logger.info(f\"Creating new endpoint {endpoint_name}.\")\n endpoint = ManagedOnlineEndpoint(\n name=endpoint_name,\n description=description,\n identity=IdentityConfiguration(\n type=\"user_assigned\",\n user_assigned_identities=[ManagedIdentityConfiguration(resource_id=AZURE_MANAGED_IDENTITY_RESOURCE_ID)]\n )\n )\n endpoint_result = ml_client.online_endpoints.begin_create_or_update(endpoint).result()\n logger.info(f\"Created new endpoint {endpoint_name}.\")\n return endpoint_result\n\ndef create_or_update_deployment(ml_client, endpoint_name, deployment_name, model):\n \"\"\"Create or update a deployment.\"\"\"\n\n logger.info(f\"Creating deployment {deployment_name} for endpoint {endpoint_name}.\")\n deployment = ManagedOnlineDeployment(\n name=deployment_name,\n endpoint_name=endpoint_name,\n model=model.id,\n instance_type=COMPUTE_INSTANCE_TYPE,\n instance_count=1,\n environment_variables=deployment_env_vars,\n request_settings=OnlineRequestSettings(\n max_concurrent_requests_per_instance=3,\n request_timeout_ms=180000,\n max_queue_wait_ms=120000\n ),\n liveness_probe=ProbeSettings(\n failure_threshold=30,\n success_threshold=1,\n period=100,\n initial_delay=500,\n ),\n readiness_probe=ProbeSettings(\n failure_threshold=30,\n success_threshold=1,\n period=100,\n initial_delay=500,\n ),\n )\n deployment_result = ml_client.online_deployments.begin_create_or_update(deployment).result()\n logger.info(f\"Created deployment {deployment.name} for endpoint {endpoint_name}.\")\n return deployment_result\n\ndef set_traffic_to_deployment(ml_client, endpoint_name, deployment_name):\n \"\"\"Set traffic to the specified deployment.\"\"\"\n try:\n # Fetch the current endpoint details\n endpoint = ml_client.online_endpoints.get(name=endpoint_name)\n \n # Log the current traffic allocation for debugging\n logger.info(f\"Current traffic allocation: {endpoint.traffic}\")\n \n # Set the traffic allocation for the deployment\n endpoint.traffic = {deployment_name: 100}\n \n # Update the endpoint with the new traffic allocation\n endpoint_poller = ml_client.online_endpoints.begin_create_or_update(endpoint)\n updated_endpoint = endpoint_poller.result()\n \n # Log the updated traffic allocation for debugging\n logger.info(f\"Updated traffic allocation: {updated_endpoint.traffic}\")\n logger.info(f\"Set traffic to deployment {deployment_name} at endpoint {endpoint_name}.\")\n return updated_endpoint\n except Exception as e:\n # Log any errors that occur during the process\n logger.error(f\"Failed to set traffic to deployment: {e}\")\n raise\n\n\ndef main():\n ml_client = get_ml_client()\n\n registered_model = register_model(ml_client, AZURE_MODEL_NAME, JOB_NAME)\n logger.info(f\"Registered model ID: {registered_model.id}\")\n\n endpoint = create_or_update_endpoint(ml_client, AZURE_ENDPOINT_NAME, \"Endpoint for finetuned Phi-3 model\")\n logger.info(f\"Endpoint {AZURE_ENDPOINT_NAME} is ready.\")\n\n try:\n deployment = create_or_update_deployment(ml_client, AZURE_ENDPOINT_NAME, AZURE_DEPLOYMENT_NAME, registered_model)\n logger.info(f\"Deployment {AZURE_DEPLOYMENT_NAME} is created for endpoint {AZURE_ENDPOINT_NAME}.\")\n\n set_traffic_to_deployment(ml_client, AZURE_ENDPOINT_NAME, AZURE_DEPLOYMENT_NAME)\n logger.info(f\"Traffic is set to deployment {AZURE_DEPLOYMENT_NAME} at endpoint {AZURE_ENDPOINT_NAME}.\")\n except Exception as e:\n logger.error(f\"Failed to create or update deployment: {e}\")\n\nif __name__ == \"__main__\":\n main()\n\n

\n
\n
Perform the following tasks to get the JOB_NAME:
\n
- Navigate to Azure Machine Learning resource that you created.
- Select Studio web URL to open the Azure Machine Learning workspace.
- Select Jobs from the left side tab.
- Select the experiment for fine-tuning. For example, finetunephi.
- Select the job that you created.
- Copy and paste your job Name into the JOB_NAME = \"your-job-name\" in deploy_model.py file.
\n
\n
Replace COMPUTE_INSTANCE_TYPE with your specific details.
\n
\n
Type the following command to run the deploy_model.py script and start the deployment process in Azure Machine Learning.

\n
```
python deploy_model.py
```
\n

\n

Warning\n

To avoid additional charges to your account, make sure to delete the created endpoint in the Azure Machine Learning workspace.

\n

Check deployment status in Azure Machine Learning Workspace

\n

\n
Visit Azure ML Studio.
\n
\n
Navigate to Azure Machine Learning workspace that you created.
\n
\n
Select Studio web URL to open the Azure Machine Learning workspace.
\n
Select Endpoints from the left side tab.\n

\n
\n

\n\n

\n
\n
\n
Select endpoint that you created.
\n

\n
\n

\n\n

\n
\n
\n
On this page, you can manage the endpoints created during the deployment process.

\n

\n

Series 3: Integrate the custom Phi-3 model with Prompt flow

\n

Integrate the custom Phi-3 model with Prompt Flow

\n

After successfully deploying your fine-tuned model, you can now integrate it with Prompt Flow to use your model in real-time applications, enabling a variety of interactive tasks with your custom Phi-3 model.

\n

In this exercise, you will:

\n

Set api key and endpoint uri of the fine-tuned Phi-3 model.
Add code to the flow.dag.yml file.
Add code to the integrate_with_promptflow.py file.
Test your custom Phi-3 model on Prompt flow.

\n

Set api key and endpoint uri of the fine-tuned Phi-3 model

\n

\n
Navigate to the Azure Machine learning workspace that you created.
\n
\n
Select Endpoints from the left side tab.
\n

\n
\n

\n\n

\n
\n
\n
Select endpoint that you created.
\n

\n
\n

\n\n

\n
\n
\n
Select Consume from the navigation menu.
\n
\n
Copy and paste your REST endpoint into the config.py file, replacing AZURE_ML_ENDPOINT = \"your_fine_tuned_model_endpoint_uri\" with your REST endpoint.
\n
\n
Copy and paste your Primary key into the config.py file, replacing AZURE_ML_API_KEY = \"your_fine_tuned_model_api_key\" with your Primary key.
\n

\n
\n

\n\n

\n
\n

\n

Add code to the flow.dag.yml file

\n

\n
Open the flow.dag.yml file in Visual Studio Code.
\n
\n
Add the following code into flow.dag.yml.

\ninputs:\n input_data:\n type: string\n default: \"Who founded Microsoft?\"\n\noutputs:\n answer:\n type: string\n reference: ${integrate_with_promptflow.output}\n\nnodes:\n- name: integrate_with_promptflow\n type: python\n source:\n type: code\n path: integrate_with_promptflow.py\n inputs:\n input_data: ${inputs.input_data}\n

\n

Add code to the integrate_with_promptflow.py file

\n

\n
Open the integrate_with_promptflow.py file in Visual Studio Code.
\n
\n
Add the following code into integrate_with_promptflow.py.

\nimport logging\nimport requests\nfrom promptflow.core import tool\nimport asyncio\nimport platform\nfrom config import (\n AZURE_ML_ENDPOINT,\n AZURE_ML_API_KEY\n)\n\n# Logging setup\nlogging.basicConfig(\n format=\"%(asctime)s - %(levelname)s - %(name)s - %(message)s\",\n datefmt=\"%Y-%m-%d %H:%M:%S\",\n level=logging.DEBUG\n)\nlogger = logging.getLogger(__name__)\n\ndef query_azml_endpoint(input_data: list, endpoint_url: str, api_key: str) -> str:\n \"\"\"\n Send a request to the Azure ML endpoint with the given input data.\n \"\"\"\n headers = {\n \"Content-Type\": \"application/json\",\n \"Authorization\": f\"Bearer {api_key}\"\n }\n data = {\n \"input_data\": [input_data],\n \"params\": {\n \"temperature\": 0.7,\n \"max_new_tokens\": 128,\n \"do_sample\": True,\n \"return_full_text\": True\n }\n }\n try:\n response = requests.post(endpoint_url, json=data, headers=headers)\n response.raise_for_status()\n result = response.json()[0]\n logger.info(\"Successfully received response from Azure ML Endpoint.\")\n return result\n except requests.exceptions.RequestException as e:\n logger.error(f\"Error querying Azure ML Endpoint: {e}\")\n raise\n\ndef setup_asyncio_policy():\n \"\"\"\n Setup asyncio event loop policy for Windows.\n \"\"\"\n if platform.system() == 'Windows':\n asyncio.set_event_loop_policy(asyncio.WindowsSelectorEventLoopPolicy())\n logger.info(\"Set Windows asyncio event loop policy.\")\n\n@tool\ndef my_python_tool(input_data: str) -> str:\n \"\"\"\n Tool function to process input data and query the Azure ML endpoint.\n \"\"\"\n setup_asyncio_policy()\n return query_azml_endpoint(input_data, AZURE_ML_ENDPOINT, AZURE_ML_API_KEY)\n\n

\n
\n
Type the following command to run the integrate_with_promptflow script and start Prompt flow.

\n
```
pf flow serve --source ./ --port 8080 --host localhost\n
```
\n
\n
Here's an example of the results: Now you can chat with your custom Phi-3 model. It is recommended to ask questions based on the data used for fine-tuning.
\n

\n
\n

\n

\n

\n
\n

\n

Congratulations!

\n

You've completed this tutorial

\n

Congratulations! You have successfully completed the tutorial on fine-tuning and integrating custom Phi-3 models with Prompt flow. This tutorial introduced the simplest method of fine-tuning, avoiding additional techniques such as LoRA or QLoRA, and using MLflow to streamline the fine-tuning and deployment process. Advanced techniques and detailed explanations will be covered in the next series.

\n

\n\n

\n

Clean Up Azure Resources

\n

Cleanup your Azure resources to avoid additional charges to your account. Go to the Azure portal and delete the following resources:

\n

The Azure Machine Learning resource.
The Azure Machine Learning model endpoint.

\n

Source Code for the Tutorial

\n

You can find the complete source code for this tutorial in the following repository:

\n

skytin1004/Fine-Tune-and-Integrate-Custom-Phi-3-Models-with-Prompt-Flow

\n

Reference

\n

microsoft/Phi-3CookBook
Azure/azure-llm-fine-tuning

\n

Blog Post

Fine-Tune and Integrate Custom Phi-3 Models with Prompt Flow: Step-by-Step Guide

Fine-Tune and Integrate Custom Phi-3 Models with Prompt Flow: Step-by-Step Guide

Introduction

Series 1: Set up Azure resources and Prepare for fine-tuning

Series 2: Fine-tune and Deploy the Phi-3 model

Series 3: Integrate the custom Phi-3 model with Prompt flow

Prerequisites

Table of Contents

Series 1: Set up Azure resources and Prepare for fine-tuning

Create Azure Machine Learning Workspace

Create an Azure Machine Learning Workspace

Request GPU quotas in Azure Subscription

Request GPU Quotas in Azure Subscription

Add role assignment

Create User Assigned Managed Identity(UAI)

Add Contributor role assignment to Managed Identity

Add Storage Blob Data Reader role assignment to Managed Identity

Add AcrPull role assignment to Managed Identity

Set up the project and install the libraries

Create a folder to work inside it

Create a virtual environment

Install the required packages

Set up project files in Visual Studio Code

Create Project Files

Create and Configure conda.yml file

Create and Configure config.py file

Add Azure Environment Variables

Prepare Dataset for Fine-tuning

Download your dataset using download_dataset.py

Guidance for fine-tuning with a minimal dataset using a CPU

Note on dataset size and fine-tuning time

Series 2: Fine-tune and Deploy the Phi-3 model

Fine-tune the Phi-3 model

Set up Azure CLI

Having trouble signing in to Azure? Try using a device code

Add code to the fine_tune.py file

You can fine-tune Phi-3.5 model

Add code to the setup_ml.py file

Guidance for fine-tuning with a minimal dataset using a CPU

Deploy the fine-tuned model

Set the model name, endpoint name, and deployment name for deployment

Deploy the fine-tuned model

Check deployment status in Azure Machine Learning Workspace

Series 3: Integrate the custom Phi-3 model with Prompt flow

Integrate the custom Phi-3 model with Prompt Flow

Set api key and endpoint uri of the fine-tuned Phi-3 model

Add code to the flow.dag.yml file

Add code to the integrate_with_promptflow.py file

Congratulations!

You've completed this tutorial

Clean Up Azure Resources

Source Code for the Tutorial

Reference

Update (July 25, 2024): Fine-Tune and Integrate Custom Phi-3 Models with Prompt Flow: Step-by-Step Guide

Updates:

Confirmation of Changes:

Fine-Tune and Integrate Custom Phi-3 Models with Prompt Flow: Step-by-Step Guide

Introduction

Series 1: Set up Azure resources and Prepare for fine-tuning

Series 2: Fine-tune and Deploy the Phi-3 model

Series 3: Integrate the custom Phi-3 model with Prompt flow

Prerequisites

Table of Contents

Series 1: Set up Azure resources and Prepare for fine-tuning

Create Azure Machine Learning Workspace

Create an Azure Machine Learning Workspace

Request GPU quotas in Azure Subscription

Request GPU Quotas in Azure Subscription

Add role assignment

Create User Assigned Managed Identity(UAI)

Add Contributor role assignment to Managed Identity

Add Storage Blob Data Reader role assignment to Managed Identity

Add AcrPull role assignment to Managed Identity

Set up the project and install the libraries

Create a folder to work inside it

Create a virtual environment

Install the required packages

Set up project files in Visual Studio Code

Create Project Files