Deep learning
Training and Inference of LLMs with PyTorch Fully Sharded Data Parallel and Better Transformer
In this blog we show how to perform efficient, optimized distributed training and inference of large language models using PyTorch's Fully Sharded Data Parallel (FSDP) and Better Transformer implementations on the Spark platform. We combine Microsoft Fabric for data preparation and model inference with Azure Databricks for model training, keeping all data in Microsoft Fabric's OneLake. The code for this blog is available at this GitHub repository as a series of PySpark notebooks for Microsoft Fabric and Azure Databricks.
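The full notebooks live in the linked repository; as a rough orientation only, the sketch below shows the core FSDP pattern in PyTorch: initialize a process group, wrap the model in FullyShardedDataParallel so parameters, gradients, and optimizer state are sharded across ranks, then train as usual. The model, batch, and loss here are placeholders rather than the blog's actual code, and the script assumes a torchrun launch on CUDA devices.

```python
# Minimal FSDP training sketch (placeholders, not the blog's code).
# Assumes launch via: torchrun --nproc_per_node=<gpus> train.py
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def main():
    # torchrun sets RANK/WORLD_SIZE; NCCL backs GPU collectives.
    dist.init_process_group("nccl")
    torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

    model = torch.nn.Transformer(d_model=512, nhead=8).cuda()  # placeholder model
    model = FSDP(model)  # shards params, grads, and optimizer state across ranks

    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
    src = torch.rand(10, 32, 512, device="cuda")  # dummy (seq, batch, embed) batch
    tgt = torch.rand(20, 32, 512, device="cuda")

    for _ in range(3):
        optimizer.zero_grad()
        out = model(src, tgt)
        loss = out.float().pow(2).mean()  # dummy loss for illustration
        loss.backward()                   # FSDP handles gradient reduce-scatter
        optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```

For the inference side, the blog pairs this with Better Transformer, whose fused fastpath kernels accelerate transformer encoder inference without changing model weights.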
Interview with Jeremy Howard (Fast.ai): AI Application without a PhD

Fast.ai has made it their mission to make deep learning as accessible as possible, and in this interview fast.ai co-founder Jeremy Howard explains how to use their free software and courses to become an effective deep learning practitioner.

Responsible Synthetic Data Creation for Fine-Tuning with RAFT Distillation
This blog will explore the process of crafting responsible synthetic data, evaluating it, and using it for fine-tuning models. We'll also dive into Azure AI's RAFT distillation recipe, a novel approach to generating synthetic datasets using Meta's Llama 3.1 model and UC Berkeley's Gorilla project.
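As a hypothetical sketch of the RAFT idea (not Azure AI's recipe or the Gorilla project's actual code): for each document chunk, an LLM such as Llama 3.1 is prompted to synthesize a question and a chain-of-thought answer grounded in that "oracle" chunk, and distractor chunks are mixed into the context so the fine-tuned model learns to ignore irrelevant passages. The complete() function below is a placeholder for whatever chat-completion endpoint you deploy.

```python
# Hedged RAFT-style synthetic data sketch; all names and prompts are
# illustrative assumptions, not the published recipe.
import json
import random

def complete(prompt: str) -> str:
    # Placeholder: call your deployed chat-completion endpoint here
    # (e.g. a Llama 3.1 deployment); this is not a real client API.
    raise NotImplementedError

def raft_example(oracle: str, corpus: list[str], num_distractors: int = 3) -> dict:
    # Synthesize a question that only the oracle chunk can answer.
    question = complete(
        f"Write one question answerable only from this text:\n{oracle}"
    )
    # Synthesize a chain-of-thought answer that cites the oracle chunk.
    answer = complete(
        "Answer the question using only the context, quoting it where helpful.\n"
        f"Context:\n{oracle}\n\nQuestion: {question}"
    )
    # Mix the oracle with distractors so the fine-tuned model must locate
    # the relevant passage instead of trusting every retrieved document.
    others = [c for c in corpus if c != oracle]
    distractors = random.sample(others, min(num_distractors, len(others)))
    context = distractors + [oracle]
    random.shuffle(context)
    return {"context": context, "question": question, "cot_answer": answer}

# Usage: one fine-tuning record per chunk, written as JSONL.
# with open("raft_train.jsonl", "w") as f:
#     for chunk in corpus:
#         f.write(json.dumps(raft_example(chunk, corpus)) + "\n")
```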