Blog Post

Educator Developer Blog
2 MIN READ

Analyzing Wildlife Data with Microsoft Fabric: end-to-end workshop

bethanyjep's avatar
bethanyjep
Icon for Microsoft rankMicrosoft
Nov 08, 2023

Introduction

Have you ever wondered how to make sense of large-scale and complex data sets? In the world of wildlife research, scientists face the challenge of analyzing millions of images captured by camera traps to study the diversity of wildlife in different ecosystems. But how do they handle such vast amounts of data and extract meaningful insights?
 
You can access the workshop at: https://aka.ms/fabric-e2e-serengeti
 
This workshop, dives into the fascinating world of data analytics using Microsoft Fabric and explore how it can be used to analyze the Snapshot Serengeti dataset. The workshop will take you through the entire process of building an end-to-end data analytics solution, including loading the data into a Lakehouse, exploring the data using SQL, visualizing the data using Power BI, and even training a machine learning model.

This workshop covers:

  1. Load data into Microsoft Fabric using Data Factory pipelines.
  2. Leverage SQL queries to explore and analyze the data.
  3. Create reports & visualize the data using Power BI.
  4. Use Apache Spark for data processing and analytics.
  5. Train & evaluate a machine learning model using the data science workload in Microsoft Fabric.

Why Microsoft Fabric

Microsoft Fabric is a unified data platform that offers a comprehensive suite of services for data science, data engineering, real-time analytics, and business intelligence. It provides a powerful set of tools and services that can help you analyze and extract insights from your data.
You can get started with Microsoft Fabric using the Fabric (preview) trial.

Training and Evaluating the Machine Learning Model

A key part in the workshop is training our data using a convolutional neural network (CNN) to classify the images in the dataset into different species. We will load a pre-trained DenseNet model from the torchvision library and modify its classifier layer to output 50 classes instead of the default 1000 classes. We will then train the model using the training dataset and evaluate its performance on the test dataset.

Conclusion

You can access the workshop at: https://aka.ms/fabric-e2e-serengeti

By the end of the workshop, you will have a better understanding of how to use Microsoft Fabric to create an end-to-end data analytics solution that can handle large-scale and complex data sets. So why wait? Dive into the world of Microsoft Fabric!
Updated Nov 07, 2023
Version 1.0
No CommentsBe the first to comment