synapse spark

69 Topics

Improve Spark pool utilization with Synapse Genie
The Synapse Genie Framework improves Spark pool utilization by executing multiple Synapse notebooks on the same Spark pool instance. It takes into account the sequence and dependencies between notebook activities in an ETL pipeline, which results in fuller use of the cluster resources available in a Spark pool.
InnovatorsClub
Jan 04, 2023 in Azure Synapse Analytics Blog
12K Views
18 likes
9 Comments
Building the Lakehouse - Implementing a Data Lake Strategy with Azure Synapse
This blog introduces Lakehouse data platform architecture, shows how to implement it with Azure Synapse, and covers key considerations to keep in mind while building it.
ArshadAliTMMBA
Sep 08, 2022 in Azure Synapse Analytics Blog
65K Views
16 likes
20 Comments
The best practices for organizing Synapse workspaces and lakehouses
While designing the Lakehouse solution, you should carefully organize your databases and tables based on the underlying folder structure. In this article, you will find some best practices and recommendations that can help you to organize your lakehouses if you are using Synapse Analytics workspace to implement them.
JovanPop
Nov 24, 2021 in Azure Synapse Analytics Blog
38K Views
16 likes
3 Comments
Synapse – Data Lake vs. Delta Lake vs. Data Lakehouse
As data engineers, we often hear terms like Data Lake, Delta Lake, and Data Lakehouse, which can be confusing at times. In this blog we’ll demystify these terms and talk about the differences between the technologies and concepts, along with usage scenarios for each.
giulianorapoz
Dec 08, 2022 in Azure Synapse Analytics Blog
54K Views
14 likes
0 Comments
Data mesh: A perspective on using Azure Synapse Analytics to build data products
This multi-part blog series discusses various aspects of implementing a data mesh architecture on Azure. This part focuses on the data-as-a-product principle and presents a perspective on using Azure Synapse Analytics as a data product. We discuss, at a high level, data product functions and capabilities and apply that lens to Synapse Analytics. We also discuss how workspaces can be partitioned to give domains the scale and agility to build data products.
amanjeet
Nov 03, 2022 in Azure Synapse Analytics Blog
18K Views
11 likes
4 Comments
The Data Lakehouse, the Data Warehouse and a Modern Data platform architecture
There are two contradictory themes about how to build a modern data platform being proposed to data architects today. This article discusses why there is such a big disparity between the two approaches, how we can make sense of these competing patterns, and why the modern data warehouse architecture provides a flexible and pragmatic approach.
GregLoxton
Mar 18, 2022 in Azure Synapse Analytics Blog
38K Views
11 likes
3 Comments
Apache Spark in Azure Synapse - Performance Update
How fast is Apache Spark in Azure Synapse? FAST!
euanga
Mar 30, 2021 in Azure Synapse Analytics Blog
25K Views
10 likes
7 Comments
Using OpenAI GPT in Synapse Analytics
Azure OpenAI hardly needs an introduction, but for those who managed to evade all tech news lately, let me give you a brief overview. Azure OpenAI is a suite of natural language processing (NLP) models developed by OpenAI. The models can be used in a very wide range of applications, including text generation, summarization, and translation.
tcosters
Mar 01, 2023 in Azure Synapse Analytics Blog
20K Views
8 likes
1 Comment
Speed up your data workloads with performance updates to Apache Spark 3.1.2 in Azure Synapse
Apache Spark 3.1.2 performance in Azure Synapse gets even faster!
balajisankaran
Sep 21, 2021 in Azure Synapse Analytics Blog
11K Views
8 likes
5 Comments
Strengthen Delta Lake in Synapse with auto maintenance job
Every data engineering program needs to keep its Delta Lake maintained. This blog presents a way to automate that maintenance process in Synapse Analytics.
InnovatorsClub
Feb 10, 2023 in Azure Synapse Analytics Blog
12K Views
7 likes
1 Comment