This course is designed to build your foundational skills in data engineering on Microsoft Fabric, focusing on the Lakehouse concept. This course will explore the powerful capabilities of Apache Spark for distributed data processing and the essential techniques for efficient data management, versioning, and reliability by working with Delta Lake tables. This course will also explore data ingestion and orchestration using Dataflows Gen2 and Data Factory pipelines. This course includes a combination of lectures and hands-on exercises that will prepare you to work with lakehouses in Microsoft Fabric.
AUDIENCE
The primary audience for this course is data professionals who are familiar with data modeling, extraction, and analytics. It is designed for professionals who are interested in gaining knowledge about Lakehouse architecture, the Microsoft Fabric platform, and how to enable end-to-end analytics using these technologies.
Job role: Data Analyst, Data Engineer, Data Scientist
PREREQUISITES
You should be familiar with basic data concepts and terminology.
Module 1: Introduction to end-to-end analytics using Microsoft Fabric
Module 2: Get started with lakehouses in Microsoft Fabric
Lab: Create and ingest data with a Microsoft Fabric Lakehouse
Module 3: Use Apache Spark in Microsoft Fabric
Lab: Analyze data with Apache Spark
Module 4: Work with Delta Lake tables in Microsoft Fabric
Lab: Use delta tables in Apache Spark
Module 5: Ingest Data with Dataflows Gen2 in Microsoft Fabric
Lab: Create and use a Dataflow (Gen2) in Microsoft Fabric
Module 6: Use Data Factory pipelines in Microsoft Fabric
Lab: Ingest data with a pipeline
Duration: 1 day
Instructor: Microsoft Certified Trainer
Microsoft Certificate of Achievement