20773 : Analyzing Big Data with Microsoft R

The main purpose of the course is to give students the ability to use Microsoft R Server to create and run an analysis on a large dataset, and show how to utilize it in Big Data environments, such as a Hadoop or Spark cluster, or a SQL Server database.

The primary audience for this course is people who wish to analyze large datasets within a big data environment.

The secondary audience are developers who need to integrate R analyses into their solutions.

In addition to their professional experience, students who attend this course should have:

  • Programming experience using R, and familiarity with common R packages
  • Knowledge of common statistical methods and data analysis best practices.
  • Basic knowledge of the Microsoft Windows operating system and its core functionality.
  • Working knowledge of relational databases.

After completing this course, students will be able to:

  • Explain how Microsoft R Server and Microsoft R Client work
  • Use R Client with R Server to explore big data held in different data stores
  • Visualize data by using graphs and plots
  • Transform and clean big data sets
  • Implement options for splitting analysis jobs into parallel tasks
  • Build and evaluate regression models generated from big data
  • Create, score, and deploy partitioning models generated from big data
  • Use R in the SQL Server and Hadoop environments 

Module 1: Microsoft R Server and R Client

Module 2: Exploring Big Data

Module 3: Visualizing Big Data

Module 4: Processing Big Data

Module 5: Parallelizing Analysis Operations

Module 6: Creating and Evaluating Regression Models

Module 7: Creating and Evaluating Partitioning Models

Module 8: Processing Big Data in SQL Server and Hadoop

Vui lòng bấm "Course Outline" để xem thêm nội dung chi tiết.

Thời lượng: 3 Ngày

Giảng Viên: Microsoft Certified Trainer

Chứng nhận hoàn thành khóa học của Microsoft

CÁC KHÓA HỌC LIÊN QUAN

20764 : Administering a SQL Database Infrastructure

20765 : Provisioning SQL Databases

20761 : Querying Data with Transact-SQL

20762 : Developing SQL Databases

20767 : Implementing a SQL Data Warehouse

20768 : Developing SQL Data Models

20775 : Performing Data Engineering on Microsoft HD Insight

20776 : Performing Big Data Engineering on Microsoft Cloud Services

20773 : Analyzing Big Data with Microsoft R

10988 : Managing SQL Business Intelligence Operations

55163 : Data Modeling with SQL BISM Tabular Mode

55246 : SQL Always On High Availability with SQL 2016

55073 : Master Data Services, Data Quality Services with SQL 2012-2014 and Excel

55096 : Securing Data on Microsoft SQL Server 2012

20467 : Designing Business Intelligence Solutions with Microsoft SQL Server 2014

20466 : Implementing Data Models and Reports with SQL Server 2014

20465 : Designing Solutions for Microsoft SQL Server 2014

20464 : Developing Microsoft SQL Server 2014 Databases

55119 : SQL Server 2012 Reporting Services

55120 : Quick Microsoft SQL 2012 Integration Services

55144 : SQL Server Performance Tuning and Optimization

50555 : Visualizing Data with SQL 2008 R2 and Report Builder 3.0

55005 : Microsoft Report Builder 3.0 with SQL 2008R2, SQL 2012, and SQL 2014

55123 : Writing Reports with Report Builder and SSRS Level 1

55128 : Writing Reports with Report Builder and SSRS Level 2

55204 : Writing Reports with Report Designer and SSRS 2014 Level 1

55170 : Writing Reports with Report Designer and SSRS 2016 Level 2

40562 : Microsoft Cloud Workshop - Migrating SQL databases to Azure

55240 : Writing Reports with Report Designer and SSRS Level 3

DP-070 : Migrate Open Source Data Workloads to Azure