Performing Data Engineering on Microsoft HDInsight (MOC-20775)


Course Number: MOC-20775

Duration: 5 days (32.5 hours)

Format: Live, hands-on

Microsoft HDInsight Training Overview

This Microsoft course 20775, Performing Data Engineering on Microsoft HDInsight training, teaches students how to plan and implement big data workflows in HDInsight.

Location and Pricing

Accelebrate offers instructor-led enterprise training for groups of 3 or more online or at your site. Most Accelebrate classes can be flexibly scheduled for your group, including delivery in half-day segments across a week or set of weeks. To receive a customized proposal and price quote for private corporate training on-site or online, please contact us.

Objectives

  • Work with Hadoop, the MapReduce paradigm, and HDInsight
  • Deploy HDInsight clusters
  • Assign permissions
  • Load data into HDInsight
  • Troubleshoot HDInsight
  • Implement batch solutions
  • Design batch ETL solutions for big data with Spark
  • Analyze data with Spark SQL, Hive, and Phoenix
  • Work with Azure Stream Analytics
  • Create Spark structured streaming applications
  • Develop big data real-time processing solutions with Apache Storm

Prerequisites

In addition to their professional experience, students who attend this course should have:

  • Programming experience using R and familiarity with common R packages
  • Knowledge of common statistical methods and data analysis best practices.
  • Basic knowledge of the Microsoft Windows operating system and its core functionality.
  • Working knowledge of relational databases.

Outline

Expand All | Collapse All

Introduction
Getting Started with HDInsight
  • Big Data
  • Hadoop
  • MapReduce
  • HDInsight
Deploying HDInsight Clusters
  • HDInsight cluster types
  • Managing HDInsight Clusters
  • Managing HDInsight Clusters with PowerShell
Authorizing Users to Access Resources
  • Non-domain Joined clusters
  • Configuring domain-joined HDInsight clusters
  • Manage domain-joined HDInsight clusters
Loading data into HDInsight
  • HDInsight Storage
  • Data loading tools
  • Performance and reliability
Troubleshooting HDInsight
  • Analyze HDInsight logs
  • YARN logs
  • Heap dumps
  • Operations management suite
Implementing Batch Solutions
  • Apache Hive storage
  • Querying with Hive and Pig
  • Operationalize HDInsight
Design Batch ETL solutions for big data with Spark
  • What is Spark?
  • ETL with Spark
  • Spark performance
Analyze Data with Spark SQL
  • Implement interactive queries
  • Perform exploratory data analysis
Analyze Data with Hive and Phoenix
  • Implement interactive queries for big data with interactive hive.
  • Perform exploratory data analysis by using Hive
  • Perform interactive processing by using Apache Phoenix
Stream Analytics
  • Stream analytics
  • Process streaming data from stream analytics
  • Managing stream analytics jobs
Spark Streaming using the DStream API
  • Dstream
  • Create Spark structured streaming applications
  • Persistence and visualization
Develop big data real-time processing solutions with Apache Storm
  • Persist long term data
  • Stream data with Storm
  • Create Storm topologies
  • Configure Apache Storm
Analyze Data with Spark SQL
  • Implement interactive queries
  • Perform exploratory data analysis
Conclusion

Training Materials:

All Microsoft training students receive Microsoft official courseware.

Software Requirements:

Attendees will not need to install any software on their computer for this class. The class will be conducted in a remote environment that Accelebrate will provide; students will only need a local computer with a web browser with a stable Internet connection. Any recent version of Internet Explorer, Mozilla Firefox, or Google Chrome will be fine.

When you contact us about purchasing this class, we will provide a live demo of the online lab environment so that you may explore the web browser interface in more detail.



Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging

Multiple Payment Options

We accept check, ACH/EFT, major credit cards, and most purchase orders



Recent Training Locations

Alabama

Birmingham

Huntsville

Montgomery

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

Los Angeles

Oakland

Orange County

Sacramento

San Diego

San Francisco

San Jose

Colorado

Boulder

Colorado Springs

Denver

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Jacksonville

Miami

Orlando

Tampa

Georgia

Atlanta

Augusta

Savannah

Hawaii

Honolulu

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Cedar Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Frederick

Hagerstown

Massachusetts

Boston

Cambridge

Springfield

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Minneapolis

Saint Paul

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Las Vegas

Reno

New Jersey

Princeton

New Mexico

Albuquerque

New York

Albany

Buffalo

New York City

White Plains

North Carolina

Charlotte

Durham

Raleigh

Ohio

Akron

Canton

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Oklahoma City

Tulsa

Oregon

Portland

Pennsylvania

Philadelphia

Pittsburgh

Rhode Island

Providence

South Carolina

Charleston

Columbia

Greenville

Tennessee

Knoxville

Memphis

Nashville

Texas

Austin

Dallas

El Paso

Houston

San Antonio

Utah

Salt Lake City

Virginia

Alexandria

Arlington

Norfolk

Richmond

Washington

Seattle

Tacoma

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Calgary

Edmonton

British Columbia

Vancouver

Manitoba

Winnipeg

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan


© 2013-2021 Accelebrate, Inc. All rights reserved. All trademarks are owned by their respective owners.
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.