Applied Machine Learning using Python and Apache Spark


Course Number: PYTH-232
Duration: 3 days (19.5 hours)
Format: Live, hands-on

ML with Python and Spark Training Overview

This Applied Machine Learning using Python and Apache Spark training teaches attendees Machine Learning (ML) concepts, terminology, and usage. Students learn how to perform and scale ML tasks using Python libraries (including NumPy, Pandas, Matplotlib, and Scikit-learn) on the Apache Spark platform.

Location and Pricing

Accelebrate offers instructor-led enterprise training for groups of 3 or more online or at your site. Most Accelebrate classes can be flexibly scheduled for your group, including delivery in half-day segments across a week or set of weeks. To receive a customized proposal and price quote for private corporate training on-site or online, please contact us.

In addition, some courses are available as live, instructor-led training from one of our partners.

Objectives

  • Gain a basic understanding of Machine Learning
  • Understand the differences between supervised and unsupervised learning
  • Understand how to use Python libraries to explore, clean, and prepare data
  • Describe the role of ML and where it fits into IT strategies
  • Explain the technical and business drivers that result from using Machine Learning
  • Understand techniques like classification, clustering, and regression
  • Discuss how to identify which techniques should be applied for a specific use case
  • Understand popular machine offerings, including Amazon Machine Learning, TensorFlow, Azure Machine Learning, Google Cloud, Spark MLlib, Python, R, and more
  • Install and set up Anaconda
  • Use Jupyter Notebooks
  • Understand the popular Machine Learning algorithms, including linear regression, decision tree, logistic regression, K-nearest neighbor, K-means clustering, and more
  • Use Python libraries like NumPy, Pandas, Matplotlib and Scikit-learn
  • Understand Apache Spark Processing Framework and distributed architecture
  • Compare Machine learning using Python versus Apache Spark
  • Use Databricks cloud with Apache Spark MLlib

Prerequisites

All attendees must have familiarity with Python. Having a working knowledge of Spark is a plus, but not required.

Outline

Expand All | Collapse All

Introduction
  • History and background of Machine Learning
  • Compare traditional programming to Machine Learning
  • Supervised and unsupervised learning overview
Machine Learning Patterns
  • Classification
  • Clustering
  • Regression
Gartner Hype Cycle for Emerging Technologies
  • Machine Learning offerings in the industry
  • Install and set up Anaconda
  • Descriptive statistics
  • Jupyter Notebooks
Essential Libraries
  • NumPy
  • Pandas
  • Matplotlib
Exploratory Data Analysis
Getting Data
Feature Selection
Essential libraries
  • Scikit-learn
Transforming Data
Binary Encoding
One-Hot Encoding
Feature Engineering
Algorithms
  • Linear regression
  • Naive Bayes
  • Decision tree
  • Random forest
  • Logistics regression
  • Support vector machine
  • K-nearest neighbor
  • K-means clustering
Data Modeling
Apache Spark Overview
  • Spark libraries
Machine Learning using Python Versus using Spark
Databricks Cloud Community Account Setup
Measuring Performance
  • Confusion Matrix
  • ROC Curve, Area Under Curve (AUC)
Refining the Model
Hyperparameter Tuning
Grid Search
Spark MLlib
Conclusion and Next Steps

Training Materials

All Machine Learning training students receive comprehensive courseware.

Software Requirements

  • Windows, Mac, or Linux with at least 8 GB RAM
    • Most class activities will create Spark code and visualizations in a browser-based notebook environment. The class also details how to export these notebooks and how to run code outside of this environment.
  • A current version of Anaconda for Python 3.x
  • Related lab files that Accelebrate will provide
  • Internet access


Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging

Multiple Payment Options

We accept check, ACH/EFT, major credit cards, and most purchase orders



Recent Training Locations

Alabama

Birmingham

Huntsville

Montgomery

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

Los Angeles

Oakland

Orange County

Sacramento

San Diego

San Francisco

San Jose

Colorado

Boulder

Colorado Springs

Denver

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Jacksonville

Miami

Orlando

Tampa

Georgia

Atlanta

Augusta

Savannah

Hawaii

Honolulu

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Cedar Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Frederick

Hagerstown

Massachusetts

Boston

Cambridge

Springfield

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Minneapolis

Saint Paul

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Las Vegas

Reno

New Jersey

Princeton

New Mexico

Albuquerque

New York

Albany

Buffalo

New York City

White Plains

North Carolina

Charlotte

Durham

Raleigh

Ohio

Akron

Canton

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Oklahoma City

Tulsa

Oregon

Portland

Pennsylvania

Philadelphia

Pittsburgh

Rhode Island

Providence

South Carolina

Charleston

Columbia

Greenville

Tennessee

Knoxville

Memphis

Nashville

Texas

Austin

Dallas

El Paso

Houston

San Antonio

Utah

Salt Lake City

Virginia

Alexandria

Arlington

Norfolk

Richmond

Washington

Seattle

Tacoma

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Calgary

Edmonton

British Columbia

Vancouver

Manitoba

Winnipeg

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan