Course Number: DATA-119

Duration: 2 days (13 hours)

Format: Live, hands-on

Big Data Analytics Options on AWS Training Overview

This Data Analytics on AWS training course gives attendees a comprehensive overview of AWS' core offerings, including S3, Redshift, QuickSight, and Glue. Participants learn to choose which big data AWS services best align with their desired outcomes and power their projects with the appropriate AWS data analytics tools.

Location and Pricing

Accelebrate offers instructor-led enterprise training for groups of 3 or more online or at your site. Most Accelebrate classes can be flexibly scheduled for your group, including delivery in half-day segments across a week or set of weeks. To receive a customized proposal and price quote for private corporate training on-site or online, please contact us.

In addition, we offer some courses as live, instructor-led online classes for individuals.

Objectives

  • Understand how to implement data warehouses using AWS Lake Formation service
  • Use S3 through the management console
  • Understand the architecture of Snowflake data platform
  • Use Snowflake web UI (a.k.a Web Portal, Snowflake Manager, and Snowflake Console)
  • Create databases, tables, and warehouses in the Snowflake Web UI
  • Understand how Amazon QuickSight builds visualizations, perform ad hoc analysis, and business insights
  • Explore the main capabilities of AWS Glue
  • Create a Glue crawler to work over a collection of CSV files using a customized classifier to infer their schemas
  • Create and run an AWS Glue ETL job

Prerequisites

Participants must have a general knowledge of programming.

Outline

Expand All | Collapse All

Introduction
The AWS Lake Formation Service
  • First, What is a Data Lake?
  • Data Lakes vs. Traditional Data Warehouses
  • Characteristics of Data Warehouses and Data Lakes
  • What is AWS Lake Formation?
  • What are the Benefits of Using Lake Formation?
  • How Lake Formation Works
  • The Lake Formation Dashboard
  • AWS Lake Formation Pricing
AWS Simple Storage Service
  • What is AWS Simple Storage Service (S3)?
  • AWS Storage
  • Regions
  • S3 Regions
  • Getting started with S3
  • Using BitTorrent
  • More on Buckets
  • Bucket Configurable Properties
  • Advanced S3 Bucket Properties
  • The Bucket Creation Dialog in the Management Console
  • Bucket Permissions
  • Bucket-level Operations
  • Authorization of REST Requests
  • Adding Cross-Origin Resource Sharing Configuration
  • Event Notifications
  • The Requester Pays Option
  • The Object Key
  • Object Versioning
  • Example of Object Properties
  • Object Storage Class Levels
  • Object-level Operations
  • Object Lifecycle Configuration
  • Amazon S3 Data Consistency Model
  • Observable Data Consistency Behaviors
  • Eventually Consistent Reads vs. Consistent Reads
  • Amazon S3 Security
  • S3 Use Case: Backup and Archiving
  • Another S3 Use Case: Static Web Hosting
  • More on Static Web Hosting
  • S3 Static Website Hosting Dialog in Management Console
  • S3 Use Case: Disaster Recovery
  • AWS S3 Pricing
  • Storage Pricing  
  • Request Pricing
  • Data Transfer Pricing
  • Amazon S3 Transfer Acceleration
  • How to Enable Transfer Acceleration
  • Enabling Transfer Acceleration in the Management Console
  • Amazon S3 SLA Definitions
  • Amazon S3 SLA Service Commitment
  • S3 CLT
Redshift
  • Overview
  • Terms
  • Data Warehouse
  • Traditional Extract Transform Load (ETL)
  • Data Lake
  • Database
  • Redshift Features
  • High Performance
  • Simple Management
  • Cost-effective
  • Elasticity
  • Query your Data Lake
  • Security
  • Partners Offer Certified Solutions
  • Data Warehouse Challenges
  • Redshift Spectrum
  • Where is the Data?
  • Scalability with Redshift Spectrum
  • Combine the data warehouse and the data lake
  • SQL Queries across S3 and Redshift
  • Redshift Query
Redshift Optimizations
  • Queues
  • Superuser Queue
  • Default Queues
  • User Defined Queues
  • Workload Management (WLM)
  • Concurrency level
  • User groups
  • Query groups
  • Memory %
  • Timeout
  • Sort keys
  • Compound Sort Keys
  • Interleaved Sort Keys
  • Cleaning Up Data
Visualization and Reporting
  • Amazon QuickSight
  • SPTCE
  • Data Analyses
  • Visuals
  • Sheets
  • Stories
  • Dashboards
  • Typical Amazon QuickSight Workflow
  • Create a Data Set
  • Create an Analysis
  • Create a Visual Manually
  • Amazon Athena
  • Amazon Athena and AWS Data Catalog
  • Query Data Using Amazon Athena
  • Create a Report Using Tablea
  • Create a Report Using Tableau
Introduction to AWS Glue
  • What is AWS Glue?
  • AWS Glue Components
  • Managing Notebooks
  • AWS Glue Components
  • Putting it Together: The AWS Glue Environment Architecture
  • AWS Glue Main Activities
  • Additional Glue Services
  • When To Use AWS Glue
  • Integration with other AWS Services
AWS Glue PySpark Extensions
  • AWS Glue and Spark
  • The DynamicFrame Object
  • The DynamicFrame APT
  • The GlueContext Object
  • Glue Transforms
  • A Sample Glue PySpark Script
  • Using PySpark
  • AWS Glue PySpark SDK
Conclusion

Training Materials

All AWS for Data training students will receive comprehensive courseware.

Software Requirements

A modern web browser and an Internet connection.



Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging

Multiple Payment Options

We accept check, ACH/EFT, major credit cards, and most purchase orders



Recent Training Locations

Alabama

Birmingham

Huntsville

Montgomery

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

Los Angeles

Oakland

Orange County

Sacramento

San Diego

San Francisco

San Jose

Colorado

Boulder

Colorado Springs

Denver

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Jacksonville

Miami

Orlando

Tampa

Georgia

Atlanta

Augusta

Savannah

Hawaii

Honolulu

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Cedar Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Frederick

Hagerstown

Massachusetts

Boston

Cambridge

Springfield

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Minneapolis

Saint Paul

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Las Vegas

Reno

New Jersey

Princeton

New Mexico

Albuquerque

New York

Albany

Buffalo

New York City

White Plains

North Carolina

Charlotte

Durham

Raleigh

Ohio

Akron

Canton

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Oklahoma City

Tulsa

Oregon

Portland

Pennsylvania

Philadelphia

Pittsburgh

Rhode Island

Providence

South Carolina

Charleston

Columbia

Greenville

Tennessee

Knoxville

Memphis

Nashville

Texas

Austin

Dallas

El Paso

Houston

San Antonio

Utah

Salt Lake City

Virginia

Alexandria

Arlington

Norfolk

Richmond

Washington

Seattle

Tacoma

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Calgary

Edmonton

British Columbia

Vancouver

Manitoba

Winnipeg

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan