Course Number: DATA-119
Duration: 2 days (13 hours)
Format: Live, hands-on

Big Data Analytics Options on AWS Training Overview

This Data Analytics on AWS training course gives attendees a comprehensive overview of AWS' core offerings and teaches students how to use AWS Lake Formation Service, Simple Storage Service (S3), Snowflake Cloud Data Platform, Amazon QuickSight, visualization and reporting tools, AWS Glue  PySpark extensions, and more. Participants learn to choose which big data AWS services best align with their desired outcomes and power their projects with the appropriate AWS data analytics tools.

Location and Pricing

Accelebrate offers instructor-led enterprise training for groups of 3 or more online or at your site. Most Accelebrate classes can be flexibly scheduled for your group, including delivery in half-day segments across a week or set of weeks. To receive a customized proposal and price quote for private corporate training on-site or online, please contact us.

In addition, some courses are available as live, instructor-led training from one of our partners.

Objectives

  • Use S3 Through Management Console
  • Sign Up for the Free Trial of Snowflake
  • The Snowflake Web UI
  • Create and Work with Databases in Snowflake
  • Understand AWS Glue
  • Use AWS Glue Crawlers and Classifiers
  • Create an S3 Bucket for AWS Glue ETL Script Output
  • Create and Work with Glue Scripts
  • Use PySpark API Directly
  • Understand AWS Glue ETL Jobs

Prerequisites

Outline

Expand All | Collapse All

Introduction
The AWS Lake Formation Service
  • First, What is a Data Lake?
  • Data Lakes vs. Traditional Data Warehouses
  • Characteristics of Data Warehouses and Data Lakes
  • Now, What is AWS Lake Formation?
  • What are the Benefits of Using Lake Formation?
  • How Lake Formation Works
  • The Lake Formation Dashboard
  • AWS Lake Formation Pricing
AWS Simple Storage Service
  • What is AWS Simple Storage Service (S3)
  • AWS S3
  • Storage
  • Regions
  • S3 Regions
  • Getting started with S3
  • Using BitTorrent
  • More on Buckets
  • Bucket Configurable Properties
  • Advanced S3 Bucket Properties
  • The Bucket Creation Dialog in the Management Console
  • Bucket Permissions
  • Bucket-level Operations
  • Authorization of REST Requests
  • Adding Cross-Origin Resource Sharing Configuration
  • Event Notifications
  • The Requester Pays Option
  • The Object Key
  • Object Versioning
  • Example of Object Properties
  • Object Storage Class Levels
  • Object-level Operations
  • Object Lifecycle Configuration
  • Amazon S3 Data Consistency Model
  • Observable Data Consistency Behaviors
  • Eventually Consistent Reads vs Consistent Reads
  • Amazon S3 Security
  • S3 Use Case: Backup and Archiving
  • Another S3 Use Case: Static Web Hosting
  • More on Static Web Hosting
  • S3 Static Website Hosting Dialog in Management Console
  • S3 Use Case: Disaster Recovery
  • AWS S3 Pricing
  • Storage Pricing
  • Request Pricing
  • Data Transfer Pricing
  • Amazon S3 Transfer Acceleration
  • How to Enable Transfer Acceleration
  • Enabling Transfer Acceleration in the Management Console
  • Amazon S3 SLA Definitions
  • Amazon S3 SLA Service Commitment
  • S3 CLI
Introduction to the Snowflake Cloud Data Platform
  • What is Snowflake?
  • Certifications
  • Snowflake Conceptual Architecture
  • Core Underlying Design Considerations
  • Core Services and Tools
  • Snowflake Editions
  • The Standard Edition
  • The Enterprise Edition
  • The Business Critical Edition
  • Virtual Private Snowflake
  • Billing: The Cost Components
  • Data Storage Segments
  • Parts of Snowflake that Incur Compute-related Costs
  • Snowflake Quickstart
Snowflake's Web UI
  • Web UI (Web Portal)
  • The Landing Page
  • Snowflake Roles
  • The Roles UI
  • Databases
  • Shares
  • Data Marketplace
  • The Warehouses UI
  • Worksheets
  • History
  • A History Sample
  • Account
  • Operational Transparency: Controlling the Usage of Your Account
  • Create Network Policy Dialog (under Account > Policies)
  • Preview App
Visualization and Reporting
  • Amazon QuickSight
  • SPICE
  • Data Analyses
  • Visuals
  • Sheets
  • Dashboards
  • Typical Amazon QuickSight Workflow
  • Create a Data Set
  • Create an Analysis
  • Create a Visual Manually
  • Amazon Athena
  • Amazon Athena and AWS Data Catalog
  • Query Data Using Amazon Athena
  • What is Tableau?
  • Create a Report Using Tableau
  • Tutorial: Get Started with Tableau Desktop
Introduction to AWS Glue
  • What is AWS Glue?
  • AWS Glue Components
  • Managing Notebooks
  • Putting it Together: The AWS Glue Environment Architecture
  • AWS Glue Main Activities
  • Additional Glue Services
  • AWS Glue Pricing
  • When To Use AWS Glue?
  • Integration with other AWS Services
AWS Glue PySpark Extensions
  • AWS Glue and Spark
  • The DynamicFrame Object
  • The DynamicFrame API
  • The GlueContext Object
  • Glue Transforms
  • A Sample Glue PySpark Script
  • Using PySpark
  • AWS Glue PySpark SDK
Conclusion

Training Materials

All AWS for Data training students will receive comprehensive courseware.

Software Requirements

A modern web browser and an Internet connection.



Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging

Multiple Payment Options

We accept check, ACH/EFT, major credit cards, and most purchase orders



Recent Training Locations

Alabama

Birmingham

Huntsville

Montgomery

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

Los Angeles

Oakland

Orange County

Sacramento

San Diego

San Francisco

San Jose

Colorado

Boulder

Colorado Springs

Denver

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Jacksonville

Miami

Orlando

Tampa

Georgia

Atlanta

Augusta

Savannah

Hawaii

Honolulu

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Cedar Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Frederick

Hagerstown

Massachusetts

Boston

Cambridge

Springfield

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Minneapolis

Saint Paul

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Las Vegas

Reno

New Jersey

Princeton

New Mexico

Albuquerque

New York

Albany

Buffalo

New York City

White Plains

North Carolina

Charlotte

Durham

Raleigh

Ohio

Akron

Canton

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Oklahoma City

Tulsa

Oregon

Portland

Pennsylvania

Philadelphia

Pittsburgh

Rhode Island

Providence

South Carolina

Charleston

Columbia

Greenville

Tennessee

Knoxville

Memphis

Nashville

Texas

Austin

Dallas

El Paso

Houston

San Antonio

Utah

Salt Lake City

Virginia

Alexandria

Arlington

Norfolk

Richmond

Washington

Seattle

Tacoma

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Calgary

Edmonton

British Columbia

Vancouver

Manitoba

Winnipeg

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan