Course Number: DATA-128WA
Duration: 1 day (6.5 hours)
Format: Live, hands-on

DataOps Training for IT Professionals Overview

This DataOps for IT Professionals training course teaches attendees how to elevate the quality of their data, increasing the effectiveness of the analytical work based on this data that supports organizational decisions. Participants learn how to incorporate practical plans and technical assistance throughout the entire data lifecycle, including data acquisition, storage, processing, and consumption.

Location and Pricing

Accelebrate offers instructor-led enterprise training for groups of 3 or more online or at your site. Most Accelebrate classes can be flexibly scheduled for your group, including delivery in half-day segments across a week or set of weeks. To receive a customized proposal and price quote for private corporate training on-site or online, please contact us.

In addition, some courses are available as live, instructor-led training from one of our partners.

Objectives

  • Understand the enterprise data processing and IT systems challenges
  • Correct "bad" input data
  • Perform data cleansing
  • Deal with missing and duplicate data
  • Enforce data consistency
  • Implement data governance

Prerequisites

All attendees must have practical work experience in data processing.

Outline

Expand All | Collapse All

DataOps Introduction
  • DataOps Enterprise Data Technologies
  • Enterprise Data Processing Challenges and IT Systems' Woes:
    • Data Quality
    • What Makes Information Systems Cluttered and Myopic
    • Fragmented Data Sources
    • Different Data Formats
    • System Interoperability
    • Maintenance Issues
  • Data-Related Roles
  • Data Engineering
  • What is DataOps?
  • The DataOps Technology and Methodology Stack
  • The DataOps Manifesto
  • Agile Development
  • DevOps
  • The Lean Manufacturing Methodology
  • Key Components of a DataOps Platform
  • Overview of DataOps Tools and Services
  • Overview of DataOps Platforms
Data Quality
  • Data Quality Definitions
  • Dimensions of Data Quality
  • Defining "Bad" Data
    • Missing Data
    • Wrong/Incorrect Data or Data Format
    • Inconsistent Data
    • Outdated (Stale) Information
    • Unverifiable Data
    • Withheld Data
  • Common Causes for “Bad" Data
    • Human Factor
    • Infrastructure- and Network-Related Issues
    • Software Defects
    • Using the Wrong Tool for the Job
    • Using Untrusted Data
    • Aggregation of Data from Disparate Data Sources that have Impedance Mismatch
    • Wrong QoS Settings of Queueing Systems
    • Wrong Caching System Settings, e.g. TTL
    • Not Using the "Ground Truth" Data
    • Differently Configured Development/UAT/Production Systems
    • Confusing Big-Endian and Little-Endian Byte Order
  • Ensuring Data Quality
    • Ensuring Integrity of Datasets 
  • Dealing with "Bad" Input Data
    • DDL-enforced Schema & Schema-on-Demand (-on-Read)
    • SQL Constraints as Rules for Column-Level and Table-Wide Data
    • XML Schema Definition (XSD) for XML Documents
    • Validating JSON Documents
    • Regular Expressions
    • Data Cleansing of Data at Rest
    • Controlling Integrity of Data-in-Transit
    • Database Normalization
    • Using Assertions in Applications
    • Operationalizing Input Data Validation
  • Data Consistency and Availability
  • Dealing with Duplicate Data
  • Dealing with Missing (NaN) Data
  • Master (Authoritative) Data Management
  • Enforcing Data Consistency with the scikit-learn LabelEncoder Class
  • Data Provenance
  • The Event Sourcing Pattern
  • Adopting the Culture of Automation
  • On-going Auditing
  • Monitoring and Alerting
  • UiPath
  •  Workflow (Pipeline) Orchestration Systems
How to Lead with Data
  • Enterprise Architecture Components
    • Business Architecture
    • Information Architecture
    • Application Architecture
    • Technology Architecture
  • DataOps Functional Architecture
  • The Snowflake Data Cloud
  • Cloud Design for System Resiliency
  • New Data Architecture:
    • Data Ownership
    • Shared Environment Security Controls
Data Governance (Optional)
  • The Need for Data Governance
  • Controlling the Decision-Making Process
  • Controlling "Agile IT"
  • Types of Requirements
    • Product
    • Process
  • Scoping Requirements
  • Governance Gotchas
  • Governance Best Practices
Conclusion

Training Materials

All DataOps training attendees receive comprehensive courseware.

Software Requirements

  • Computer with Internet connectivity
  • Ability to install software on the computer
  • Recent 64-bit OS, such as Windows 10/11, macOS, or Linux


Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging

Multiple Payment Options

We accept check, ACH/EFT, major credit cards, and most purchase orders



Recent Training Locations

Alabama

Birmingham

Huntsville

Montgomery

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

Los Angeles

Oakland

Orange County

Sacramento

San Diego

San Francisco

San Jose

Colorado

Boulder

Colorado Springs

Denver

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Jacksonville

Miami

Orlando

Tampa

Georgia

Atlanta

Augusta

Savannah

Hawaii

Honolulu

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Cedar Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Frederick

Hagerstown

Massachusetts

Boston

Cambridge

Springfield

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Minneapolis

Saint Paul

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Las Vegas

Reno

New Jersey

Princeton

New Mexico

Albuquerque

New York

Albany

Buffalo

New York City

White Plains

North Carolina

Charlotte

Durham

Raleigh

Ohio

Akron

Canton

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Oklahoma City

Tulsa

Oregon

Portland

Pennsylvania

Philadelphia

Pittsburgh

Rhode Island

Providence

South Carolina

Charleston

Columbia

Greenville

Tennessee

Knoxville

Memphis

Nashville

Texas

Austin

Dallas

El Paso

Houston

San Antonio

Utah

Salt Lake City

Virginia

Alexandria

Arlington

Norfolk

Richmond

Washington

Seattle

Tacoma

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Calgary

Edmonton

British Columbia

Vancouver

Manitoba

Winnipeg

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan