Comprehensive Generative AI Engineering for Data Scientists and ML Engineers


Course Number: AI-134WA
Duration: 5 days (32.5 hours)
Format: Live, hands-on

Generative AI Training for Data Scientists Overview

This Comprehensive Generative AI (GenAI) course is for Machine Learning and Data Science professionals who want to dive deep into the world of GenAI and Large Language Models (LLMs). This course covers various topics, from the foundations of LLMs to advanced techniques like fine-tuning, domain adaptation, and evaluation. Participants gain hands-on experience with popular tools and frameworks, including Python, Hugging Face Transformers, and open-source LLMs.

Location and Pricing

Accelebrate offers instructor-led enterprise training for groups of 3 or more online or at your site. Most Accelebrate classes can be flexibly scheduled for your group, including delivery in half-day segments across a week or set of weeks. To receive a customized proposal and price quote for private corporate training on-site or online, please contact us.

In addition, some courses are available as live, instructor-led training from one of our partners.

Objectives

  • Understand the architecture, training techniques, and evaluation methods for Large Language Models (LLMs)
  • Fine-tune and adapt open-source LLMs for domain-specific tasks and applications
  • Develop and optimize prompts for improved LLM performance and output quality
  • Implement advanced techniques such as Retrieval Augmented Generation (RAG) and vector embeddings
  • Evaluate and compare LLM performance using appropriate metrics and benchmarks

Prerequisites

  • Practical experience (6+ months) minimum in Python - functions, loops, control flow
    • Data Science basics - NumPy, pandas, scikit-learn
  • Solid understanding of machine learning concepts and algorithms
    • Regression, Classification, Unsupervised learning (clustering, Neural Networks)
  • Strong foundations in probability, statistics, and linear algebra
  • Practical experience with at least one deep learning framework (e.g., TensorFlow or PyTorch) recommended
  • Familiarity with natural language processing (NLP) concepts and techniques, such as text preprocessing, word embeddings, and language models

Outline

Expand All | Collapse All

LLM Foundations for ML and Data Science
  • Overview of Generative AI and LLMs
  • LLM Architecture and Training Techniques
    • Deep dive into the transformer architecture and its components
    • Exploring pre-training, fine-tuning, and transfer learning techniques
Prompt Engineering for LLMs
  • Introduction to Prompt Engineering
    • Techniques for creating effective prompts
    • Best practices for prompt design and optimization
  • Developing prompts for various NLP tasks
    • Text classification, sentiment analysis, named entity recognition
LLM Evaluation and Comparison
  • Overview of metrics and benchmarks for evaluating LLM performance
  • Techniques for comparing LLMs and selecting the best model for a given task
  • Evaluating and comparing LLMs for a specific NLP task
Fine-Tuning and Domain Adaptation
  • Introduction to Open-Source LLMs
    • Advantages and limitations in ML and data science projects
  • Preparing domain-specific datasets for fine-tuning LLMs
  • Techniques for adapting LLMs to new domains and tasks using transfer learning
  • Fine-tuning and adapting an open-source LLM for a specific domain
Advanced Fine-Tuning and RAG Techniques
  • Advanced fine-tuning techniques for LLMs
  • Implementing Retrieval Augmented Generation (RAG)
    • Improving LLM output quality and relevance
  • Building a RAG-powered LLM application for a specific use case
Vector Embeddings and Semantic Search
  • Introduction to vector embeddings and their applications in NLP
  • Using vector embeddings for semantic search and recommendation systems
    • Generating vector embeddings from text data
    • Implementing a similarity search using libraries like Faiss or Annoy
LLM Optimization and Efficiency
  • Techniques for optimizing LLM performance
    • Quantization and pruning
  • Applying optimization techniques to reduce LLM model size and inference time
  • Strategies for efficient deployment and serving of LLMs in production
Ethical Considerations and Best Practices
  • Addressing biases and fairness issues in LLMs
  • Ensuring transparency and accountability in LLM-powered applications
  • Best practices for responsible AI development and deployment
  • Navigating privacy and security concerns when working with LLMs and sensitive data
Conclusion

Training Materials

All Generative AI training students receive comprehensive courseware.

Software Requirements

All attendees must have a modern web browser and an Internet connection.



Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging

Multiple Payment Options

We accept check, ACH/EFT, major credit cards, and most purchase orders



Recent Training Locations

Alabama

Birmingham

Huntsville

Montgomery

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

Los Angeles

Oakland

Orange County

Sacramento

San Diego

San Francisco

San Jose

Colorado

Boulder

Colorado Springs

Denver

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Jacksonville

Miami

Orlando

Tampa

Georgia

Atlanta

Augusta

Savannah

Hawaii

Honolulu

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Cedar Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Frederick

Hagerstown

Massachusetts

Boston

Cambridge

Springfield

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Minneapolis

Saint Paul

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Las Vegas

Reno

New Jersey

Princeton

New Mexico

Albuquerque

New York

Albany

Buffalo

New York City

White Plains

North Carolina

Charlotte

Durham

Raleigh

Ohio

Akron

Canton

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Oklahoma City

Tulsa

Oregon

Portland

Pennsylvania

Philadelphia

Pittsburgh

Rhode Island

Providence

South Carolina

Charleston

Columbia

Greenville

Tennessee

Knoxville

Memphis

Nashville

Texas

Austin

Dallas

El Paso

Houston

San Antonio

Utah

Salt Lake City

Virginia

Alexandria

Arlington

Norfolk

Richmond

Washington

Seattle

Tacoma

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Calgary

Edmonton

British Columbia

Vancouver

Manitoba

Winnipeg

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan