Text Analytics and Natural Language Processing (NLP) with R

RPROG-114 (3 Days)

Request Pricing

NLP Training Overview

Accelebrate's Natural Language Processing (NLP) with R training course teaches attendees how to use R programming to explore and analyze text data.  This class comprehensively covers methods for ingesting text data from a variety of sources such as plain text files, pdfs, or the web, and then processing that data using the latest natural language processing and deep learning techniques.

Location and Pricing

Most Accelebrate courses are taught as private, customized training for 3 or more attendees at our clients' sites worldwide. In addition, we offer live, private online classes for teams who may be in multiple locations or wish to save on travel costs. Please visit our client list for organizations for whom we have delivered onsite training. To receive a customized proposal and price quote for private on-site or online training, please contact us.

NLP Training Objectives

Students will be able to

  • Import text data from a variety of source formats
  • Tokenize text data to meaningful units
  • Wrangle text data using specific textual functions
  • Compute aggregating measures on tokenized data
  • Translate between text data formats
  • Complete a sentiment analysis
  • Perform document classification
  • Perform topic modeling
  • Built a simple neural network appropriate for NLP modeling

NLP Training Outline

Expand All | Collapse All | Printer-Friendly

Working with unstructured text data
  • string methods
  • regex
  • reading in text files
  • review of base (R/Python)
Importing
  • parsing data from a text file
  • importing it into a tidy structure
  • parsing data from a pdf
    • From a “pile of pdfs”
  • scraping data from the web
  • Discussion of other methods
    • OCR
    • Handwriting recognition
Managing Text Data 1
  • a tidy text format
  • Overview of text data formats
    • tidy text
    • token list
    • Bag of words
    • document term matrix or document frequency matrix (dfm/dt)
    • corpus
    • docvars
  • associated formats
    • stop words
    • Sentiment lexica
    • word vectors / models
Managing Text Data 2
  • tokenizing text
  • units of tokenization
    • tokens
    • lemma
    • stems
    • n-grams
    • sentences
    • Tweets
  • Tf-idf
  • Log-odds (tidylo)
Sentiment Analysis
  • Sentiment lexica
  • Sentiment analysis with inner_join
  • Analyzing by other units
  • Valence shifting
  • VADER
Document Classification
  • Text similarity - stringiest
    • Cosine
    • Edit distance
  • Machine Learning for document classification
    • Naive Bayes model
Topic Modeling / Document Clustering
  • LDA
  • stm
Text and Deep Learning
  • Deep learning introduction
  • Architecture of neural networks
  • Tensorflow + keras
  • Word vectors
    • word2vec
    • Text2vec
    • GloVe
    • Spacy
  • Combining Deep Learning and NLP
    • CNN
    • RNN
    • LSTM
  • Named Entity Recognition (NER)
  • Part of Speech tagging (POS)
  • Dependency Parsing
Conclusion
Request Pricing
Lecture percentage

40%

Lecture/Demo

Lab percentage

60%

Lab

Course Number:

RPROG-114

Duration:

3 Days

Prerequisites:

Students must have completed Accelebrate's Intro to R Programming training or have the equivalent experience. Students should have a working knowledge of the R language, RStudio, and the dplyr/tidyverse packages.

Training Materials:

All R Programming training students receive a copy of O’Reilly's Text Mining with R and related courseware.

Software Requirements:

  • R 3.0 or later with console
  • IDE or text editor of your choice (RStudio recommended)

Contact Us:

Accelebrate’s training classes are available for private groups of 3 or more people at your site or online anywhere worldwide.

Don't settle for a "one size fits all" public class! Have Accelebrate deliver exactly the training you want, privately at your site or online, for less than the cost of a public class.

For pricing and to learn more, please contact us.

Contact Us Train For Us

Toll-free in US/Canada:
877 849 1850
International:
+1 678 648 3113

Toll-free in US/Canada:
866 566 1228
International:
+1 404 420 2491

925B Peachtree Street, NE
PMB 378
Atlanta, GA 30309-3918
USA

Subscribe to our Newsletter:

Never miss the latest news and information from Accelebrate:

Microsoft Gold Partner

Please see our complete list of
Microsoft Official Courses

Recent Training Locations

Alabama

Huntsville

Montgomery

Birmingham

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

San Francisco

Oakland

San Jose

Orange County

Los Angeles

Sacramento

San Diego

Colorado

Denver

Boulder

Colorado Springs

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Miami

Jacksonville

Orlando

Saint Petersburg

Tampa

Georgia

Atlanta

Augusta

Savannah

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Ceder Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

Baton Rouge

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Hagerstown

Frederick

Massachusetts

Springfield

Boston

Cambridge

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Saint Paul

Minneapolis

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Reno

Las Vegas

New Jersey

Princeton

New Mexico

Albuquerque

New York

Buffalo

Albany

White Plains

New York City

North Carolina

Charlotte

Durham

Raleigh

Ohio

Canton

Akron

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Tulsa

Oklahoma City

Oregon

Portland

Pennsylvania

Pittsburgh

Philadelphia

Rhode Island

Providence

South Carolina

Columbia

Charleston

Spartanburg

Greenville

Tennessee

Memphis

Nashville

Knoxville

Texas

Dallas

El Paso

Houston

San Antonio

Austin

Utah

Salt Lake City

Virginia

Richmond

Alexandria

Arlington

Washington

Tacoma

Seattle

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Edmonton

Calgary

British Columbia

Vancouver

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan

© 2013-2019 Accelebrate, Inc. All Rights Reserved. All trademarks are owned by their respective owners.
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.