Mining Data from Text

This course discusses text and document feature vectors that can be passed into machine learning models, topic modeling using Latent Semantic Analysis, Latent Dirichlet Allocation, Non-negative Matrix Factorization, and keyword extraction using RAKE.

by Janani Ravi

Get started Preview course

What you'll learn

A large part of the appeal of deep learning models is their ability to work with unstructured data types such as text, images, and video. However such models are only as good as the feature vectors that they operate on.

In this course, Mining Data from Text, you will gain the ability to build highly optimized and efficient feature vectors from textual and document data. First, you will learn how to represent documents as numeric data using simple numeric identifiers for individual words as well as more elegant methods such as term frequency and inverse document frequency. Next, you will discover how to perform topic modeling using techniques such as latent semantic analysis, latent Dirichlet allocation, and non-negative matrix factorization. Finally, you will explore how to implement keyword extraction using a popular algorithm - RAKE. When you’re finished with this course, you will have the skills and knowledge to move on to build efficient and optimized feature vectors from a large document corpus and use those feature vectors in building powerful machine learning models.

Try this course for free

Access this course and other top-rated tech content with a free trial.

Free individual trial Free team trial

Have questions?

Get them answered now.

Start a live chat

Course Info

Rating

(25 reviews)

Level

Intermediate

Last updated

Jun 28, 2019

Duration

2h 21m 58s

Course Overview | 1m 40s

About the author

Janani Ravi

A problem solver at heart, Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework.

More Courses by Janani

Mining Data from Text

What you'll learn

Table of contents

Course Overview 1m 40s

Modeling Text Using Natural Language Processing 39m 48s

Building Classification Models Using Text Data 22m 33s

Understanding Topic Modeling 16m 10s

Implementing Topic Modeling 47m 50s

Understanding and Implementing Keyword Extraction 13m 55s

About the author