SmartInternz

Objectives of the FDP

To understand the need and complexities in handling analysis of large volumes of data.
To provide a basic understanding of the concepts and methods for implementing machine learning.
To provide an in-depth understanding of various machine learning algorithms.
Understand the concepts of Big Data Analytics.
To gain hands on experience in machine learning using PySpark.
To develop predictive models against real world business scenarios using PySpark.

Learning Outcomes

Upon completion of the faculty development programme, a participant will be able to:

Understand the importance of machine learning in data analysis and modeling.
Get exposure to a variety of machine learning algorithms.
Understand the process of automation with machine learning using Spark.
Gain experience in doing independent study and research on real world big data problems using scalable machine learning practices.

Modules to be covered

The six day faculty development programme emphasizes on discussing various concepts from fundamentals of machine learning and big data analytics to development and deployment of predictive data models using Spark. Brief discussions include:

Day	Topics
Day-1	Introduction to Machine Learning Types of Machine Learning Understanding Math for Machine Learning Understanding Big Data Analytics A quick overview of Hadoop environment Getting started with Spark Understanding Spark Programming model Spark Clusters and dataframes Machine Learning algorithms by Spark Spark vs Hadoop Designing a Machine Learning System Business use cases on Customer Segmentation, Personalization Data cleansing Preprocessing and Preparing Data with Spark Exploring and visualizing data Data processing and transformation Feature Extraction
Day-2	Building Classification Models with Spark Introduction to types of Classification Models Logistic Regression Decision Trees Random Forest Classifier Evaluating performance of models Improving model performance and hyper parameter tuning Understanding Accuracy and Prediction Precision and Recall Working with ROC curves and AUC
Day-3	Developing Regression Models with Spark Linear Regression Multiple Linear Regression Feature Engineering Model evaluation Understanding Mean Squared Error and Root Mean Square Error Mean Absolute Error and R-squared coefficient Improving model performance and hyper parameter tuning
Day-4	Developing a Clustering Model with Spark Introduction to types of clustering K-Means clustering Feature Engineering Hierarchical clustering Evaluating model performance Internal and External evaluation metrics Improving model performance
Day-5	Building a Recommendation Engine with Spark Content based filtering Collaborative filtering Training and using a recommendation model Evaluating performance of recommendation models Working with Mean Squared Error Mean average precision at K
Day-6	Natural Language Processing Introduction NLP Framework-Tokenization, Stemming, Count Vectorization ,TF-IDF Sentiment Analysis using NLP

Big Data Analytics, Machine Learning with Hadoop and PySpark

For Faculty : INR 6,000

For Working Professional : INR 10,000

Start Date: 13th April, 2020

Objectives of the FDP

Learning Outcomes

Speaker

Mr.P.Mohan

Modules to be covered

Registrations

PRIVACY & TOS

NAVIGATE

CONTACT US