Externship Program - Applied Data Science
Module-1: Introduction to Data Science
- What is data science
- Why is it important
- Use Cases of Data Science
- The Various Data Science Disciplines
- Data Science and Business Buzzwords
- What is the difference between Analysis and Analytics
- ML In Data Science
- Data Science Methodology
Module-2: Python for Data Science
- Python Basics
- Python Packages
- Working with NUMPY
- Working with Pandas
- Introduction to Data Visualization
- Exploratory Data Analysis with Matplotlib and Seaborn
- Basic Plotting with Matplotlib and Seaborn
Module-3: Mathematics for Data Science
- Introduction to descriptive Statistics
- Mean, Median, Mode
- Skewness
- Range & IQR
- Sample vs. Population
- Variance & Standard deviation
- Impact of Scaling & Shifting
- What is a distribution?
- Normal distribution
- Z-Scores
- Central Limit Theorem
- Hypothesis Testing
- Correlation, And Regression
- Linear Algebra
- Calculus
Module-4: Data Wrangling Techniques- Introduction to Data preprocessing
- Importing the Dataset
- Handling Missing data
- Working with categorical Data
- Splitting the data into Train and Test set
- Outlier Analysis
- Feature Scaling
Module-5: Supervised Learning - Regression
- Introduction to Regression
- Linear Regression
- Multi Linear Regression
- Polynomial Regression
- Ridge Regression
- Lasso Regression
Module-6: Supervised Learning - Classification
- Introduction to Classification
- Logistic Regression
- Decision Tree Classification
- Random Forest Classification
- K-nearest Neighbors
- Naïve-Bayes
- Support Vector Machine
- Ensembling Techniques
Module-7: Model Evaluation Metrics
- Regression Evaluation Metrics
- MAE
- MSE
- R Squared
- RMSE
- Confusion Metrics
- Accuracy
- Precision
- Recall F1 Score
- AUC ROC Curves
Module-8: Model Hyper-parameter Optimization
- Oversampling
- Undersampling
- Ensembling Techniques
- SMOTE
- Grid Search
- Randomize Search
Module-9: Unsupervised Learning
- Introduction to Clustering
- K-Means Clustering
- Hierarchical Clustering
- Clustering use cases
Module-10: Build & Deploy ML Application
- Introduction to different modes of deployment
- Working with Flask Framework
- Building application with flask framework
- Integrating Machine Learning model with web application