Diploma In Data Science Syllbus
1 | What is Data Science? |
2 | Importance and Applications |
3 | Data Science Workflow |
4 | Roles in Data Science (Data Analyst, Data Engineer, Data Scientist) |
5 | Tools and Technologies used |
1 | Python Basics (variables, data types, operators) |
2 | Control Structures (if, for, while) |
3 | Functions & Lambda Expressions |
4 | Lists, Tuples, Sets, Dictionarie |
5 | File Handling |
6 | Introduction to Jupyter Notebooks |
7 | Installing and using packages (pip, conda) |
1 | NumPy: Arrays, Array operations, Broadcasting, Indexing & Slicing |
2 | Pandas: Series and DataFrames |
3 | DataFrame operations (filtering, sorting, grouping, merging) |
4 | Handling missing data |
5 | Reading and writing data (CSV, Excel, JSON) |
1 | Identifying missing, duplicate, or inconsistent data
|
2 | Handling missing data (drop, fill) |
3 | Data type conversion |
4 | Outlier detection and treatment |
5 | Feature scaling (Normalization, Standardization) |
6 | Encoding categorical variables (Label, One-Hot) |
1 | Plotting basics (line, bar, pie charts) |
2 | Histogram, Boxplot, Scatter plot |
3 | Pairplot, Heatmap (Seaborn) |
4 | Customizing plots (labels, titles, legends, colors) |
5 | Plotting trends and distributions |
1 | Descriptive Statistics (mean, median, mode, std, variance) |
2 | Probability Basics (independent, dependent events) |
3 | Probability Distributions (Normal, Binomial, Poisson) |
4 | Central Limit Theorem |
5 | Hypothesis Testing (Z-test, T-test, Chi-square) |
1 | Data Summary and Insights |
2 | Univariate and Bivariate Analysis |
3 | Correlation Matrix |
4 | Outlier Analysis |
5 | Pattern Identification |
6 | Data Profiling |
1 | Definition & Importance |
2 | ML Lifecycle |
3 | Types of Machine Learning |
4 | Applications of ML |
5 | Tools and Libraries (scikit-learn, TensorFlow, Keras - briefly) |
1 | Supervised Learning: Concept and Examples |
2 | Unsupervised Learning: Concept and Examples |
3 | Key Differences
|
4 | Use cases for each |
1 | Linear Regression: Assumptions, Interpretation |
2 | Model Fitting and Evaluation (R², MSE) |
3 | Logistic Regression: Binary Classification |
4 | Sigmoid Function |
5 | Confusion Matrix Basics |
1 | Decision Tree: Gini, Entropy, Information Gain |
2 | Pruning and Overfitting |
3 | Random Forest: Ensemble Concept |
4 | Feature Importance |
5 | Hyperparameter Tuning |
1 | Confusion Matrix |
2 | Accuracy, Precision, Recall, F1-Score |
3 | ROC Curve, AUC |
4 | Cross-Validation |
5 | Bias-Variance Tradeoff |
1 | Creating New Features |
2 | Feature Transformation (log, binning, interaction terms)
|
3 | Feature Selection Techniques (Univariate, Recursive, Tree-based) |
4 | Dimensionality Reduction (PCA basics) |
1 | Dataset selection |
2 | Business Problem Understanding |
3 | Model Building and Evaluation
|
4 | Insights and Reporting |
1 | Problem Definition/h4> |
2 | Data Collection |
3 | EDA & Preprocessing |
4 | Model Training & Testing |
5 | Presentation of Results |
1 | Capstone Project |
2 | Multiple Model Evaluation |
3 | Report Submission |
4 | Viva / Presentation |
5 | Feedback and Improvement |
1 | Introduction to Power BI |
2 | Data Sources and Data Loading |
3 | Data Transformation (Power Query)
|
4 | Data Modeling |
5 | DAX (Data Analysis Expressions) |
6 | Data Visualization |
7 | Reports and Dashboards |
8 | Power BI Service |
9 | Security and Performance |
10 | Power BI Deployment and Use Cases |
1 | Introduction to R & Applications and Features |
2 | Installing R & RStudio |
3 | Basic Syntax & Operators
|
4 | Data Types , Type Conversion |
5 | Data Structures |
6 | Control Statements & Loops |
7 | Functions in R and Data Handling |
8 | Data Analysis |
1 | Excel Basics Revision |
2 | Advanced Formulas & Functions |
3 | Data Analysis Tools
|
4 | Pivot Tables & Pivot Charts |
5 | Data Management |
6 | Macros & VBA Basics |
7 | Advanced Excel Applications |