Biases in data that we should all be aware of to build a reliable and fair machine learning model

Image created by the author

What have I learned working in the data science field?

1. You will spend 80% of your time in data preparation

Important statistical concepts for data scientists and how to use them

Popular activation functions and their use

Determining the best polynomial degree to choose in a polynomial regression

A reference checklist for Data Analytics professionals


Choosing the right hyperparameter values using Cross-Validation

  1. It’s prone to overfitting with many input features, and
  2. It cannot easily express non-linear/curvy relationships.

How to augment machine intelligence and what the future will look like for humans in terms of jobs


Understanding how z-scores were invented and how they are used

  1. What is a Z-score — Formula and definition.
  2. How to use Z-score using a toy example.
  • z < 1.81 - Distress “Zone”
  • 1.81 < z < 2.99 - Grey “Zone”
  • z > 2.99 - Safe “Zone”
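
As a rough illustration of the outline above, here is a minimal Python sketch. It assumes the standard statistical definition of a z-score, (x - mean) / standard deviation, and uses the population standard deviation; the article may use the sample version instead. The function names `z_scores` and `zone` are hypothetical, and the handling of values exactly equal to 1.81 or 2.99 (treated as Grey here) is an assumption, since the list only gives strict inequalities. The zone thresholds are applied to whatever score the caller supplies.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Return the z-score of each value: (x - mean) / standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation (an assumption)
    if sigma == 0:
        raise ValueError("z-scores are undefined when all values are equal")
    return [(x - mu) / sigma for x in values]


def zone(z):
    """Map a score to the zones listed above.

    Boundary values (exactly 1.81 or 2.99) fall into "Grey" here,
    which is an assumption; the source list uses strict inequalities.
    """
    if z < 1.81:
        return "Distress"
    if z > 2.99:
        return "Safe"
    return "Grey"


# Toy example: mean is 5 and population standard deviation is 2
toy = [2, 4, 4, 4, 5, 5, 7, 9]
print(z_scores(toy))  # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
print(zone(2.5))      # Grey
```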

Feature scaling using Python: why feature scaling is required and the two common types of feature scaling methods

  1. What is feature scaling and why is it required in Machine Learning (ML)?
  2. Normalization — pros and cons.
  3. Standardization — pros and cons.
  4. Normalization or standardization: which one is better?
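
The two methods named in the outline above can be sketched in a few lines of plain Python. This is a minimal illustration under common definitions, not necessarily the article's own code: normalization is taken to mean min-max scaling to the range [0, 1], and standardization is taken to mean subtracting the mean and dividing by the population standard deviation. The function names `normalize` and `standardize` are hypothetical.

```python
from statistics import mean, pstdev


def normalize(values):
    """Min-max scaling: rescale values to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("cannot normalize a constant feature")
    return [(x - lo) / (hi - lo) for x in values]


def standardize(values):
    """Standardization: rescale values to mean 0 and standard deviation 1."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation (an assumption)
    if sigma == 0:
        raise ValueError("cannot standardize a constant feature")
    return [(x - mu) / sigma for x in values]


feature = [10, 20, 30, 40, 50]
print(normalize(feature))    # [0.0, 0.25, 0.5, 0.75, 1.0]
print(standardize(feature))  # values centered on 0 with unit spread
```

One practical difference this makes visible: min-max scaling depends entirely on the smallest and largest values, so a single outlier compresses everything else, while standardization does not bound the output range.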

Swapnil Kangralkar

Data Scientist and Project Management Professional at the Government of Canada. Visit for more.

