Topic: data-analysis

Hypothesis Testing
August 30, 2021

Hypothesis Testing

Purpose Difference between Inferential Statistics & Hypothesis Testing Hypothesis Testing Critical Region Acceptance Region Types of Test…

Pandas Basics
August 22, 2021

Pandas Basics

Table of Contents Frequency Table using pd.crosstab References What is Pandas Pandas is a library specifically aimed at simplifying the…

Central Limit Theorem
August 11, 2021

Central Limit Theorem

Sampling Terminology Why taking large number of samples is good? Central Limit Theorem Confidence Level Margin of Error Confidence Interval…

Inferential Statistics
July 11, 2021

Inferential Statistics

Purpose of Inferential Statistics From a small dataset, we have to figure out information such that it applies to the whole population. We…

Numpy Basics
July 07, 2021

Numpy Basics

Why Numpy over Lists? Speed of computation is faster Designed for data analysis Vector operations can be performed Concise and easy to read…

EDA - Exploratory Data Analysis
June 28, 2021

EDA - Exploratory Data Analysis

Purpose Companies usually collect a lot of data over time. Once they have enough data, they realize, they can't do much with it. They come…

Data Visualization and EDA
June 20, 2021

Data Visualization and EDA

Purpose Understanding various types of Plots available for analyzing data Prologue Data visualisation is an important skill to possess for…

Standardisation of Data
June 20, 2021

Standardisation of Data

Purpose Whatever data you get will most likely not be in a format that can be processed directly. There will some form of data cleaning…

Operations on Pandas DataFrames
June 16, 2021

Operations on Pandas DataFrames

Merging DataFrames Note: can be set to left / right / inner / outer and it is similar to SQL is the column on which merge can occur…

Reading data from external sources into a DataFrame
June 16, 2021

Reading data from external sources into a DataFrame

Reading from text file Reading from Database Scraping data using bs4 Reading data from PDF files (pyPDF2) Reading from text file Reading…

Treating Missing values in DataFrame
June 16, 2021

Treating Missing values in DataFrame

Identify missing values Note: If there were any rows missing all values, we would simply drop them. Treating missing Values There are…

Operations on Numpy Arrays
June 15, 2021

Operations on Numpy Arrays

Reshape the array to a desired size Note: We can use '-1' as the last dimension during reshape and numpy will figure out the required…

Machine Learning References
November 02, 2020

Machine Learning References

One Hot Encoding Why One Hot Encoding? Hackernoon Wikipedia

Five Powerful and Easy to Learn Numpy Operations
October 24, 2020

Five Powerful and Easy to Learn Numpy Operations

Introduction To put it simply, if you have ever used C/C++, Numpy is the equivalent of Arrays in Python. Along with all the basic operations…