Topic: data-analysis
Hypothesis Testing
Purpose Difference between Inferential Statistics & Hypothesis Testing Hypothesis Testing Critical Region Acceptance Region Types of Test…
Pandas Basics
Table of Contents Frequency Table using pd.crosstab References What is Pandas Pandas is a library specifically aimed at simplifying the…
Central Limit Theorem
Sampling Terminology Why taking large number of samples is good? Central Limit Theorem Confidence Level Margin of Error Confidence Interval…
Inferential Statistics
Purpose of Inferential Statistics From a small dataset, we have to figure out information such that it applies to the whole population. We…
Numpy Basics
Why Numpy over Lists? Speed of computation is faster Designed for data analysis Vector operations can be performed Concise and easy to read…
EDA - Exploratory Data Analysis
Purpose Companies usually collect a lot of data over time. Once they have enough data, they realize, they can't do much with it. They come…
Data Visualization and EDA
Purpose Understanding various types of Plots available for analyzing data Prologue Data visualisation is an important skill to possess for…
Standardisation of Data
Purpose Whatever data you get will most likely not be in a format that can be processed directly. There will some form of data cleaning…
Operations on Pandas DataFrames
Merging DataFrames Note: can be set to left / right / inner / outer and it is similar to SQL is the column on which merge can occur…
Reading data from external sources into a DataFrame
Reading from text file Reading from Database Scraping data using bs4 Reading data from PDF files (pyPDF2) Reading from text file Reading…
Treating Missing values in DataFrame
Identify missing values Note: If there were any rows missing all values, we would simply drop them. Treating missing Values There are…
Operations on Numpy Arrays
Reshape the array to a desired size Note: We can use '-1' as the last dimension during reshape and numpy will figure out the required…
Machine Learning References
One Hot Encoding Why One Hot Encoding? Hackernoon Wikipedia
Five Powerful and Easy to Learn Numpy Operations
Introduction To put it simply, if you have ever used C/C++, Numpy is the equivalent of Arrays in Python. Along with all the basic operations…

