Here in this post, we will see how we can read most common file formats used in Data Science (using Python) with code. When you are working as a Data Scientist or Data Analyst, your most of the time is utilized in reading the dataset from different sources (including data saved in different file formats and […]
Handling missing values using Python in Data Science
When you start your journey towards data science or data analysis, one thing is for sure that the major task in both these positions is of handling missing values using Python or R whatever platform or language you choose. It’s said that almost 75 – 80% of the time, a data scientist or data analyst […]
How to Split Data for Machine Learning with scikit-learn
In this post, we will see how to split data for Machine Learning with scikit-learn/sklearn as its always a best practice to split your data into train and test set. As in our previous post, we defined Machine Learning as an art and science of giving machines especially computers an ability to learn to make […]
Building simple Linear Regression model using Python’s Sci-kit library
Here in this post, we will build a simple linear regression model using Python‘s Sci-kit learn/Sklearn library. When it comes to defining Machine Learning, we can say its an art and science of giving machines especially computers an ability to learn to make a decision from data and all that without being explicitly programmed. The […]