Posts

Showing posts with the label Programming

Concept of Decision Tree Classification | Machine Learning | Data Science | Mathematics

Decision Tree Algorithm for Classification. The decision tree is one of the most popular and widely used algorithms in machine learning. It is a supervised learning algorithm that can be used for both classification and regression. (Photo by Fabrice Villard on Unsplash.) Let's first see how it works. (Figure: a simple decision tree example.) Now that we are familiar with the decision tree, let's go deeper. Impurity: a measure of how mixed the class labels in your data are, that is, how far the data is from being homogeneous. (Image source: ResearchGate.) There are several impurity measures, of which we will learn these two: 1. Entropy: Entropy is the randomness in your dataset, and more randomness means less predictability. It is directly proportional to the non-homogeneity in your dataset, so it measures the impurity of a split. Use: We analyse the entropy on every node in the decision tr...
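
The entropy idea in the excerpt above can be sketched in a few lines. This is a minimal illustration, assuming the usual Shannon entropy in bits; the function name `entropy` and the example labels are made up here and are not taken from the post.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels.

    Assumption: this is the standard definition, sum of p * log2(1 / p)
    over the class proportions p. The post's own formula is not shown
    in the excerpt.
    """
    total = len(labels)
    counts = Counter(labels)
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# Hypothetical node contents, for illustration only.
pure_node = ["yes"] * 8                  # one class only: fully homogeneous
mixed_node = ["yes"] * 4 + ["no"] * 4    # 50/50 split: maximally mixed

print(entropy(pure_node))
print(entropy(mixed_node))
```

A homogeneous node gives the lowest possible entropy, and an evenly mixed two-class node gives the highest, which is why a tree prefers splits that lower the entropy of the child nodes.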

Plotly & Cufflinks | A Data Visualisation Library with Modern Features | Python | Data Science | Data Visualisation

Plotly: Plotly gives you many interactive and dynamic data visualisation and UI tools for data science, machine learning and engineering. It has some great features, which we will discuss in this post. So, get ready for the tutorial. Installation guide: to install Plotly, run one of these in your terminal:
pip install plotly
conda install -c plotly plotly
Cufflinks: To plot directly from pandas DataFrames with Plotly, we need something to connect the two, and the cufflinks library does exactly that. Plotly's JavaScript library, plotly.js, is built on top of d3.js. Installation guide: to install cufflinks, run one of these in your terminal:
pip install cufflinks
conda install -c conda-forge cufflinks-py
Let's begin... Import all the libraries needed: import pandas as pd import numpy as np import cufflinks as cf from plotly.offline import download_plotlyjs,init_notebook_...

Concept of Support Vector Regression(SVR) | SVM | Mathematics | Machine Learning

SVM stands for Support Vector Machine, a well-known classification and regression algorithm. Today we will focus on support vector regression. So, let's begin... Look at this graph... Suppose you have to classify the elements: how would you do it? You would do something like this... Now, suppose you have this graph... Here no simple straight line can separate the classes, so we add one more axis, the z-axis. Viewed from above, it will look like this... What we did in the previous graphs is exactly what SVM does. Support Vector Machine is a supervised machine learning algorithm used for classification and regression problems. Terminologies: there are a few terms to learn before going further... Hyperplane: This is the line we drew earlier to separate the data classes in SVM. And in support vector regression ...
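
The "add a z-axis" step in the excerpt above can be illustrated with a toy example. This is only a sketch: the mapping z = x² + y², the sample points, and the threshold are all assumptions chosen for illustration, since the excerpt does not say which mapping the post uses, and a real SVM would normally get the same effect through a kernel function.

```python
def lift(point):
    """Map a 2-D point to 3-D by adding z = x^2 + y^2 (assumed mapping)."""
    x, y = point
    return (x, y, x * x + y * y)


# Hypothetical data: one class near the origin, the other on a ring around it.
# No straight line in the x-y plane separates these two groups.
inner = [(0.5, 0.0), (0.0, -0.5), (-0.3, 0.4)]
outer = [(2.0, 0.0), (0.0, -2.0), (-1.5, 1.5)]

inner_z = [lift(p)[2] for p in inner]
outer_z = [lift(p)[2] for p in outer]

# After lifting, a flat plane z = threshold separates the classes.
threshold = 1.0  # hypothetical value picked by eye for this toy data
print(all(z < threshold for z in inner_z))
print(all(z > threshold for z in outer_z))
```

In the lifted space the separating surface is a plane, which is the hyperplane the terminology section goes on to describe.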

Simple and Multiple Linear regression in just 10 lines of code | Python | SciKit Lib | Machine Learning | Sklearn

Ten to twenty years ago, machine learning, data science and artificial intelligence were not topics of everyday conversation. But today we have collected so much data that these fields are in the limelight and already occupy a large place in the IT industry. With this much data, we can predict what is likely to happen in the future. We can even use the data to let the machine learn how a human works, so that machines can take part in people's daily routines. With the help of machine learning, we can do such things and transform the way people live, with machines working for humans. In this post, we will learn a machine learning algorithm that top IT companies and businesses already use for prediction. Linear regression is the algorithm which major business uses...
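
As a rough picture of what simple linear regression computes, here is a minimal sketch of the closed-form least-squares fit for one feature. It is written in plain Python rather than with the scikit-learn code the post title refers to (the excerpt does not show that code), and the function name and data are hypothetical.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points.

    Uses the textbook formulas:
        slope     = sum((x - mean_x) * (y - mean_y)) / sum((x - mean_x) ** 2)
        intercept = mean_y - slope * mean_x
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on the line y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

slope, intercept = fit_simple_linear_regression(xs, ys)
prediction = slope * 6 + intercept  # predict y for an unseen x = 6
print(slope, intercept, prediction)
```

Multiple linear regression follows the same least-squares idea with several input columns instead of one.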

A Quick Guide to Data pre-processing for Machine Learning | Python | IMPUTATION | STANDARDISATION | Data Analysis | Data Science

Data Pre-processing | Imputation | Standardisation | Rescaling | Python. Before feeding your data to a machine learning algorithm, you have to prepare it. The data has to be of good quality: it should not contain null values or out-of-range values, because the quality of your data directly affects the quality of your model's training. For more info about the need and requirements for data pre-processing, Click Here. (Photo by Mika Baumeister on Unsplash.) 1. Imputation: Imputation simply means "replacement"; this process helps you replace the missing values in your table. Many algorithms can't deal with null values and might give you errors or a badly trained model. Let's have a look at the data below, which is about the salaries of several domains... THE DATASET This data has some null values, which cannot be tolerated by the machine learning ...
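
To make the imputation step concrete, here is a minimal sketch that replaces missing values with the mean of the known ones. Mean imputation is an assumption here, since the excerpt does not say which strategy the post uses, and the salary figures below are invented for illustration and are not the post's dataset.

```python
from statistics import mean


def impute_with_mean(values):
    """Replace every None in the list with the mean of the non-missing values."""
    known = [v for v in values if v is not None]
    fill = mean(known)
    return [fill if v is None else v for v in values]


# Hypothetical salary column with two missing entries.
salaries = [40000, None, 60000, None, 50000]

filled = impute_with_mean(salaries)
print(filled)
```

Other common choices are the median or the most frequent value, which are less sensitive to outliers or better suited to categorical columns.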