Jan 29 2015

How to create Box-Plot chart in Qlikview?

Qlikview, Boxplot

The use of this article is best illustrated by a case study, so let’s dive straight in. Business situation: We recently entered 2015, and before starting new projects and initiatives, every company wants to review its performance over the last year. Consider a sales-oriented company, ABC. It has branches across India and these …

Continue reading »

Jan 27 2015

Introduction to Online Machine Learning : Simplified


Data is being generated in huge quantities everywhere. Twitter generates 12+ TB of data every day, Facebook generates 25+ TB of data every day, and Google generates much more than that every day. Given that such data is being produced every day, we need to build tools to handle data with high 1. Volume : …

Continue reading »

Jan 26 2015

Learning path for SAS – from beginner to a Business Analyst


This is now becoming a theme! But it is something we are very excited about and our audience is loving. For those who are joining late: learning paths are your guide to mastering a new tool or technique with the help of a structured path and (mostly) freely available resources. We launched the learning paths for …

Continue reading »

Jan 22 2015

Decision Tree Algorithms – Simplified

Information Gain, Decision Tree

In the last article, we looked at the basics of decision trees and how they help in classification. We also looked at the advantages and disadvantages of using decision trees. One of the advantages of using a decision tree is that it efficiently identifies the most significant variable and splits the population on it. In the previous article, we developed …

Continue reading »

Jan 20 2015

Model Performance metrics: How well does my model perform? – Part 2


The popularity of the last article prompted us to publish this one so soon. In the last article, we discussed a few performance metrics used for classification problems. We saw that the confusion matrix is most commonly used with class-output models, but it can also be used with probability-output models by applying a threshold probability. We also …

Continue reading »

Jan 19 2015

QlikView learning path – the only resource you need to master QlikView

We launched our learning paths last week with Data Science in Python. The Python learning path received an awesome response not only from our audience, but also from the data science community worldwide. The learning path received more views in a day than some of our best-written articles get in a month! Today, we launch the …

Continue reading »

Jan 15 2015

Decision Tree – Simplified!

Decision Tree, Machine Learning, Python, Orange, Kaggle

I started working as a business analyst in my previous organisation, having transitioned from a Business Intelligence (BI) Analyst role. During my initial days as a business analyst, I had a bias towards using one classification technique – DECISION TREE. This was because of its inherent simplicity and many advantages. We …

Continue reading »

Jan 13 2015

Launch of learning path – Data Science in Python

We are jumping up and down right now! We can’t find any other way to express our excitement. We said that 2015 is going to be the year when Analytics Vidhya becomes the place to learn analytics and data science. We launched our discussion forums two weeks ago and there are awesome discussions already …

Continue reading »

Jan 11 2015

Model performance metrics: How well does my model perform? – Part 1


If you are preparing for an analytics interview, you have hit the jackpot. This blog will give you answers to at least 2 – 3 questions that are likely to be asked in the interview. If you already know some of the metrics discussed in this article, it will still be worth reading to brush …

Continue reading »

Jan 09 2015

Comprehensive Introduction to merging in SAS

SAS, Merge

In my previous article, “Combining data sets in SAS – Simplified”, we discussed three methods to combine data sets – appending, concatenating and interleaving. In this article, we will look at the most commonly used method of combining data sets – MERGING or JOINING. The need for joining / merging datasets: Before jumping …

Continue reading »
