Solutions for Skilltest in Statistics Revealed

Kunal Jain Last Updated : 10 Nov, 2016

Introduction

Statistics is one of the founding pillars of a career in data science and business analytics. Without a solid grasp of the basics of statistics, it is hard to perform well in data science. We launched the Statistics skill test to give our community a tool to assess their skills in statistics. You can look at the leaderboard on the skill assessment platform.

More than 1800 people registered for the hackathon, and 533 of them actually assessed themselves in the 2 hours.

If you could not attend the skill assessment, see how many of the questions you can answer correctly. I am sure you will take away learning points from this article and improve your knowledge of statistics.

If you enjoyed the experience and would like to try it again on a more advanced topic, here is your chance to register for Statistics Skill Test – 2. Also, check out our skill test on R.

Overall Results

Who could have asked for a better way to analyze the results of a statistical skill test on this topic? Here is the distribution of the scores:

Here are a few measures of the distribution:

Mean = 14.99

Median = 16

Mode = 14

Let us look at the spread:

Standard Deviation = 8.13

95% confidence interval – [0, 30.94)
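The interval above is consistent with taking the mean plus and minus roughly 1.96 standard deviations and clipping the lower bound at zero. The sketch below assumes that is how it was derived; the article does not state the method. Strictly speaking, this gives a range expected to cover about 95% of individual scores under a normal approximation, not a confidence interval for the mean.

```python
# Assumption: interval = mean ± 1.96 standard deviations (normal
# approximation), with the lower bound clipped at 0 because a score
# cannot be negative. The article does not say how it was computed.
mean_score = 14.99
std_dev = 8.13
z = 1.96  # two-sided 95% critical value of the standard normal

lower = max(0.0, mean_score - z * std_dev)
upper = mean_score + z * std_dev

print(f"[{lower:.2f}, {upper:.2f})")  # prints [0.00, 30.92)
```

The upper bound lands at about 30.92, close to the reported 30.94. The small gap presumably comes from rounding in the reported mean and standard deviation.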

So, congratulations to the top 5 people (31 and above) for setting themselves apart from the rest of the population.

If your score is more than 21, you are in the top 25 percent, and you deserve a pat on the back!

On the other hand, people with a score below 9 probably need to spend more time on these concepts. Believe me, it wasn't tough!

Useful resources to learn Statistics

Skill Test Questions and Answers

The skill test consisted of 40 questions, selected very carefully to cover the concepts we think anyone pursuing a career in analytics should have at their fingertips.

Read on for detailed solutions to all the questions.

1) Which measure of central tendency describes the following right-skewed distribution in the best manner?

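a) Mean

b) Median

c) Mode

d) All of these

Ans: b) Median

In a right-skewed distribution, the mean is pulled toward the long right tail and the mode sits at the peak on the left, so mode < median < mean. The median lies between the two and is least affected by the extreme values, which makes it the best summary here.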
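To see how the three measures order themselves under right skew, here is a minimal sketch using Python's standard `statistics` module. The data are made up for illustration and are not taken from the skill test.

```python
import statistics

# Hypothetical right-skewed sample: most values are small, a few are large.
values = [1, 2, 2, 2, 3, 3, 4, 5, 9, 20]

mean = statistics.mean(values)
median = statistics.median(values)
mode = statistics.mode(values)

# The long right tail drags the mean above the median, while the mode
# stays at the peak on the left.
print(mode, median, mean)  # prints 2 3.0 5.1
```

The single value of 20 moves the mean well above the bulk of the data, while the median barely notices it.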

2) Which measure of central tendency describes the following nominal/categorical distribution in the best manner?

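a) Mean

b) Median

c) Mode

d) All of these

Ans: c) Mode

The mean and median are not meaningful for nominal (unordered categorical) data, because the categories have no numeric value or natural order. The mode, the most frequent category, is the only measure of central tendency that applies.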
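As a small illustration, the mode of categorical data is just the most frequent label. The labels below are hypothetical and chosen only for the example.

```python
from collections import Counter

# Hypothetical nominal data: labels with no numeric value or natural order.
colours = ["red", "blue", "red", "green", "blue", "red"]

counts = Counter(colours)
# most_common(1) returns a list with the single (label, count) pair
# that has the highest count.
mode_label, mode_count = counts.most_common(1)[0]

print(mode_label, mode_count)  # prints red 3
```

There is no sensible way to add or sort these labels, which is why counting is the only operation available.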

3) Which measure of central tendency describes the following left-skewed distribution in the best manner?

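a) Mean

b) Median

c) Mode

d) All of these

Ans: b) Median

In a left-skewed distribution, the mean is pulled toward the long left tail and the mode sits at the peak on the right, so mean < median < mode. The median lies between the two and is least affected by the extreme values, which makes it the best summary here.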

4) Which measure of central tendency best suits this bi-modal distribution?

a) Mean

b) Median

c) Mode

d) Mean or Median

Ans: b) Median

If a bimodal distribution is symmetric, either the mean or the median can represent its central tendency. In this case, however, the skew is clearly visible in the image: the mode lies at the left 'bump', and the mean lies close to the left 'bump' too (due to the left skew). The median, by contrast, lies fairly close to the centre.

5) Which measure of central tendency best suits a normal distribution?

a) Mean

b) Median

c) Mode

d) All of these

Ans: d) All of these

Mean = Median = Mode for a normal distribution, as evident in the image.
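Python's standard library makes this equality easy to check. The sketch below uses `statistics.NormalDist` with arbitrary, hypothetical parameters; any choice of `mu` and `sigma` behaves the same way.

```python
from statistics import NormalDist

# Hypothetical parameters chosen only for illustration.
dist = NormalDist(mu=50, sigma=10)

# For a normal distribution all three measures coincide at mu.
print(dist.mean, dist.median, dist.mode)  # prints 50.0 50.0 50.0
```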

6) Which of the following distributions satisfies the relationship Mode > Median > Mean?

a) Positively skewed

b) Negatively skewed

c) Normal

d) Bi-modal

Ans: b) Negatively skewed

In a skewed distribution, the mean is pulled toward the tail (the direction of the skew), the mode sits at the peak on the opposite side, and the median lies between them. In a negatively skewed distribution the tail is on the left, so the mean is the smallest of the three and the mode the largest.
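The ordering can be checked on a small made-up sample with a long left tail. The data are hypothetical and serve only to illustrate the relationship.

```python
import statistics

# Hypothetical left-skewed sample: most values are large, a few are small.
values = [1, 12, 16, 17, 18, 18, 19, 19, 19, 20]

mean = statistics.mean(values)
median = statistics.median(values)
mode = statistics.mode(values)

# The long left tail drags the mean below the median, while the mode
# stays at the peak on the right.
print(mode, median, mean)  # prints 19 18.0 15.9
```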

End Notes

I hope you had fun participating in the assessment challenge and reading this article. We tried to answer all your queries, but if we still haven't cleared up all your doubts, feel free to post your questions in the comments below. Since this was something new that we tried in order to enrich your experience, we would like to hear your thoughts, suggestions and feedback. This will help us serve you better and understand where we should improve.

Also, make sure you register for Statistics Skill Test – 2 and the upcoming skill test on R tomorrow.

Want to apply your analytical skills and test your potential? Then participate in our Hackathons and compete with top data scientists from all over the world.

Arpit Agrawal

Can you please explain question no. 22 again, as only one graph has been posted? I did not understand how to check bias and variance.

B.Rabbit

A point estimate is basically a sample statistic with which we estimate the population parameter. The central value theta is the value of the population parameter we are trying to estimate. If you look at each option and work out a rough interval for each point estimate, the interval that is the smallest and still contains the population parameter is b). This is also synonymous with the bias-variance tradeoff.

Rajaram K

Q 19: I think option A is the answer. The chances of a randomly selected sample's mean being equal to the population mean are very low. Any thoughts, anyone?

I don't understand on what grounds you say "Chances of a randomly selected sample's mean to be equal to that of a population mean are very low." Please explain. Suppose the population is 1, 2, 3, 4, 5 (population mean = 3) and we draw a sample of size 2, say 2, 4 (sample mean = 3). Here population mean = sample mean. Hence A can't be the answer.
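The counterexample in this reply can be checked by enumerating every sample of size 2 from that population. This is only a sketch; it assumes sampling without replacement and ignores order, neither of which the comment specifies.

```python
from itertools import combinations
from statistics import mean

population = [1, 2, 3, 4, 5]
population_mean = mean(population)

# Assumption: samples are drawn without replacement and order is ignored.
samples = list(combinations(population, 2))
matching = [s for s in samples if mean(s) == population_mean]

print(len(samples), matching)  # prints 10 [(1, 5), (2, 4)]
```

Two of the ten possible samples have a mean equal to the population mean, so the event is certainly possible for this small population.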

Raghuvaran

I really enjoyed the test. Thanks for hosting it, and thanks for providing the solutions too, which are very useful for understanding the correct answers. I have a doubt about the formula below: Confidence interval = (sample mean – Margin of error, sample mean + Margin of error). After the first "Margin of error" there is a comma, and I don't know how to interpret it in the formula. Please help me understand the formula clearly.

Suppose sample mean = 50, Margin of error = 10 then the confidence interval is (40, 60). Hope this helps!
