Kunal Jain — February 25, 2015

Over the last 2 years, this is the most common query I receive from our readers:

Which data science / analytics training should I go for?

The query comes in varied shapes and sizes, but the underlying question is still the same.

 

I can empathize with people facing this question – the number of tools, analytical techniques in use and training providers have all increased many-fold in the last few years. If the trends and projections are to be believed, this is probably just the start of a growth phase.

Let’s take an example. As a person switching from the software industry, do you learn SAS or R? Or should you learn Big Data tools and techniques? How about machine learning? Data visualization tools? Even if you zero in on one of these, the next question is where and how to undergo the training.

[Image: best big data science trainings]

I am sure most people in this situation feel like the person in the image above. This is where a framework can help you.

 

Framework to choose the right analytics training:

I aim to give you a framework to decide:

  • Which tool to learn?
  • Which techniques to focus on?
  • How to learn?
  • Where to learn?

You can apply it at various stages of your analytics career to find out what you should be learning next.

 

Overview of the framework:

The answers to the first 2 questions in this framework are in the form of levels or steps. You start from Level 0 and move one step at a time. So if you are a complete fresher, start from Level 0 of tools and Level 0 of techniques. But if you are a fresher with a statistics background, start with Level 1 of tools (assuming you know Excel) and Level 1 of techniques (move to Level 2 if you know predictive modeling).

Once you have finalized the tools and techniques to learn, move on to step 3 and step 4 of the process.

 

Step 1: Which tool to learn?

Level 0: Excel.

If you don’t know Excel, you should learn it first. You should be able to work with pivot tables, do simple data manipulations and apply lookups in Excel.

Level 1: SAS / R / Python

This is going to be your workhorse. You can choose any of these languages. For a more detailed comparison, have a look at this article.

Level 2: QlikView / Tableau / D3.js

You should add one of the visualization tools to your repertoire.

Level 3: Big Data tools

This in itself can span multiple levels – start with the Hadoop stack: HDFS, HBase, Pig, Hive, Spark.

Level 4: NoSQL Databases

Again, you can read an overview of NoSQL databases here and start by learning the most popular one – MongoDB.

Exception 1: If you come from an MIS / reporting background, you can start by learning visualization tools like QlikView and Tableau (Level 2) and then go to Level 1.

Exception 2: If you come from software engineering / web development and know one of the 2 languages – Java or Python – you can start from Big Data tools as well (Level 3).

 

Step 2: Which techniques should you be learning?

Now that you know which tool you want to learn, let us look at the techniques to learn. Again, the structure is similar:

Level 0: Basics of statistics – Descriptive and Inferential statistics

Level 1: Basic predictive modeling – ANOVA, Regression, Decision trees, Time Series

Level 2: All remaining machine learning techniques except neural nets

Level 3: Neural nets and deep learning
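
To make the starting-point rules concrete, here is a minimal Python sketch of the overview rules and the two exceptions from Step 1. The `Learner` fields, the function names and the order in which the exceptions are checked are my own assumptions for illustration; the framework itself only says that people from these backgrounds can start at the higher levels, not that they must.

```python
from dataclasses import dataclass

# Level names copied from Step 1 and Step 2 of the framework.
TOOL_LEVELS = {
    0: "Excel",
    1: "SAS / R / Python",
    2: "QlikView / Tableau / D3.js",
    3: "Big Data tools",
    4: "NoSQL databases",
}
TECHNIQUE_LEVELS = {
    0: "Basics of statistics",
    1: "Basic predictive modeling",
    2: "Remaining machine learning techniques (except neural nets)",
    3: "Neural nets and deep learning",
}


@dataclass
class Learner:
    """Hypothetical profile; the field names are illustrative only."""
    knows_excel: bool = False
    knows_statistics: bool = False
    knows_predictive_modeling: bool = False
    mis_reporting_background: bool = False  # Exception 1
    knows_java_or_python: bool = False      # Exception 2


def starting_tool_level(person: Learner) -> int:
    """Suggest the tool level to start from."""
    # Assumption: Exception 2 is checked before Exception 1.
    if person.knows_java_or_python:
        return 3  # may start from Big Data tools
    if person.mis_reporting_background:
        return 2  # start with visualization tools, then go to Level 1
    if person.knows_excel:
        return 1  # Excel is covered, so pick SAS / R / Python
    return 0      # complete fresher: learn Excel first


def starting_technique_level(person: Learner) -> int:
    """Suggest the technique level to start from."""
    if not person.knows_statistics:
        return 0
    if person.knows_predictive_modeling:
        return 2
    return 1


if __name__ == "__main__":
    fresher = Learner(knows_excel=True, knows_statistics=True)
    print("Tools:", TOOL_LEVELS[starting_tool_level(fresher)])
    print("Techniques:", TECHNIQUE_LEVELS[starting_technique_level(fresher)])
```

Treat the result as a suggestion for where to begin, not a strict rule. From there you still move up one level at a time.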

 

Step 3: How should you learn?

How you should learn depends on 2 factors:

  • Resources you can spend on learning; and
  • Your self-learning motivation.

This image explains the selection:

[Image: how_to_learn2]

At one extreme, you have the option to join open courses, where you spend low (almost zero) resources but need high self-learning motivation. At the other extreme, you have courses run by big universities like Stanford / MIT / Northwestern, where you will need to spend money and will get help and mentorship from experts over a longer duration. You can choose your style of learning depending on where you fit in.

Please note that irrespective of which method and blend you choose, you will need to supplement these trainings with hands-on projects and practice. No resource or training can cover that for you. Here are a few examples of these projects.

For people relying completely on self-learning, our learning paths can be of great help. There is one each for Python, SAS, Weka and QlikView, and several more are under development.

 

Step 4: Where to learn?

Now that you know what to learn and how to learn, you can shortlist the various options available. You should talk to people who have undergone that training / course and gather some reviews. You can also use our training listing page and apply filters to shortlist the trainings available for various tools and techniques. We have more than 300 trainings listed here and are in the process of adding more trainings and courses.

 

End Notes:

So, there you go! You should now have a way to find your way through this data science course jungle. I hope you find this framework useful. I have tried to put a framework to the most common query I get from our audience. The idea is to enable you to make the right decision to the extent possible. If you think you are in a situation which is not addressed by the framework above, please feel free to ask those questions through comments / discussion portal.

P.S. These are my views. A lot of these recommendations are based on my experience and what I think is the right choice. As you can expect, some of these questions don’t have a right or wrong answer. They are subjective in nature. So, if you have a different opinion about something I have mentioned, please feel free to let me know.

If you like what you just read and want to continue your analytics learning, subscribe to our emails, follow us on Twitter or like our Facebook page.

About the Author

Kunal Jain

Kunal is a postgraduate from IIT Bombay in Aerospace Engineering. He has spent more than 10 years in the field of Data Science. His work experience ranges from mature markets like the UK to a developing market like India. During this period he has led teams of various sizes and has worked on various tools like SAS, SPSS, QlikView, R, Python and Matlab.


26 thoughts on "How to choose the right data science / analytics / big data training?"

Suravi Kalita says: February 25, 2015 at 4:31 am
Nicely written article.
Darshana says: February 25, 2015 at 6:03 am
Dear Kunal, Thanks a lot for sharing very interesting insights about choosing the right program for analytics and big data. :) Regards, Darshana
Ruthger says: February 25, 2015 at 8:46 am
Hi Kunal, Very nice and clear article! What I actually missed were basic Unix shell programming skills. It can be extremely useful to know how to use commands like grep, awk and sed to perform essential data cleaning and pre-processing before bringing the data, for example as a .csv file, into Excel or R. Could you expand a bit on what you feel are the advantages of QlikView / Tableau / D3.js beyond, for example, making the graphics in R? All the best! Ruthger
Kunal Jain says: February 25, 2015 at 9:30 am
There are 2 situations where I think a data visualization tool can come in very handy: 1. Understanding and exploration of huge data - For example, while working on the Avazu CTR Kaggle problem, we were working on 7GB of data with anonymized columns. It was becoming time consuming to load this data in R and perform exploratory analysis. With QlikView, we could load the entire data in less than 5 minutes and then perform exploratory analysis very quickly. What helps is the quick slice and dice and drill-throughs available. So you can quickly identify high and low value populations and segregate them in your modeling in R. 2. The second application is in finally delivering your insights to the customers. Once your analysis is complete, you can use the story-telling feature of these visualization tools to present your findings. You can bookmark the graphs and access them quickly on the go. If people want to explore additional information, it is typically far easier to do so than opening RStudio and then writing / running the code. Hope this helps you answer the question. Regards, Kunal
Amit says: February 25, 2015 at 12:13 pm
Nice article. People, please do not attend the Great Lakes program in Chennai, it's a waste of money. Poorly conducted and highly rated program, not fit for people with experience, this is not a value for money proposition. It's highly theoretical with no exposure to Big Data and Hadoop, without which no one in the industry would be ready to take you in for a Sr position. Please refrain from joining the course in Chennai. I was part of the second batch, which started last year. The capstone project will be a flop, and the industry tie-ups are just marketing; they have no written approval from companies like HCL, Accenture, IBM, Cognizant etc. Please take your informed decision by evaluating and comparing various programs and institutes. This is my feedback for the institute.
Prasenjit says: February 25, 2015 at 6:22 pm
Hi Kunal, As always your articles have been very informative, and the posts related to Learning Paths on SAS, R, Python and Weka were really helpful for analytics newbies trying to get into / switch / shift from other knowledge-driven industries. Just to add, a few more pointers on the usage of tools like SPSS, Stata and Matlab would be icing on the cake for a data science professional for interpreting ANOVA tables, solving linear regressions and multivariate analysis. There is a Coursera level 1 course starting (23 March) in Applied Regression Analysis, as a part of which they are providing free access to Stata software ...link --> https://www.coursera.org/course/appliedregression Some more pointers on the "Where to learn" would be for a "not-so-reputed" institute, Calcutta Business School, who have tie-ups with SAS Institute to nurture students in the Analytics and Data Science stream with concepts of Data Mining, Machine Learning and Big Data, and exposure to tools like SPSS, R, Matlab, Big Data (Apache stack), Tableau and SAS (Base, EGBS, Eminer, Content Categorizer Studio, DI, VA). I sent you a message a couple of days back on LinkedIn with the course content attached for your perusal. Many thanks for all of your efforts; you have been doing a fabulous job in making things easier for analytics professionals who need a little help and push to succeed. Would request you and your team to publish a Learning Path on 1) Big Data concepts with references to the Hortonworks / Cloudera platform or base Apache stack. 2) Fraud, Risk, AML related analytics case study. Kind Regards, Prasenjit
Pankaj says: February 25, 2015 at 7:10 pm
Hi Kunal, Nice article! Gives a good direction to anyone looking to jump in this field! Thanks :) Regards, Pankaj
vaibhav kumar says: February 26, 2015 at 7:46 pm
Hi Kunal, first of all thank you for clearing our doubts and providing useful information. I did my graduation last year and am currently pursuing a certification course in business analytics. I do have interest in this domain and want to pursue a masters in business analytics (MBA). Is it a good step? And please do suggest colleges apart from the IIMs. I came across Galgotias University in Noida (started offering an MBA in analytics in 2013), Chandigarh University, EMPI college, BML Munjal University (established last year) and NIIT University (Rajasthan). Please do suggest colleges, as I'm very much confused.
anil says: February 27, 2015 at 11:30 am
Hi Kunal Sir, For the past few months I have read all your threads and now I am a big fan of yours, sir, and desperately motivated for data analytics. I am a software professional with 3 years of experience in ETL processes, data analysis and ETL testing, and very good in Oracle SQL / PLSQL and Oracle SQL analytical functions. I have also enrolled in the Big Data Hadoop developer course from SimpliLearn, but I want to become a data analyst or data scientist. I am very confused about the right way to jump into the ocean of data analytics, therefore requesting you to please suggest the right way to get into the data analytics field. Thanks, Anil Kumar
anil kumar says: February 27, 2015 at 12:41 pm
Hi Kunal Sir, For the past few months I have read all your threads and now I am a big fan of yours, sir, and desperately motivated for data analytics. I am a software professional with 3 years of experience in ETL processes, data analysis and ETL testing, and very good in Oracle SQL / PLSQL and Oracle SQL analytical functions. I have also enrolled in the Big Data Hadoop developer course from SimpliLearn, but I want to become a data analyst or data scientist. I am very confused about the right way to jump into the sea of data analytics, therefore requesting you to please suggest the way to get into the data analytics field. Thanks, Anil Kumar
Pankaj Singh says: March 02, 2015 at 5:03 am
Thank you Kunal, This is a really great article. It helped me a lot. Regards, Pankaj
rajshekar targar says: March 04, 2015 at 11:40 am
Why learn both QlikView and SAS? Both are BI tools. Can't we learn any one of them?
aniruddha says: March 06, 2015 at 5:30 am
Hi, I am currently doing an MBA in financial services from a tier 2 B-school in Mumbai; I possess a BE (IT) background with experience in PL/SQL programming in an MNC. I am interested in credit risk analytics roles like modeller or model appreciation etc. Currently I am learning R, but I doubt whether R is used in this segment or not. Would it be better to learn SAS?
krishna satish says: March 06, 2015 at 10:15 am
Hi, I am working as a Sr. Software Developer in an MNC. I work on mainframe technology and have a total of 4+ years of experience. I am planning to learn a new technology / concept / platform to grow in my career. I am aware of Excel, but not to the level (Level 0) you mentioned above. Hence could you please guide me on my framework? Your response would be much appreciated.
karthik v says: March 06, 2015 at 4:50 pm
it's really superb sir .... thank you very much
Kunal Jain says: March 31, 2015 at 9:40 am
Krishna, I would suggest learning Excel. We run 2 courses on Excel at one of our partner websites: 1. http://vtc.internshala.com/signup/course_details.php?course=analytics101 2. http://vtc.internshala.com/signup/course_details.php?course=excel101 You can have a look at both the courses. Regards, Kunal
Kunal Jain says: March 31, 2015 at 9:43 am
Aniruddha, If you are sure about making a career in Credit Risk Analytics – you should learn SAS over R. Regards, Kunal
Kunal Jain says: March 31, 2015 at 9:58 am
Rajshekar, QlikView is a data visualization tool (BI). But SAS is predominantly seen as a Business Analytics tool in the industry. Regards, Kunal
Suravi Kalita says: March 31, 2015 at 4:28 pm
I am trying to register for the Excel course but could not, because it is not letting me register without the discount coupon code.
Pankaj Kumar says: May 27, 2015 at 5:47 am
Hi Kunal, I have been selected for the PGPBA program at Great Lakes and also at Praxis Business School, Kolkata. The Praxis PGPBA is a full-time program with internship and placement. Currently I am working at IBM. I have 3 years of experience in market research. Please guide me on which of the two is the better option, and what kind of opportunities I could get after the PGPBA from Great Lakes. Please reply.
Himank says: July 11, 2015 at 7:05 am
Hi Kunal, I have done a B.E. in computer science and have been working on QlikView for ETL as well as for data visualization for the past 3 years. I am keen to learn something else as well and am researching analytics courses or something good that will help my career. I would be grateful if you could suggest what is best for me. Thanks in advance, Himank
Rayaan says: July 13, 2015 at 2:20 pm
Hi Kunal, Thanks for the insights in this article. However, I'll ask you about a more specific case: myself. I'm into data warehousing (Informatica) and RDBMS (Oracle & TD) with 7 years of experience. With the sudden outburst of requirements in the Big Data industry, I'd like to make a shift to these technologies. My interest lies in NoSQL DBs and in the field of data science. Could you please chalk out a plan for me? Also let me know how much Java is required to be able to function as a Big Data analyst or to effectively perform my activities. Thanks in advance, Rayaan
Kamal T says: July 16, 2015 at 4:22 am
I first encountered the learning paths but then I was confused which one to start with. This article is a way to decide that (and much more). This is exactly what a beginner would need! Bookmarked
Vikram says: August 16, 2015 at 2:15 pm
Hi Kunal, Nice article. I am a Mechanical Engineer working as an Operational Engineer in an MNC. I have experience of working with QlikView, but I don't have experience of working with SQL, R or SAS. Is it possible to learn SQL or SAS without prior programming knowledge? And is experience in SAS and SQL necessary to work in Business Analytics? Please guide
Vijay says: December 14, 2015 at 9:31 am
Hi Kunal, I'm an engineering graduate with 13 years of experience in corporate sales. I'm looking at changing my career to analytics due to my interest in the field and its prospective openings ... I have done a few Coursera courses in R programming, machine learning, etc. ... Please suggest a path to help me land in this industry, as currently I don't have any experience in this field ... Will doing a certification program offered by ISB, IIMB, etc. help? ... Looking forward to your advice on how I should place myself, steps, etc. ... Looking forward to your valuable feedback
Anuradha says: May 12, 2016 at 12:05 pm
Very informative and useful article
