Python vs. R vs. SAS – which tool should I learn for Data Science?
Overview
- The long-running debate of R vs SAS has now been joined by Python
- Each of R, SAS and Python have their pros and cons and can be compared over criteria like cost, job scenario and support for the different machine learning algorithms
- You can also choose any of the three tools depending on which stage of your Data Science career you are in
Note: This article was originally published on Mar 27th, 2014 and updated on Sept 12th, 2017
Introduction
We love comparisons!
From Samsung vs. Apple vs. HTC in smartphones; iOS vs. Android vs. Windows in mobile OS to comparing candidates for upcoming elections or selecting captain for the world cup team, comparisons and discussions enrich us in our life. If you love discussions, all you need to do is pop up a relevant question in middle of a passionate community and then watch it explode! The beauty of the process is that everyone in the room walks away as a more knowledgeable person.
I am sparking something similar here. SAS vs. R has probably been the biggest debate the data science industry might have witnessed. Python is one of the fastest growing languages now and has come a long way since it’s inception. The reason for me to start this discussion is not to watch it explode (that would be fun as well though). I know that we all will benefit from the discussion.
This has also been one of the most commonly asked questions to me on this blog. So, I thought I’ll discuss it with all my readers and visitors!
Hasn’t a lot already been said on this topic?
Probably yes! But I still feel the need for discussion for following reasons:
- The data science industry is very dynamic. Any comparison which was done 2 years ago might not be relevant any more.
- Traditionally Python has been left out of the comparison. I think it is more than just a worthy consideration now.
- While I’ll discuss global trends about the languages, I’ll add specific information with regards to Indian analytics industry (which is at a different level of evolution)
So, without any further delay, let the combat begin!
Background
Here is a brief description about the 3 ecosystems:
- SAS: SAS has been the undisputed market leader in commercial analytics space. The software offers huge array of statistical functions, has good GUI (Enterprise Guide & Miner) for people to learn quickly and provides awesome technical support. However, it ends up being the most expensive option and is not always enriched with latest statistical functions.
- R: R is the Open source counterpart of SAS, which has traditionally been used in academics and research. Because of its open source nature, latest techniques get released quickly. There is a lot of documentation available over the internet and it is a very cost-effective option.
- Python: With origination as an open source scripting language, Python usage has grown over time. Today, it sports libraries (numpy, scipy and matplotlib) and functions for almost any statistical operation / model building you may want to do. Since introduction of pandas, it has become very strong in operations on structured data.
Attributes For Comparison
I’ll compare these languages on following attributes:
- Availability / Cost
- Ease of learning
- Data handling capabilities
- Graphical capabilities
- Advancements in tool
- Job scenario
- Deep Learning Support
- Customer service support and Community
I am comparing these from point of view of an analyst. So, if you are looking for purchasing a tool for your company, you may not get complete answer here. The information below will still be useful. For each attribute I give a score to each of these 3 languages (1 – Low; 5 – High).
The weightage for these parameters will vary depending on what point of career you are in and your ambitions.
1. Availability / Cost
SAS is a commercial software. It is expensive and still beyond reach for most of the professionals (in individual capacity). However, it holds the highest market share in Private Organizations. So, until and unless you are in an Organization which has invested in SAS, it might be difficult to access one. Although, SAS has brought in a University Edition that is free to access but it has some limitations. You can also use Jupyter Notebooks in there!
R & Python, on the other hand are completely free. Here are my scores on this parameter:
SAS – 3
R – 5
Python – 5
2. Ease of Learning
SAS is easy to learn and provides easy option (PROC SQL) for people who already know SQL. Even otherwise, it has a good stable GUI interface in its repository. In terms of resources, there are tutorials available on websites of various university and SAS has a comprehensive documentation. There are certifications from SAS training institutes, but they again come at a cost.
R has the steepest learning curve among the 3 languages listed here. It requires you to learn and understand coding. R is a low level programming language and hence simple procedures can take longer codes.
Python is known for its simplicity in programming world. This remains true for data analysis as well. While there are no widespread GUI interfaces as of now, I am hoping Python notebooks will become more and more mainstream. They provide awesome features for documentation and sharing.
SAS – 4.5
R – 2.5
Python – 3.5
3. Data Handling Capabilities
This used to be an advantage for SAS till some time back. R computes every thing in memory (RAM) and hence the computations were limited by the amount of RAM on 32 bit machines. This is no longer the case. All three languages have good data handling capabilities and options for parallel computations. This I feel is no longer a big differentiation. They’ve all also brought on Hadoop and Spark integrations, with them also supporting Cloudera and Apache Pig.
SAS – 4
R – 4
Python – 4
4. Graphical Capabilities
SAS has decent functional graphical capabilities. However, it is just functional. Any customization on plots are difficult and requires you to understand intricacies of SAS Graph package.
R has highly advanced graphical capabilities along with Python. There are numerous packages which provide you advanced graphical capabilities.
With the introduction of Plotly in both the languages now and with Python having Seaborn, making custom plots has never been easier.
SAS – 3
R – 4.5
Python – 4.5
5. Advancements in Tool
All 3 ecosystems have all the basic and most needed functions available. This feature only matters if you are working on latest technologies and algorithms.
Due to their open nature, R & Python get latest features quickly. SAS, on the other hand updates its capabilities in new version roll-outs. Since R has been used widely in academics in past, development of new techniques is fast.
Having said this, SAS releases updates in controlled environment, hence they are well tested. R & Python on the other hand, have open contribution and there are chances of errors in latest developments.
SAS – 4
R – 4.5
Python – 4.5
6. Job Scenario
Globally, SAS is still the market leader in available corporate jobs. Most of the big organizations still work on SAS. R / Python, on the other hand are better options for start-ups and companies looking for cost efficiency. Also, number of jobs on R / Python have been reported to increase over last few years. Here is a trend widely published on internet, which shows the trend for R and SAS jobs. Python jobs for data analysis will have similar or higher trend as R jobs:
The graph below shows R in Blue and SAS in Orange.

This one on the other hand, now shows R in Blue and Python in Orange.
Overall, the market based on languages can be pictured as such:
SAS – 4
R – 4.5
Python – 4.5
7. Customer Service Support & Community
R and Python have the biggest online communities but no customer service support. So if you have trouble, you are on your own. You will get a lot of help though.
SAS on the other hand has dedicated customer service along with the community. So, if you have problems in installation or any other technical challenges, you can reach out to them.
SAS – 4
R – 3.5
Python – 3.5
8. Deep Learning Support
Deep Learning in SAS is still in it’s beginning phase and there’s a lot to work on it.
On the other hand, Python has had great advancements in the field and has numerous packages like Tensorflow and Keras.
R has recently added support for those packages, along with some basic ones too. The kerasR and keras packages in R act as an interface to the original Python package, Keras.
SAS – 2
Python – 4.5
R – 3
Other Factors:
Following are some more points worthy to note:
- Python is used widely in web development. So if you are in an online business, using Python for web development and analytics can provide synergies
- SAS used to have a big advantage of deploying end to end infrastructure (Visual Analytics, Data warehouse, Data quality, reporting and analytics), which has been mitigated by integration / support of R on platforms like SAP HANA and Tableau. It is still, far away from seamless integration like SAS, but the journey has started.
Conclusion
We see the market slightly bending towards Python in today’s scenario. It will be pre-mature to place bets on what will prevail, given the dynamic nature of industry. Depending on your circumstances (career stage, financials etc.) you can add your own weights and come up with what might be suitable for you. Here are a few specific scenarios:
- If you are a fresher entering in analytics industry (specifically so in India), I would recommend to learn SAS as your first language. It is easy to learn and holds highest job market share.
- If you are some one who has already spent time in industry, you should try and diversify your expertise be learning a new tool.
- For experts and pros in industry, people should know at least 2 of these. That would add a lot of flexibility for future and open up new opportunities.
- If you are in a start-up / freelancing, R / Python is more useful.
Strategically, corporate setups that require more hands-on assistance and training choose SAS as an option.
Researchers and statisticians choose R as an alternative because it helps in heavy calculations. As they say, R was meant to get the job done and not to ease your computer.
Python has been the obvious choice for startups today due to its lightweight nature and growing community. It is the best choice for deep learning as well.
Here is the final scorecard:
These are my views on this comparison. Now, it’s your turn to share your views through the comments below.
191 thoughts on "Python vs. R vs. SAS – which tool should I learn for Data Science?"
Nikhil Goyal says: March 28, 2014 at 12:49 am
I can say by experience that R is a lot more fun than SAS. Exploring all the packages and conversations on Stackoverflow . . .Tuhin Chattopadhyay says: March 28, 2014 at 4:48 am
Good one Kunal! Liked it.Rashmi says: March 28, 2014 at 5:20 am
I liked each and every article of yours..................... thanks a lot......Chandan says: March 28, 2014 at 5:23 am
Sir, If MATLAB would be included in comparison then it would be best because MATLAB is also very popular and used by a umpteen number of people for data analytics..so plz include MATLAB also in comparison then it will give us clear picture...Nitesh Singh says: March 28, 2014 at 6:21 am
Good ,entire blog gives clear insight of these three tools .Nitesh Singh says: March 28, 2014 at 6:23 am
Good !! Entire blog gives clear insight of these 3 tools .Vinay Verma says: March 28, 2014 at 8:52 am
Very Well written and balanced article Kunal. Being bit biased to SAS, I'd recommend students and seasoned Analytic professionals to look at SAS Visual Analytics, and SAS Visual Statistics, the latest offerings from SAS which are changing the way people see Commodity Analytics solutions.King says: March 28, 2014 at 10:45 am
Hello guys, Thanks for starting this topic. In my opinion languages of the future for analytics are as follows: R => No. 1 => King (Currently R is the King but in future Python will give tough fight to R as Python is both General purpose programming language and data analysis tool due to enhanced libraries like Pandas, Scipy, Numpy as opposed to R which is only statistical analysis tool......In data science domain 70% time of Data scientist 's job is data munging that is cleaning the data and only 30% is real statistical analysis, hence Python seems much more robust and 4 years down the line Python will be the King and R will be the Queen) Python => No. 2 => Queen I don't see any future for SAS.....I think in next 4 years, SAS will not be in the market, it will be completely dead. These are not predictions ....these are real fates of these languages in future.Eloy Sasot says: March 28, 2014 at 4:39 pm
Great post Kunal, thank you!Kunal Jain says: March 28, 2014 at 6:08 pm
Dear King, While you are right on high adaption on R and Python bringing good capabilities to the table. It is too early to write-off / discount SAS. Most of the organizations which are in SAS would continue to use it. Further, there are some interesting products SAS has offered lately - Visual Analytics, High Performance Computing. Also, if cost is the biggest reason for SAS to lose market, they can always play around with price points. Thanks, Kunalchinmay says: March 29, 2014 at 12:55 am
I would vote for R too, Kunal. There is hardly anything SAS can do that R can't. Plus the R package ecosystem is what makes R so attractive. SAS is a legacy investment for many enterprises and hence it will be around but with academia adopting R at such rate it won't be surprising to see R take over SAS in industry soon too...Pavan says: March 29, 2014 at 5:15 am
Nice Post ! In present Scenario people are migrating towards Open-Source platforms rather than counting days with proprietary Software products. Well, if we consider job market and usage only in India "SAS" grabs the kings place. When we look globally "R" & "Python" are leading all their way in analytics Industry. People mostly tend towards R and python due to their drastic raise in development and documentation in terms of Research and Education. As of now there might be some market for SAS, but in future it's market share and usage might decline when compared to it's closest rivals.King says: March 29, 2014 at 2:07 pm
Python is general purpose language like C++ , Java which are used for production development and also Python is good for data analysis like R, so major advantage is that companies using different languages for these two functions will use only Python which adds to higher compatibility between two functions of the company. Other thing is Python, R both are interpreted languages and C++, Java are compiled languages so Python is slower than C++, Java, but why Python is getting attracted in scientific computing, data analysis or quantitative analysis or automated trading is that there is a project called "Cython" (integrating C++ and Python ) which is 98% same as Python but just 2% change in syntax like Python is dynamically typed and Cython is statically typed so Cython is as fast as C++ hence in both quantitative analysis as well as production development common language can be used. That is if project "Cython" picks up and more number of statistical libraries are added in python in future then Python (in turn Cython) will not only compete with R but also it will give huge competition to Java, C#. By the way code development speed for C++ : Python is 7 : 1 that is to develop same piece of functionality C++ takes 7 hours if Python takes 1 hour. Due to all these reasons I said in next 4-5 years Python will be the King. Other thing is that most of top foreign universities have already removed C++ and Java from their syllabus and added Python instead. For pure statistical analysis R is the king till now.Kunal Jain says: March 29, 2014 at 2:44 pm
King, Thanks for this info. You have absolutely right point point the functionalities Python / Cython brings to the table. As a matter of fact, all my learning hours this year have been invested in Python. My only point of difference is that SAS can not be written off yet. It continues to have highest job market share and has a foot in door with a lot of Organizations. Let me explain my view from a different angle. If you are a fresher coming out of college, SAS is still the best bet to get you a job. Since the learning time for any of these languages is 2 - 3 months, and the time period for industry to shift from SAS (if it happens) would be in multiples of it. Hence, I would recommend a fresher to learn SAS, get a job and then diversify your portfolio by adding one of these languages - R / Python. Regards, KunalKunal Jain says: March 29, 2014 at 2:53 pm
Chinmay, Completely agreed on the ecosystem and additional functionalities R brings to the table. Whether R takes up SAS in industry, or Python encroaches R's territory, only time will tell. Python brings in the benefit of ecosystem (to a lower degree though, but given the replacement of C++ by Python as first choice of programming, the ecosystem is set to increase.) and takes fraction of time to code compared to R (especially for newbies), it also won't be surprising if Python emerges as the market leader. Regards, KunalKing says: March 29, 2014 at 3:38 pm
Kunal, I think only one reason why SAS is currently in the Indian market is because huge supply of skilled human resource for SAS as opposed to R and Python. So I think opposite way if you are proficient with R and Python then you will have less competition in the market and more chances of getting placed. Also companies like Oracle and other companies are coming up with the products which are sort of wrappers for R language and that will also increase R future prospects in India. Regards, KingKumar Ravi says: March 29, 2014 at 8:20 pm
Hi All, It's a really nice post and very-very healthy discussion... In my opinion also future of SAS is not as easy as it's present. R is growing very fast and because it is being accepted and widely used in Universities from where the new breed of the people will come they will be well equipped with R and/or Python. But since SAS is quite widely used in Financial Industries where correctness, stability, accuracy, adaptability in market i.e. acceptance in market, reliability and high support environment is required which is still quite high with SAS will not allow SAS to go out from Financial Sector (from other sector it may go early), it will be replaced in future but not in just 4-5 years, later on when the market share of SAS will start falling, SAS may come up with different pricing strategy, offers which might look lucrative to financial industries at that time. Hence learn well what is widely used and accepted in current market and start learning the language of future, definitely learning 2-3 languages is always good than to stick to a single language. Keep exploring new advancements in these languages and keep discussing always. Thanks Kunal for such a nice and healthy topic to bring on table for discussion and thanks to all contributors. Please share some more information/insight on this topic. Regards Kumar RaviRoxy says: March 29, 2014 at 10:01 pm
Very helpful article and very clearly structured. Congratulations! I have had experience with all 3 languages/programs, and I think your analysis is quite valid; albeit with an Indian perspective. I would like to add that I find SAS JMP a major addition to SAS - I believe it is free with base SAS, and otherwise it is a very cheap alternative as a stand-alone. It's forte is preliminary data analysis, but with its own programming language (JSL, which is somewhat C-like) it can be used to automate to almost any degree). It has the shortest, smoothest learning curve by a country mile! Also its partial integration with EXCEL is extremely useful with clients with low levels of sophistication. These considerations mean that, working in an educational (50 PhD students, but non DGI), I am using this tool almost exclusively, to the detriment of my facility with the other 3 main languages.krishna says: March 31, 2014 at 5:21 am
really its a great discussion..Raghav says: March 31, 2014 at 8:47 am
Hi Kunal , Very informative thread on SAS & RKunal Jain says: March 31, 2014 at 8:59 am
Thanks Kumar Ravi! Well articulated.Kunal Jain says: March 31, 2014 at 9:05 am
Thanks Roxy! Nice point about JMP. A point worth mentioning is the Excel add-on for SAS. I have used it frequently and found it very useful way to import data in SAS and then use Pivot tables over and above it. Regards, KunalKunal Jain says: March 31, 2014 at 9:37 am
Chandan, Matlab lies somewhere in between on most of the parameters. I had used Matlab long time back and have not followed the software very actively. Hence some of these views may not be entirely accurate / updated. If you find this to be the case, please update me. Here are my views: Availability / Cost - Matlab is expensive, though lower than SAS. Exact quote might vary depending on the requirements. Ease of learning - I found SAS easier to learn and would rate MATLAB to be easier than R Data handling capabilities - At par with other tools. Graphical capabilities - Good, similar to R Advancements in tool - Similar to SAS. Happens only in releases Job scenario - Matlab has lower market share as compared to SAS and R. It would be higher than Python as of now. Customer service support and Community - Good Hope this helps. Regards, KunalAman Agrawal says: March 31, 2014 at 1:49 pm
vry informative..Chandan says: March 31, 2014 at 5:01 pm
Thanx for reply..........Shakthi says: April 01, 2014 at 5:51 pm
Thank you sir... I have read few of your blogs.. plz keep writing and guiding us.. I'm a student from NIIT Chennai, pursuing Business Analytics course.Chetan Ahuja says: April 02, 2014 at 9:46 am
R is the BJP(if they win), Python( like AAP) , for SAS (CONGRESS?)pranay says: April 02, 2014 at 4:32 pm
sir, should i go for SAS TRAINING & CERTIFICATION or BUSINESS analytics training using SAS,EXCEL. WHICH WILL GIVE BETTER JOB prospective for FRESHERS????? PLSS suggest some institutes in bangalore Either online or class room I have completed B tech electronics and have worked in a core company whose experience wont count at all I have also appeared final yr MBA this yr. PLSS reply...Jean-Marc says: April 04, 2014 at 7:29 pm
Hi, Great article. I enjoyed looking and thinking about how you scored each language on each point. You obviously put real thoughts into this topic. Here's a different (and complementary) perspective from industry. I was just at Big Data Paris earlier this week. Some observations on the trade show floor and the presentations: 1) SAS had a booth, but it was empty most of the time. Not a single presentation mentioned SAS as a data analysis platform to invest in. My interpretation of this is similar to what's been said so far: they have a large installed base, but not where new developments and new business investments are going. Having said that, SAS isn't going away. Remember COBOL, a language of the 1970's. It was still a BIG deal in 2000 with the infamous Y2K problem that all large businesses still running mainframes had. In a similar way, SQL is the leader in databases today, but Hadoop is quickly taking over market share due to the power of map-reduce and its open source aspect, and pushing SQL aside. Perhaps it's no surprise that Intel invested $740M in Cloudera this week (Hadoop integration leader) and canceled their own big data program, pushing Cloudera's valuation to $4.1B (they are still a privately owned startup). Hadoop is the new generation, while SQL is the old guard. Perhaps the same can be said with SAS vs. R/Python? 2) There was a huge focus on Hadoop as the DB platform, coupled with R as the main engine for serious data analytics. Many presentations couple that with several other specialized tools for simple visualizations (Tableau, etc.) so that the business can enable non technical users fairly easy and provide simple ways to explore and visualize data straight out of the Hadoop DB ecosystem. In other words, the simpler tasks of SAS are being replaced by visualization software tools like Tableau while the more serious data analysis seems to be going to R which, as stated elsewhere in this post, is experiencing growing momentum with new packages, etc almost every day. 3) Very little mention of Python as the core data analysis platform. No doubt Python is quickly becoming the language of choice for general purpose work and many data analysis tasks (Pandas, matplotlib, etc.), but at least in this conference, it was not seen as what to use for a core enterprise big data analysis platform. Please don't misinterpret this comment however. Python was still mentioned, but more in light of general purpose work and some analytics. Key takeaway: it's important to realize that today's installed base is not the future. Where investments are going is what the future is about. And at least from this conference, it looks like enterprises are investing heavily into the Hadoop / R ecosystem (and Python). Given these are open source platforms and adopted by leading companies like Google and facebook, you can expect the momentum to accelerate. This does not bode well for SAS's future in my opinion, which will become a legacy product (but won't go away for a long time still). I hope this sheds some light with a (different) industry perspective. Thanks again for the great article.Kunal Jain says: April 05, 2014 at 3:22 am
Jean-Marc, Very well put across and a very clear perspective on how things stand. I have a couple of points to add: 1. The evolution is at a different stage in different markets. In India, for example, SAS Annual forum is still one of the most sought after event to attend. 2. You are talking about a forum on big data, which is still a low percentage (but with high growth) of the entire spectrum. I agree with you that future lies in open source ecosystems, but for people looking to enter industry, SAS may still provide the best hope (for at least couple of years, if not more). Regards, KunalJean-Marc says: April 05, 2014 at 6:21 am
Kunal, Very good point and I wholeheartedly agree with you. In fact, and supporting your points, if you are a new graduate, it's always best to learn the tools of the installed base (e.g. SAS). If you are more senior however, you should consider learning the tools and methods where new corporate investments are going. This is really a reflection of the nature of projects a junior vs. more senior individual are likely to get involved in (junior = tactical work vs. senior = strategic work). Of course, there are lots of exceptions to this! Best regards, Jean-MarcKunal Jain says: April 06, 2014 at 7:17 am
That's a very fair point Jean-Marc. Thanks for bringing it up. Regards, KunalDhana says: April 06, 2014 at 2:01 pm
Hi Kunal, Just wondering why you haven't included IBM SPSS in your comparisons? When it comes to cost, SPSS is expensive too, however it lets you to create your own statistical models like R. I would like to vote for SAS at first hand, this is because SAS allows to automate tasks, for example, read batch of excel or text files into one SAS data set. (I must also admit that I never tried the automated tasks in R or Python yet). Thank youKunal Jain says: April 08, 2014 at 3:40 am
Dhana, The main reason for not including SPSS is because I have not got many queries asking for this comparison. Additionally, in the commercial softwares, SAS holds the highest market share. That gives readers a fair idea about the spectrum. If you want to add your views on SPSS, please do so for benefit of the bigger community. Regards, KunalKunal Jain says: April 08, 2014 at 4:06 am
Pranay, The training depends on how you want to pursue your career. If you want to get into technical data analyst role - SAS training / certification is more useful. However, if you want to build a career as business analyst, certification courses are better as they provide more case studies, are typically more spaced out so that you can apply concepts at work. Online / offline have their own pros and cons. Online provides flexibility, where as offline provides more opporunity to interact with instructor and class. See what fits you better. Regards, KunalZJ says: April 09, 2014 at 5:02 am
I really can't agree that R can handle data as well as SAS. How to handle gigabyte size data with R? Using Revolution R, but that comes at a cost, this would negate the cost aspect of R. So SAS is still the way to go if you routinely have to manipulate gigabyte size dataVarun Rajasekaran says: April 11, 2014 at 2:17 pm
Hi Kunal, Very well written, clear and concise article!!! I was searching for such an article since many days, finally found yours!! I am currently working as a Solution Developer in Tata Technologies,Pune with almost 5 years of experience in enhancement and support of legacy IT applications belonging to a major European automotive giant. Recently I have completed a course in Business Analytics where I have learnt Excel VBA, SAS, SPSS and application of analytical concepts in SAS and SPSS. I have some queries : 1) I am interested to work as an Analytics Consultant. So which certification in SAS would be beneficial ?Is BASE SAS certification enough? OR do I need to do 'SAS Certified Statistical Business Analyst Using SAS 9'? 2) Would it be better to enter the industry as a SAS Developer\Programmer and then move into the Analytics profile? 3) The job openings for Analytics in Pune is very less compared to Bengaluru, Noida/Gurgaon and Mumbai.Also many companies are asking for minimum 2-3 yrs of relevant experience. How to get this relevant experience?Also what is your take on the analytics job market in Pune in the near future?Hunter Dou says: April 12, 2014 at 3:11 pm
I would say that R is more and more popular in academies and universities and those people being skilled in R will make R more competitive in market. At the same time, SAS is also investing a lot to compete and its integration with big data with Hadoop ecosystem is accelerating. And as a enterprise analytical platform, SAS still has its advantage especially in Financial sectors which does not take cost as the highest consideration in selecting tools. At the same time, SAS has been more and more cheaper for university students to compete R in future users.Kunal Jain says: April 15, 2014 at 5:34 am
Varun, Here are answers to your questions: 1. SAS Certified Statistical Business Analyst Using SAS 9 is more beneficial. You can also attend Predictive modeling. 2. If you can get a break through directly, that is preferred profile rather than entering as a developer. 3. Pune offers limited job opportunities. Bangalore, Deli NCR and Mumbai combined will have 90%+ market share in jobs. It would help if you are in any of these cities. Consultancies typically hire straight out of college, but because of high demand they are looking at lateral hires. You can look at them (e.g. Fractal, Mu-Sigma etc.) Thanks, KunalKunal Jain says: April 15, 2014 at 5:58 am
ZJ, This used to be a big problem with R. However, a lot of work has happened on this front on R. You will find some of the work if you research. Following is a good article with some handy tips: http://theodi.org/blog/fig-data-11-tips-how-handle-big-data-r-and-1-bad-pun Thanks, Kunalvipin says: April 15, 2014 at 6:19 am
Hi Kunal, I am working as Java Developer with 22 months of experience but want to make a career change . I want to become a data scientist.which certification of SAS i should start with and other things to get a job in this field. - I am doing some courses on Coursera like Machine Learning,Big Data in Education,Statistical InferenceZJ says: April 15, 2014 at 6:24 am
Yeah. Until the day I know for sure before I start a project that it will work then it's still not solved yet. Good article though!ganeshbabu says: April 16, 2014 at 3:28 am
Nice Article Kunal. It has given real insights.Willy Kuo says: April 16, 2014 at 10:11 am
Take a look at following project, you might give python higher scores in most of parameters http://numba.pydata.org/ http://blaze.pydata.org/index.html http://bokeh.pydata.org/ And, it's better to mention pandas and statsmodel in python. It learns something from R (such like Dataframe). And it runs much faster then R.Dire Chidi says: April 17, 2014 at 1:14 pm
Kunal , You article is significant to me ,because i have been searching for such information for a while now. I really want to learn one of this languages,which one should i start with and were?,do you have any online training course on this ?. Please let me know RegardsProf Toby Mammen says: April 18, 2014 at 7:22 am
Kunal, That's a brillint article that has added value to a lot of people. It will enable a lot students graduating to get a perpespective regarding the current industry scenario. This article and the comments that follow makes my task easier in counselling students. Thanks Once again.Kunal Jain says: April 20, 2014 at 3:47 pm
Thanks Prof. Toby for your appreciation. This value add to audience is what keeps us ticking. Regards, KunalKunal Jain says: April 20, 2014 at 4:00 pm
Dire, As mentioned in the article, the right answer about which one to start actually depends on your experience and role. Let me know these details and I might be of more help. Thanks, KunalKunal Jain says: April 20, 2014 at 4:12 pm
Willy, Thanks for these links. Have just glanced through them and they are definitely representative of the work happening on Python. Pandas have obviously been a significant update to Python and has made it a worthy competitor of R. Thanks, KunalKunal Jain says: April 20, 2014 at 4:24 pm
Vipin, Coursera is the right place to start this learning. You can start applying them by participating in competitions on Kaggle. Look at this article for some more handy tips. Thanks, KunalTed Dunning says: April 21, 2014 at 4:48 pm
You count off for R because it doesn't have support. That really isn't correct. Revolution Analytics provides professional support for R much the way that SAS (the company) provides support for SAS (the product family). The only differences are that the company doesn't have the same name as the product and in the case of R, much of the code is written by the community.Kunal Jain says: April 22, 2014 at 12:08 pm
Ted, Thanks for your comment. If I look at the Revolution Analytics for comparison, the cost advantage I have provided to R goes away. We can compare it either ways. Thanks, KunalKing says: April 25, 2014 at 4:27 am
Kunal, Revolution Analytics's packages rhadoop, rmr2 etc are free of cost......but one has to know the installation and working of these packages. And you are right that Revolution R enterprise is licensed and not free. Thanks, KingVarun Rajasekaran says: April 25, 2014 at 7:37 am
Hi Kunal, In continuation of my previous post, can you suggest me the training materials/books/onsite websites for 'SAS Certified Statistical Business Analyst Using SAS 9 ' exam ? I have enquired regarding the official SAS crash course training for the same, but the cost is a whopping Rs.72,000/-..... Also, could you give me some tips on mastering statistics concepts and SAS,Excel and SPSS? I guess your answer will be to practise as much as possible right? Also, could you tell me some links where I can get real-life case studies and datasets? I have found some which are given below: 1. http://www.principlesofeconometrics.com/sas.htm 2. http://support.sas.com/learn/statlibrary/statlib_eg4.2/top_data.htm 3. http://support.sas.com/learn/statlibrary/statlib_eg4.2/anova_1.htm#KB5Kunal Jain says: April 27, 2014 at 7:23 pm
King, Thanks for the information on these packages. Regards, KunalKunal Jain says: May 03, 2014 at 5:19 am
Varun, If money is a constraint, look at Jigsaw Academy. They have good course and about half of the price sompared to SAS. It is also more spaced out. Will share case studies through an upcoming article. Regards, KunalTatiana says: May 08, 2014 at 12:15 pm
Thank you for the article. I was about to choose what to learn SAS or R. Looks like R is a winner. I used MATLAB and found very easy to use. Everything is done for you, just plug and play. Easy C. However, I found that in MATLAB some algorithms are hard to modify, you almost need to rewrite them. But it manages data structures and file with relative ease. It also have quite a good visualisation tolls. Many Thanks for the article.Steeve Brechmann says: May 12, 2014 at 6:14 pm
Have you try the Wolfram Language? It does everything for me. Personally, I use the Wolfram Language and Python for all my data science work.Kunal Jain says: May 13, 2014 at 5:12 am
Steeve, Had tried Wolfram language some years back. Have not tried it recently. It would be great if you can add some details on the attributes of comparison. It will help other audience on the blog. Thanks, KunalDavid says: May 29, 2014 at 1:42 pm
GNU Octave is an open source tool that is MatLab like and generally compatible http://www.gnu.org/software/octave/ I have seen this tool used in some university analytics classes along with R and Python. This reduces the cost factor for comparison.Angus says: May 30, 2014 at 8:38 pm
One big factor that will keep SAS in the market is government regulation. Banks, clinical research organizations and any other industry that is required to report to the government will be required to use SAS.Kunal Jain says: June 01, 2014 at 12:51 pm
Completely agree Angus. Also, in response to the open source platforms, SAS has now released its own lite version, which can be downloaded for free. It also plans to being a few courses on MOOC platforms, which would change the scenario completely. It is all set to become more interesting in coming days. Regards, KunalKishon Dave says: June 11, 2014 at 11:50 pm
Kunal, Thanks for hosting the discussion. You mentioned that R's in-memory data processing is no longer a limitation. Can you please clarify what are the techniques to overcome this limitation that R has?Gabriel Preda says: June 12, 2014 at 4:29 pm
Great presentation. Very informative, well structured. Take a look also to this, if you do have the time: http://www.chrisstucchio.com/blog/2013/hadoop_hatred.htmlKunal Jain says: June 15, 2014 at 8:14 pm
Kishon, What I meant was that with 64 bit architecture, there is no absolute limit to what R can handle (which was not the case with 32 bit machines). Regards, KunalFiroj says: June 17, 2014 at 8:20 pm
Hi Kunal, I have gone through the all comments on this block. It's very informative & well structured. I want to switch the technology. I have 3.5 years in IBM(SPSS) Dimensions(Scripting) field.Currently working on Dimensions-6.0 version in Hyderabad. I want to learn SAS as new technology. Can you let me know, is it good to start with SAS technology with base & advanced sas course? Is their job opportunity in Mumbai or Pune for SAS which allows me to show my current technology experience? Also let me know, which domain I need to select (Clinical\Finance\Insurance) to make career? Regards, Firoj Shaikh.Seleena says: June 18, 2014 at 6:27 am
Hi Kunal, Thanks a lot for this article!! I was confused to pick the right course to go for. The article has really helped me. could you please help me with what kind of knowledge would one require to take up R? (I'm relatively new to analytics)Kunal Jain says: June 23, 2014 at 10:44 pm
Seleena, There are various courses running online - you can check out Jigsaw for a paid / assisted version or Data Science specialization for a free community based learning. None of these course require you to know analytics before hand. Regards, KunalKunal Jain says: June 23, 2014 at 10:48 pm
Firoj, Here are my suggestions: 1. Base and advanced SAS are good place to start 2. There are multiple opportunities in each of the city. You can check out our jobs page for more details. 3. I would say to learn SAS with a broad application and choose the domain later on. Regards, Kunalneha says: July 02, 2014 at 4:18 pm
Hi, The post and discussion here is very informative. Thanks for this. I would like to take guidance on what path to choose in Analytivs. I have 7 years of experience in BI and data warehousing and want to enter into Analytics. Should I start with SAS or R? Moreover, since I am more interested in consulting what courses should I go for? Thanking in advance. NehaMayur P says: July 03, 2014 at 6:43 am
Dear Kunal, Thanks for this Post, its just getting more confusing for me though. I am into SAP MM Support & wanna move into Business Analytics. So want to learn course on Analytics, but now dont know from where to start. 1. Kindly provide me info for Classroom foundation courses on Analytics. 2. Will they provide SAS or R software ? 3. Hows EduPristine for learning Business Analytics ? I am thinking on joining it as its classroom. Kindly reply, thanks in advance.Kunal Jain says: July 09, 2014 at 7:14 pm
Mayur, We are coming up with a comparison in short time, so stay tuned for recommendation on it. Edupristine has mixed reviews, so I would not recommend it. A better way might be to join SAS certification from one of the certified partners or taking up online instructor lead courses. Regards, KunalKunal Jain says: July 09, 2014 at 7:19 pm
Neha, SAS would be my recommendation, given its market share in jobs. Also, it would also help in your BI role. Regards, Kunalvittal says: July 11, 2014 at 9:41 am
Good informative article mr kunal,i am learning my masters in data science and we are taught data aanalysis using spss ,how spss is different from sas and other programming paltform and languages,let me know.kindly reply . thanksFadi Ali says: July 18, 2014 at 11:46 am
Thanks very much for your thoughts, Kunal. We are about to open a financial instruments trading company. We will trade currencies, stocks and others. We originally are manual traders who want to "translate" our efforts into alogs. You know this needs programming , stats and back testing. We will start from Dubai and we are totally ignorant of what tools we shall have. Do you have an advice in that? I mean: 1. What kind of "technical" person we have to put as an architect to manage all this; 2. What kind of programming tools a programmer employed with us must have; 3. What kind of programs such as SAS, R, Python or Matlab we shall have "we can pay for expensive ones but we are only keen to have THE BEST"; 4. We are very keen to kill latency, so also what kind of persons and tools we shall have for that. I am searching and reading and have a lot of info that indeed does not give me the STRONG and VALID start up. If you could help. "I don't know if we can arrange something business wise". Thanks a lot in advance. FadiJim says: July 24, 2014 at 4:29 am
SAS is NOT AT ALL easier to learn compared to R. The very old fashioned macro language make SAS really awkward. You can easily forget everything and code like a novice in SAS if you don't code it constantly. Why? Because there is no consistent logic and intuition in SAS. Coding in SAS, you have to memorize many inconsistent syntax and the rules of macro. Coding in SAS you spend 90% of the time debugging. The major advantage of SAS, in my opinion, is speed. That's the only advantage I can think of about SAS.Kunal Jain says: July 28, 2014 at 6:53 am
Vittal, SPSS falls in the same category as SAS - a proprietary software. Your focus should be on the fundamentals of various techniques, rather than the tool. If you pick up SPSS, you can always learn SAS. Regards, KunalKunal Jain says: July 28, 2014 at 6:56 am
Fadi, It is best to take this discussion one on one, as I need to understand the background in more details before advising any thing. Please drop me a mail on [email protected] and we can take the discussion forward. Regards, KunalPuneet says: August 02, 2014 at 4:26 am
Thanks Kunal for sharing your invaluable knowledge regarding this comparison. I live in Ontario, Canada. Can you please guide me about job market regarding SAS vs. SQL in Canada these days and probably one year from now in business analytics/data analytics? I have fairly good knowledge of SAS and SQL. I shall graduate after one year. I am doing my doctorate in Applied Mathematics from the University of Waterloo. In which directions should I proceed for certifications in SAS/SQL to be data analytics/business analytics? Thanks in advance, Best Regards, PuneetRajesh Reddy says: August 02, 2014 at 4:34 pm
thanks for informationashu says: August 04, 2014 at 6:00 am
i want to know a book on SAS for beginnersRamesh says: August 07, 2014 at 10:08 am
Nice articles. Great information. I have more that 20 years of industry experience (11 years in IT software alone in tech writing and testing) and want to move into Analytics. Given this scenrio, would you think "R" is good choice?Cat Chenal says: August 07, 2014 at 9:01 pm
Thanks for the link!Abhishek says: August 22, 2014 at 6:50 am
Good work !!! concise , crisp and to the point.R.Venkatesh says: September 03, 2014 at 8:32 am
I completed my MBA (Finance) in 1998, for the past 16 years i am managing my own small scale plastic recycling industry. Now I would like to shift to Business Analytics. Is it worth to pursue PGCBA course from NIIT? Will my experience be counted? How about the job opportunities for me? Please suggest. Thanks.Kunal Jain says: September 03, 2014 at 6:41 pm
Venkatesh, The course from NIIT has not received good reviews and would be waste of time. A better and far more effective thing is to look for online courses offered by Coursera (if you think, you can learn on your own) or Jigsaw Academy (if you need trainer assistance). Regards, Kunalkriss says: September 03, 2014 at 10:25 pm
Hi, I am curious what about performance? Would be great to add this parameter. As far as I know R is very slow. Matlab is fast and Python somewhere in the middle (depend on used libraries). I do not know what about performance of SAS/SPSS. Someone know?tk SIngh says: September 05, 2014 at 7:15 am
Hi Kunal' Recently i have completed my SAS course Base and Advance. Can you please suggest whether i can attempt the worldwide certification exam or i should get some experience before attempting. And please suggest some supporting courses which will help me in future with SAS. I can go for R or Ms sql Courses Like.Tarun says: September 06, 2014 at 4:00 pm
Hi sir, Recently i have complete sas and looking for job. i want to get suggestion from you that can i search job or i should some other course like R or phython;. And one thing more i can go for EXAM OR i should have to get experience. please suggest meMel says: September 10, 2014 at 11:43 pm
All are great languages. Having used Python, R, S, Stata, LIMDEP, and virtually everything out there over the last 25 years, when it comes to crunching out large scale production on complex data projects - and going home at 5 PM - SAS is by *far* the better choice. It's expensive, but so much of the dirty work is handled for you. It's not just a language but a complete processing system. There are stats procedures that are no naitive to SAS (very few) but you can program any estimator using the matrix language (IML) and seriously. If cost is a problem, check out World Programming System - WPS. They are a SAS clone language with basic SAS elements covered for a fraction of the cost. Cross platform to boot. Everytime I have to use these other systems I essentially cringe because I know it's going to take longer than I really want to spend. Life is short, the information value of a number is fleeting, SAS.vasavi says: September 12, 2014 at 10:38 am
hi shakthi I would like to know how the business analytics course in NIIT in Chennai and which branch you are taking up the course. I'm planning to take up the course but still in search for a right institute.So pls help me with some info about the traning and the placement oppurtunities that they provide for business analytics. Here is my email id: Kurr[email protected] Thank u VasaviZubair says: September 15, 2014 at 6:00 pm
Which version of SAS are you using?? I am certain its not latest. SAS has best in class Graphic capabilities..! way way ahead than R. I use both I love SAS.Kunal Jain says: September 17, 2014 at 6:16 pm
Zubair, I differ with your opinion here. SAS has pre-defined graphs, which are not flexible to changes. Try creating a heat map based on conditional formating in SAS and you will probably see what I mean. Regards, KunalJohn H. Gonzalez says: September 19, 2014 at 10:46 am
I have learned about R but it seems that most of the companies are interested in using SAS. Also that i had Read some where written by the medical statistician that if they make some mistakes, or we havecome across some BUGS using SAS it will recompensate it but R does not have any feature like that....!!!Sana says: October 07, 2014 at 10:33 am
hi kunal I have a very little knowledge of c programming but i can manage with the basics. I have an interest in learning SAS. Since i have a poor background of c programming will it be hard for me to learn SAS?Kunal Jain says: October 08, 2014 at 3:35 am
Sana, Poor background in C has no bearing on the ability to learn SAS. Functional SAS is easier to learn (compared to C) and also has an option of using Graphical interface for a lot of operations. Hope this helps. Regards, KunalHEMA says: October 15, 2014 at 9:26 am
hi sir, i have 2 and half years old baby... so i didnt worked last 3 years. before 3 yrs i was worked in quantum, MR dimensions on market research. i learned SAS on that time. now i am confused which one i can choose. can anyone help me which tool i can choose?Sujit Rout says: October 24, 2014 at 10:42 am
Hi Kunal, The post and discussion here is very informative. Thanks for this. I would like to take guidance on what path to choose in Analytics. I have 3+ years of experience in MIS and data Analyst filed. Basically I am working in Excel, SQL and MYSQL. Which course I have to do, so It will give better Job Opportunity and How much cost I have to do for each and every course and also please tell me name of professional institute in noida providing SAS. Should I start with SAS or Python or R? Moreover, since I am more interested in consulting what courses should I go for? In SAS which version I have to used? Regards Sujit Sec-12, Noida.Appreciative Vistor says: October 24, 2014 at 6:43 pm
Thank you for your post. It is helpful to have such a well-informed and balanced discussion of such a timely topic in an open forum. I appreciate the time and effort you put in to execute your careful review, as well as the time and effort you have put into moderating the discussion.shekhar says: October 25, 2014 at 12:47 pm
Hi Kunal, Nice article Great information Presently i am a Hadoop Administrator from past 1.5 Years, My Background is EMC SAN DMX/VMAX & Redhat Linux. I want to go into Data Analysis/ Data Sciences 1. How to become a Data Scientist? 2. Please guide me from where should I start (Technologies/Projects). 3. Is it mandatory to have a background into languages like R/Python/SAS/any other ( I feel comfortable with python), which one is most required as per Industry Demands? Look forward to hear from you, Shekhar Nagle +91 97666 97897 [email protected] PuneSomesh says: November 13, 2014 at 7:50 am
Hi Kunal, very good scorecard, it should hold validity till mid 2015 then we should see lot of shift, with decision engineer's coming over data scientist and product as an outcome being more realtime than tool for analysis data i see a healty mix of open source and SAS/SPSS like tools, toards end of 2015 python will take the lead over R and SAS in terms of available people who know it, since web designing used it a lot and it is an engineering language. I have worked and tested about 50+ analytical softwares and worked for top 8 domains of service industry both as consultant as well as decision engineer. My vote clearly is for the software that would handle unstructured data well over time like, text, speech and finally video. or provide the best way of analysing and using it in prescriptive decision grids of PMML in real time. The cost of software should go down and the ROI should go up for Analytics by 4X or more.. Good read.. very interesting!!!Somesh says: November 13, 2014 at 7:51 am
Hi Kunal, very good scorecard, it should hold validity till mid 2015 then we should see lot of shift, with decision engineer's coming over data scientist and product as an outcome being more realtime than tool for analysis data i see a healthy mix of open source and SAS/SPSS like tools, toards end of 2015 python will take the lead over R and SAS in terms of available people who know it, since web designing used it a lot and it is an engineering language. I have worked and tested about 50+ analytical softwares and worked for top 8 domains of service industry both as consultant as well as decision engineer. My vote clearly is for the software that would handle unstructured data well over time like, text, speech and finally video. or provide the best way of analysing and using it in prescriptive decision grids of PMML in real time. The cost of software should go down and the ROI should go up for Analytics by 4X or more.. Good read.. very interesting!!!Fab says: November 13, 2014 at 1:03 pm
I think we are missing an important point here. While R and Python can be great for start ups and small companies that don't have a budget to pay for expensive solutions, I really can not imagine multi billion corporation using freeware for their risky business. Can you imagine a big credit card company running their Risk Management operation using R? I can't.prakarsh mishra says: November 21, 2014 at 7:56 pm
Hi kunal, the article is really good actually my confusio i am working with a Ecommerce company and its been 1 year now. I was from biotechnology background and shifted my field towards IT. My work here is to deal with quality and data base. So should i prefer sas for further growth or any other certifications. With growing market what you suggest in between the IT quality and business analyst. kindly guide thanku & regards Prakarsh mishraMithlesh says: November 29, 2014 at 8:00 pm
Hi All, I have 10yr of IT experience including 6 yrs. in information access and enterprise search. I want to change or can say broaden my job area in Big data search and analytics. Just want to know from you guys, from where i can start to prepare my career in analytics. Thanks in advance. MithleshKunal Jain says: December 01, 2014 at 10:11 am
Mithilesh, You can look at some of the courses from Bigdatauniversity to start with. Or undergo Cloudera certification. Both of them can give you good start into the domain. Regards, KunalAbhijit says: December 10, 2014 at 6:38 am
Hi Kunal, First of all i would like to thank you for the valuable information given above.I am working in a ecommerce industry from the past 2 years(basically into operations) and i would like to learn analytics.I would need your help to sought out few doubts i have. 1) Is it a good career option to learn analytics? 2) From your feedback provided above i believe u insist learning SASAbhijit says: December 10, 2014 at 6:45 am
Hi Kunal, First of all i would like to thank you for the valuable information given above.I am working in a ecommerce industry from the past 2 years(basically into operations) and i would like to learn analytics.I would need your help to sought out few doubts i have. 1) Is it a good career option to learn analytics? 2) From your feedback provided above i believe u insist learning SAS first and then go for R language.So should i go for a certified business analytics professional programme and then go for a certified R programmer course? 3)I have a 0 knowledge in analytics and a basic knowledge in SQL and C.Basically i need a job change and want to grow my career as a BA. What would be helpful.Kindy suggest. Look forward to hear from you, AbhijitWasifur says: December 23, 2014 at 10:17 am
Adding few important points, comparing SAS vs R is not comparing apple to apple. If we compare R with "Complete SAS" then R is nowhere in the league. If we compare R with Base SAS, then R is better, as it gives all the facility for free which base SAS gives. R is hard core programming, but open source (free), which can be used to develop enterprise wide analytics application. It has thousands of built in function to solve complex analytics problem. For non-programming background, little difficult to expertise. R is widely used in academic and SME. SAS is a complete end to end solution from Data management to Data visualization to ETL to BI report to Advance analytics. Even with SAS EIS and SAS EF we can develop enterprise wide large scale application. Also SAS SCL can be also be used as general purpose OO programming language like Java, C++. More important aspect, SAS EG or E-miner has user friendly GUI, which can be used by non-programmer comfortably. In industry SAS is widely used due to below few reasons. 1. Data security is very high, for this reason in BFSI domain SAS is no. 1 choice. 2. SAS has over 250 industry specific “point and click” solution like credit and market risk, Asset/ Performance management, Fraud/ Pricing/ Marketing analytics. 3. SAS DI is a very powerful ETL tool which augments the power of SAS in Data management. 4. SAS BI provides the facility to produce world class dashboard in real time. 5. In corporate time has more value than product cost. 90% of Fortune 100 companies uses SAS. 6. SAS visual analytics analyzes Big data in real time with great visualization as “Tableu”. In future when completion grows, Base SAS can be made free as SAS University edition. SAS is much more than just an analytic tool and it will always remain dominant in future.Roland says: December 24, 2014 at 11:13 am
Hi! Try the R packages data.table or dplyr and compare it with SAS or Pyhton? R is very Low?Roland says: December 24, 2014 at 11:15 am
I agree with you. R is easier to learn compare to SAS.Rekha says: January 03, 2015 at 11:04 am
Great comparison done! Thanks for sharing also! The Data Science function is not fully formal yet in many large F100 / F1000 enterprises. There are few roles starting to appear on both business & IT domains of Data Science. Hence expecting to find a comprehensive enterprise class Data Science stack will be really hard. There's lot of hype and glamour surrounding Big Data, where both commercial and enterprise Hadoop stacks have been deployed and playing a successful role. Even the Big 3 (IBM, Microsoft & Oracle) are working on their flavors of Hadoop either organically or through partnerships. These Hadoop stacks enable conveniences for languages, connectors, APIs, storage management, DR, High Availability, High performance, real-time vs batch processing SLAs etc. But there's little or no emphasize on analytics / statistical modules bundled with them. Few exceptions are Mahout, MapR, but mostly a standard based open source extension. As an Enterprise Architect, I prefer R (R Studio) for EDA with a more collaborative context on Git / SVN primarily for Data Scientists / Statisticians / Data Modellers. Even though I'm yet to do this, we are actively looking into Cloud based Hadoop stacks which has native support to run R large scale on Big Data sets. From an analytics consumption perspective, we are exploring Tableau (can really augment R graphs well) and our current landscape (MicroStrategy & Microsoft BI - Fancy Excel).Ashish Gautam says: January 07, 2015 at 10:53 am
Hi Kunal, I have done B.tech in Electrical and Electronics and have a experience of more than 1 year in Market Research field. Now I want to go in Business Analytics and i'm interested in doing a course in Analytics. So please suggest me a good training institute in Delhi NCR. Regards, Ashish GautamAyush says: January 16, 2015 at 12:16 am
Hi kunal, I am working as data analyst having 2 years experience but with no knowledge of SQL. My educational background is B-tech in electronics and communication. I have opted for R and hadoop language programme. So please suggest how it will help me in boosting my career seeing current industry trends. Also suggest if I should have done SAS instead. Thanks in advance. Please reply.Raghunathan says: January 26, 2015 at 1:25 pm
Dear Kunal, Sorry I am visiting your blog only now - regret how I missed it all these days! Very informative and current. By the way, I am working in IT of a large PSU for over 20 years. I am working with JSP with Oracle backend. I have worked on Data mining techniques. I am getting interested in Analytics, in a bid to analyse some of our business data (no official investment though). I am doing R with Coursera as I heard that SAS is a priced package, although said to be more comprehensive and feature-rich than R. However, I wish to know the following: If SAS is easier to learn and use, then can I have the link to download a free version of SAS (eg., Base SAS)? Pl. provide the details.Raghunathan says: January 26, 2015 at 1:27 pm
Dear Kunal, Also, I'd like to know how far I can use SQL with R to handle data as we are already using SQL with Oracle?deepak gandhi says: February 22, 2015 at 9:43 am
hi at present iam doing msc-statistics final year,and also iam learning sas, so can you tell how be the future and opportunities for meRadhika says: February 24, 2015 at 7:17 pm
Nice one Kunal!ERICK OKELLO says: February 25, 2015 at 6:41 am
I would still go for R any day any time. It has high functional capabilities.Supratik Roy says: February 26, 2015 at 12:19 pm
Hello Kunal! I am a B.Tech in Chemicals Engg and a MBA with marketing.My entire career for 11 yrs till date in in Sales and marketing for specialty chemicals and bulk chemicals used for industries like Paints , Construction Chemicals, Adhesives etc. For the last 2 years I am working on forecasting of sales , conjoint analysis for new product development, penetration rate of existing products for my organisations apart from sales. Mean while I ve bagged cirtification on R Programming language and Business Analytics. Frankly speaking I am getting bored with this Sales job and want to concentrate fully on Market Research using Business Analytics. Can you provide me some light as guidance???aniruddha says: March 06, 2015 at 5:23 am
Hi, I am currently doing mba financial services from a tier 2 B school in Mumbai;posses a BE (IT) background with experience in pl/sql programming in a MNC.I am interested in Credit Risk analytics like modeller or model appreciation etc.Currently I am learning R;but I doubt whether R is used in this segment or not.Would it be better to learn SAS.Kunal Jain says: March 11, 2015 at 12:46 pm
Hi Ayush, As you have mentioned in the question that you have opted for R and hadoop. If you have started learning R and Hadoop, then I would suggest you to proceed. As Foundation course in SAS is the basic course. If you haven't enrolled yet for course in R and Hadoop. I would recommend you to move forward and learn foundation course in SAS rather than R. Regards, KunalKunal Jain says: March 11, 2015 at 1:02 pm
Hi Ashish, If you want to go for certification in Business Analyst, you can undergo the certification from SAS Institute. SAS Institute provides trainings for these certifications in Delhi, Bombay and Banglore. However, if cost is a constraint, you can do freely available online trainings. You can also look at our SAS learning path here. Regards, KunalAmol K says: March 20, 2015 at 11:22 am
Hello Aniruddha, R provide good support for Credit Risk Analytics. Using Logistic Regression or CART in R you can differentiate between “low risk” and “high risk” debtors prior to granting credit and classify them as default and non-default and identify the key determinants of default behavior. On the similar line you can also develop a predictive model in R order to identify the propensity of the accounts to pay or not pay their bounce EMI within duration of one month. Only the problem is that you need to write your own R code/R Program. Regards, Amolgvrp says: March 20, 2015 at 1:49 pm
Dear Kunal, I am B.Com Graduate having around 7 years of experience in IT Recruitments.For the last 3 years I have been at home doing Freelance Recruitments.Now the fact is I am not getting enough leads from my associates and tell u honestly my life is at stake.My currents earnings are almost NIL. Now my plan is get trained in Data Analytics with R and want to switch my career as DATA ANLYST. If I take up this course can I get a job at this point. Pls help...thanks.. gvrpKunal Jain says: March 20, 2015 at 4:40 pm
Hi Abhijit, 1. Yes, It is expected to be one of the high growth sectors in next 5-10 years. A word of caution is that employers are currently looking for people with some experience. 2. At this stage, I would suggest you to focus on a single language – SAS. You can do a certification course of Business Analyst with SAS then apply those learning to a few problems. 3. Apply your learnings to a few datasets and participate in a few Kaggle competitions. Gain hands-on experience and approach to the job openings from our job portal. If you do well in Kaggle competitions, you can get good job opportunities. Regards, KunalKunal Jain says: March 20, 2015 at 5:55 pm
Mithlesh, Would recommend learning Big data Tools (Hadoop/Pig /Hive/MapReduce) to expand into Big data Analytics. You can look at Wiley certified Big Data courses or the course on Edureka. Regards, KunalKunal Jain says: March 31, 2015 at 7:34 am
Gvrp, Which course are you referring to? Regards, KunalGvrp says: March 31, 2015 at 4:10 pm
Dear Kunal, Thanks for your reply. I am referring to Data Analytics with R. Now my plan is get trained in Data Analytics with R. Reds GvrpPriti says: April 22, 2015 at 5:06 pm
Hi Kunal, I was thinking of taking R programming, but just came to know about SAS and not sure at this point which way to go? I live in california and would like to get in medical industry and would stay there for next 10 years. I am 55 and not too ambitious at this age. guide me with your expertise which way to go . R is difficult and not sure about programming. I do have background in computers and worked in companies as QA engineer. thanksPriti says: April 22, 2015 at 5:07 pm
I would like to know also from which site I can do the course and how to proceed.Harsh says: April 26, 2015 at 6:27 pm
Hi Kunal .. Such a nice info you have here and others too. Need your help to understand following few points. 1) I have been working in Supply Chain industry for past 2 years 2) My profile is mostly into B2B and Channel sales 3) As I have a technical background and have interest in Analytics I want to change my domain 4) Very recently I came across SAS Certification, so I'm planning to do it 5) Considering my exp. sales what do you suggest SAS or R , I also want to add that I dnt liberty of leaving present JOB and do certification and serach for job.. I want something which I can pursue while doing my job 6) If SAS then which module should I opt for ? I could see many modules in SAS .. plz advice.naina says: April 27, 2015 at 1:22 am
Hi Kunal, Very interesting. I am in the job market in the SF Bay area and have got myself Python certified. But I am at a loss because I do not know SAS and all the banks I am targeting ask for SAS skills. For them my Python skills are of no use at this pt. Would you have an analysis on jobs in Banks and Financial Institutions in the US on SAS vs R vs Python and its market shareManikandan T V says: April 28, 2015 at 4:59 am
I felt python scikit has better documentation compared to R. Also for handling unstructured data python seems better. Any other views if you have please share.Paramdeep Singh says: April 29, 2015 at 11:11 am
Hi Kunal/Guys, I am going for a masters in public policy course in which i'll be learning Quantitative methods. The university is providing SAS as a part of course. I want your advice that whether I should take an extra course in R also and how you see this combination of SAS and R 2 years down the line. I have working experience in IT field as a programmer. RegardsJulie says: May 16, 2015 at 6:54 pm
Hello Kunal, Such an amazing read! I need some suggestion. I have worked as a Java Developer after my Masters degree for a year. Then unfortunately I have had a gap of 4 years. I am looking to enter into the job market now. But with my little experience, what should be the best path to enter into the job market? I have been looking into Certification courses in Business Analytics, Big Data, also many courses in R Programming, SAS, MicroStrategy, Hadoop. A few friends have suggested MicroStrategy. What is your take on it considering I am from Delhi? Thank you in advance. Julie.Gaurav Tiwari says: May 19, 2015 at 5:48 pm
Kunal sir, I am a b.tech grad (fresher) and want to start a career in analytics. Which certification shall i take and from where...as well as what are the entry level chance here.. Hope for a helpful reply. Thank YouSujit Mendon says: May 26, 2015 at 6:40 am
I agree with Kunal but still SAS is dying. R and python is in use by many start-up companies and lately many large organizations are shifting towards R. So again the future of SAS is questionable according to my opinion.Rajesh Rajamani says: May 27, 2015 at 3:51 am
I have used both R and SAS so far . But found R to be much simpler to adopt. Being an open-source tool it has end-less possibilities . Probably my views might change after I learn Python ..Steve D says: June 03, 2015 at 8:53 pm
R is not a low level programming language, it's high level - I'd even say it's higher than Python or SAS because you don't have to define data types. This is part of the reason it is slow. Also saying SAS is easier to learn because of the SAS SQL Proc seems odd - SQL is easier to learn, not SAS and R has the SQLDF package so you can write SQL with R and pandasql with Python in the same wayraja says: June 17, 2015 at 6:31 am
Hi i heard about python and r, but im not familiar with sas, in general , what is sas ,how does it help to projects, is it a programming lang or tool? my doubt is we use php,java for web applications building. how sas use? and its conecpts ?Sachin Khanapur says: June 23, 2015 at 9:52 am
I am too young in this field to contribute to this discussion but i must admit that every word on this discussion page is worth reading for some one like me who is planning to start a career in analytics. Thank you Kunal and all the participantsNikhil says: July 09, 2015 at 7:40 pm
Hi Kunal, Thanks for this post! I got what I was looking for. I have a total of 5.5 yrs of experience in Mainframe Operation. There is no programming involved in my job profile. I have decided to move on to the Analytic's side. Can you please help me out to choose a better one out of it from the job perspective and ease of learning, considering my past experience. Will the market accept a person with no experience and only certification on SAS? Thanks and regards, NikhilAaron says: August 06, 2015 at 5:38 pm
SAS, Python and R are useless in Derivative Modeling. C++ is King in Modeling Derivatives.Aaron says: August 06, 2015 at 5:53 pm
Matlab is still awesome! I love it!Upendra Chansoria says: September 18, 2015 at 5:57 am
Nice Post, Now these days SAS is more popular than other one.Rajendra says: September 20, 2015 at 10:42 am
Hello everyone and special thanks to Kunal for above discussion, it is really help full for beginners like me more over most of my doubts are clear but still I need some advise I have 9 years of experience in technical area only using one of the legacy systems now I want to switch to another technology for that I had decided R+SAS but where ever I going for the enquiry every institute is giving different picture I want to be at technical side only here is my confusion do I need to learn sas data science along with R+SAS or it is enough if I will learn R+SAS technology,please do advise this will be great help for me.Rasaq says: October 09, 2015 at 2:46 pm
You are completely right. Base SAS is now free as SAS Studio.Ritesh says: October 16, 2015 at 5:44 am
Well Said!! Thank You kunal Sir for the comparision. I have found the same comparison on a lot other websites. SAS is leader. But is there any way to learn SAS on beginners level and somewhat moderate level for free, as the training by SAS institute at pretty steepJC Prenner says: October 19, 2015 at 8:55 pm
You get what you pay for$$$ SAS wins!!! in most areas: Big Data #1 - Pages and doesn't run in memory so it doesn't run out of memory and crash Stats #1 - Verified by FDA its a standard, R contains a lot of junk written by "anyone" and not verified, over 45 regression packages ! which is correct to use? Visual Analytics #1 - R and Python does not exist Parallel CPU - SAS , R is just starting on H2O.... If you want a little school or small business analytics then go for free stuff like R, Python, Octave, the list is long.JC Prenner says: October 19, 2015 at 8:57 pm
Also, SAS is free to learn, download it and see: http://www.sas.com/en_us/learn/analytics-u.htmlHitesh Arora says: November 07, 2015 at 11:26 pm
Hi Kunal Did you notice that this article is a copy of your article? https://www.linkedin.com/pulse/sas-vs-r-python-carlos-r-bueno Cheers HiteshKunal Jain says: November 08, 2015 at 7:52 pm
Thanks Hitesh for highlighting this. I have already informed the Linkedin team about it. Hopefully it would be actioned quickly. Regards, KunalGP says: November 09, 2015 at 2:32 pm
Hi, I'm pursuing a management (analytics specialised) course. I'm interested in financial sector. Which Tool should I go for? This is my first year of the two year course. I want to get a good knowledge of the analytics tool, before going for my second year. How long will this take to excel in any tool. If you can provide the links for free tutorial for R, that'd be really helpful. P.S. I've sent you request on ur linkedin. Pls accept.Afsana says: November 15, 2015 at 4:27 pm
Hi, I am a fresher, MSc Statistics and also have programming background. Any help regarding learning SAS ????Akshay says: November 18, 2015 at 3:08 pm
Hi Kunal, I am working as SAP ABAP developer with 2 years of experience. I want to move to Analytics side. So, is there anything I can do in analytics so that my SAP experience does not get waste ?Akbar says: November 22, 2015 at 5:11 am
Hi All, I have 10+ Years of experience in Oracle Products. (SQL/PLSQL,Forms,Reports) with PMP certification. Now, to increase my salary i am looking out for options. Found Data analyst is good. Now, i am planning to switch (carrying my previous experience) to Data analytics – Really confused on where and how to start ? Should i go with SAS or R ? Also if anyone can suggest which will be more suitable for me ? Learning Path ?saurabh says: December 10, 2015 at 10:20 am
hi kunal sir My self saurabh kumar I completed my B.E in stream of information science in the year 2015 with 50% I am interested in doing course on SAS or R programming.. kindly suggest me which course is better for me as fresher, and in which field more job is avaliable kindly suggest me the good institute for SAS OR R programming, IN MUMBAI OR BANGALORE THANK U [email protected]\saurabh says: December 12, 2015 at 7:20 am
Dear sir My self saurabh kumar I completed my B.E in stream of information science in the year 2015 with 50% I am interested in doing course on SAS or R programming.. kindly suggest me which course is better for me as fresher, and in which field more job is avaliable kindly suggest me the good institute for SAS OR R programming, IN MUMBAI OR BANGALORE THANK U [email protected]Pujitha says: February 07, 2016 at 5:49 am
Dear kunal, I am Ece graduate with no work experience ,I want to enter into data analytics field, I don't have any coding experience ,I know C, What languages should I learn for a beginner like me,and to get job in the marketKunal Jain says: February 07, 2016 at 10:48 am
Start with R. Even though Python would be easier and language for future, currently there aren't many jobs on it in India Regards, Kunalbahram says: February 14, 2016 at 2:55 pm
Thanks for this.Sirisha says: February 22, 2016 at 2:18 pm
Dear Kunal, First of all i would like to thank you for above article and would need some suggestion here. Currently am working as QA engineer with 2 years of total experience. There is no programming involved in my job profile( 2 years back I learnt Java, but dont have work experience on coding ). Now I have decided to move on to the Analytics side for better opportunities. should i start with R or SAS ? Kindly advise based on my current experience.mayk7 says: March 11, 2016 at 2:28 pm
Hi Kunal...recently I came across this "Tibco Spotfire" analytics tool..how it is & what's the future of this tool. ? Regards Mayur P.Kuldeep Singh says: April 20, 2016 at 6:44 am
Hello Every One, Feeling to good to be a part of this discussion. I would like to add "Odoo". Odoo is ERP created using Python and is giving a strong competition to other ERP like SAP, R, MATLAP etc. Regards, Kuldeep SinghAbdul-Malick says: August 22, 2016 at 6:51 pm
Just a high level overview from my experience. I started out using SAS from university to my first job. Then got into R and experienced it's power and started not too long ago with python. Doing my research last year on this topic it was clear that SAS is dying... no longer in universities, too expensive and also not a wise choice for startups. SAS seems to cater mainly for credit risk analysts.... period. R and python however goes way beyond and into Data Science and many other opportunities. Not to sound harsh but in my view SAS shouldn't even be in competition with R and python. My primary language is R at this moment but I'm having more fun with python and I guess it will become my favourite as python feels more powerful iro it's diversity and applications. My advice to all Next Generation analysts is R or Python! Learn them both. After learning R first python become much easier to understand.Venkat says: October 02, 2016 at 6:24 pm
R and Python are opensource frameworks, Eclipse provides the ability to download plugins for both.once development platform for both , hence I don't have to haggle around with IDEs for development , I guess that's the beauty of using R and pythanAmanda Scott says: October 07, 2016 at 7:51 am
Great post. Thanks for sharing.Evander says: November 06, 2016 at 1:59 pm
I agreeAnkit Gupta says: September 12, 2017 at 6:02 pm
Considering the overall score, it seems that python is a better option for data science community. Nicely explainedAshok kumar says: September 12, 2017 at 9:07 pm
Thank you for the above nice article In my opinion, within a couple of years MATLAB will overtake SAS, because it is cheaper than SAS and machine learning, deep learning support are available whereas SAS doesn't have this kind of support.Bhavani Shankar says: September 12, 2017 at 10:08 pm
How about licensing system Most of the "R" CRAN packages are coming with GPL Licence (GNU License) with Copyleft option where as Python packages are coming with Apache, MIT or BSD LicenseShankaran says: September 12, 2017 at 10:20 pm
Have read the 2014 version of this and can now see the massive move in market. As a Analytics beginner with SAS, this article was very helpful.Vrutti Tanna says: September 13, 2017 at 11:17 am
Hi, The article is very useful, i am a fresher and working as data scientist since 3 and half months, i know all 3 tools but i am very very good at python and have good hands on python but i am average in R. do you think i need to learn R or python is ok for me to carry on my career as data scientist ? Need your suggestion Thanking you in advance.Walt Smith says: September 13, 2017 at 12:40 pm
Prophesies of the death of SAS are like Mark Twain's famous quip "the reports of my death are greatly exagerated." Here we are some 3 years after the projections made here of the death and/or decline of SAS and SAS is still the dominant analytics platform. I started learning SAS some 34 years ago (my first language in school was Fortran using punch cards). Over time I have learned and usesd professionally some 33 languages. And I'm currently working on python and R. I work in a forecasting department of some 30+ forecasters (largest forecasting group I've ever worked in). I know of no corporation that is reducing their use of SAS and certainly none that are switching from SAS to some other language. There may be some out there but I don't know of any. When we are hiring, analytic skills (econometrics, math, stat, etc) are dominant and if the person already knows SAS then that is preferred. Other skills can make the person even more valuable. Java, HTML, CSS, python, etc. (We don't even have R installed on our server.) SAS is not dead and will not be dieing anytime soon. Why? Because it provides comprehensive data handling, well tested and dependable basic to advanced analytics and, in the hands of an expert, a rapid development cycle. It doesn't yet make machine learning as easy as python does (which is why I'm learning python) and it has some other minor flaws but what language doesn't? That's my 2 cents worth based on 34+ years experience.AM says: September 13, 2017 at 12:54 pm
Hi Kunal, I am working in an IT company as Business Analyst. I would like to know which course/certification would be good for me to keep my job secure in IT industry. Could you please guide?Bhargav Pandya says: September 13, 2017 at 4:14 pm
As per the recent KDnuggets poll on R vs Python. It has been shown that Python has overtaken R in data analytics. look at here http://www.kdnuggets.com/2017/09/python-vs-r-data-science-machine-learning.html http://www.kdnuggets.com/images/google-trends-python-data-science-r-2012-2017.jpgJohnson says: September 13, 2017 at 7:22 pm
@ Jean-Marc. I like the your commentPaul Kostoff says: September 13, 2017 at 10:49 pm
I actually think R's steep learning curve is overblown. The fact that there is one widely accepted IDE (RStudio) is immensely helpful. Installing packages through RStudio is a breeze. Python may lead in deep learning, but Keras and TF are now available in R. All other ML tasks are easily performed in R as well. Anyway, end of story, you should work on becoming proficient in R and Python! They are both amazing languages for data science related tasks, and only continue to improve and grow.heisenberg says: September 14, 2017 at 11:00 pm
For larger organizations, still SAS is best option. Paying license fees gives them some security. SAS have to test every release rigorously, if you get any bugs you will get support from SAS team. Anything goes wrong in production you can blame them. But for R and Python it is not the case, some bugs could make your life hell and you can't blame anyone. Banks and Insurance companies can not afford things like this, so they still go with SAS. I have not used SAS much, but this are my personal views.Sanjay Sane says: September 14, 2017 at 11:35 pm
Absolutely True! R is simpler to learn than SAS in my view.Aparajit says: September 15, 2017 at 1:20 pm
Hello Kunal, Thank you for the article. It is very nice, clears a lot of doubts. I am a 3rd year student pursuing btech. In today's date if you compare these languages what will be your opinion? And please share your opinion regarding the 2019 industry trends, what do you think which language will provide good jobs in India and outside also?Yogalakshmi S says: September 18, 2017 at 12:05 pm
Well explained! just thought to share one thing with this community. We can access free SAS online compiler on web without even need to install it in your machine. I am practicing SAS with that only. As I have started just now to get my hands dirt with it, I am not able to say its pros and cons right nowSanjay Kumar says: September 20, 2017 at 11:25 am
I think ease of learning should not be a criteria for comparison. If a particular tool is useful but difficult to learn, a professional should go ahead and learn it rather than choose an easier option.Dalon says: October 03, 2017 at 8:11 am
Thank you Sajan, this is good overview.Mansi says: November 01, 2017 at 12:00 pm
Superb Article!!Yujung says: December 01, 2017 at 7:17 am
Great! This is one of the most cohesive and comprehensive article I have read on this subject. Thanks a lot.Jayaraj Chanku says: March 07, 2018 at 9:19 pm
Hi Kunal, Excellent post. I think R and python are awesome. Thanks for sharing this.Prachi says: April 05, 2018 at 12:00 pm
Great comparison. I completed SAS certification but still struggling for Job as everyone wants to experience in data analysis field. So for job category, I think we need to study all three languages.Aishwarya Singh says: April 05, 2018 at 12:21 pm
Hi Prachi, Thank you for the feedback.Oli Plaistowe says: April 27, 2018 at 12:16 am
Nice post I can’t help think why you didn’t include WPS in this analysis, especially when it takes out the cost equation. WPS has its own sas compiler and infact allows the user to combine languages of sas,r and python in a single program. WPS is the only product on the market that provides all three language execution with visual programming and machine learning. Another consideration for the future is the true cost of open source. Ad-hoc analysis you can play with many packages but as soon as you want to deploy in production there is a lot of hidden costs around governance of the package and code. Keep them coming! Oli