What is deep learning and why is it getting so much attention?

Kunal Jain 24 May, 2020 • 4 min read

A few days back, the content feed reader I use showed that 2 of its top 10 articles were on deep learning.

That is when I realised I needed a better understanding of what deep learning is. I probably first noticed the term sometime late last year, and it has grown in its presence around me since then. I wanted to make sure this was a global phenomenon and not just me being served specific content based on my searches or past history. So, I pulled up Google Trends for deep learning. This is what it showed:

Clearly, I was catching up with a new trend. So, let me summarize my findings and views on the topic, based on what I have read in the last few days.

P.S. You might have figured out by now that I am not an expert in deep learning (phew!). But I am more motivated than most people to learn about it. I hope to provide a meaningful summary for beginners and a few thought-provoking questions for the experts. If you have any questions or opinions on the topic, please add them in the comments below.

What is Deep Learning?

Deep learning is probably one of the hottest topics in machine learning today, and it has shown significant improvement over some of its counterparts. It refers to a class of algorithms that use multi-layered neural networks, trained with or without labelled data, to achieve these remarkable outcomes. Here is a simple illustration from Analytic Store’s blog:

A large number of pixels are fed to the network as input, after which the network learns and evolves to recognize higher level features like faces and cats.
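
To make the idea of pixels flowing through stacked layers a little more concrete, here is a minimal sketch in Python. It is only an illustration under stated assumptions: the 4x4 image, the layer sizes (16, 8, 4, 1), the random untrained weights and the helper names (dense_layer, random_layer, forward) are all made up for this example and do not come from any library or from the systems mentioned in this post. A real network would learn its weights from data; this sketch only shows a forward pass.

```python
import math
import random


def sigmoid(x):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def dense_layer(inputs, weights, biases):
    """One fully connected layer: weighted sum of the inputs, then a sigmoid."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def random_layer(n_in, n_out, rng):
    """Hypothetical helper: untrained random weights and zero biases."""
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def forward(pixels, layers):
    """Pass the pixel values through each layer in turn."""
    activations = pixels
    for weights, biases in layers:
        activations = dense_layer(activations, weights, biases)
    return activations


rng = random.Random(0)

# A made-up 4x4 grayscale "image", flattened to 16 pixel intensities in [0, 1].
pixels = [rng.random() for _ in range(16)]

# 16 pixels -> 8 hidden units -> 4 hidden units -> 1 output score.
layers = [random_layer(16, 8, rng), random_layer(8, 4, rng), random_layer(4, 1, rng)]

score = forward(pixels, layers)[0]
print(f"output score: {score:.3f}")
```

Each layer only sees the outputs of the layer before it, which is why the later layers can respond to combinations of simpler features found by the earlier ones.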

Here are a few achievements driving attention to this area:

Amazing accuracy on quite a few Kaggle competitions – Dogs vs. Cats image recognition (98.9% accuracy), Saving the whales problem (98% accuracy)
Ability to learn and identify cats by using YouTube videos without any supervision

The following events suggest how actively companies are looking for people with knowledge of deep learning:

Google’s acquisition of DeepMind Technologies
Google hiring Geoffrey Hinton, one of the thought leaders in this space (you can check out his course on neural networks on Coursera)
Facebook hiring Yann LeCun, a former postdoctoral researcher in Geoffrey Hinton’s lab, to lead its AI lab
Baidu hiring Andrew Ng, another pioneer in the field (and co-founder of Coursera).

Applications of deep learning:

If you have not figured it out already, deep learning finds applications in the following areas:

Image recognition (e.g. Tagging faces in photos)
Voice recognition (e.g. Voice based search, Siri)
Pattern detection (e.g. Handwriting recognition)

But neural networks have been around for decades. What is rekindling this interest now?

Yes, neural networks have existed for ages. Interest in them peaked in the 1980s and 90s and then died off because of their inherent problems and their black-box-like approach.

There are a few reasons why this is happening now. The biggest among them is the drop in computational costs. Even so, the classification of cats through unsupervised learning on YouTube videos was achieved by deploying 16,000 processor cores in Google’s lab! The cost of deploying these algorithms is not small, even by today’s standards.

Resources in deep learning:

Here is a list of some good resources to start reading / following, if you are interested in this area:

www.deeplearning.net along with its tutorials
Geoffrey Hinton’s neural network course on Coursera
Google+ community
Yann LeCun’s overview of deep learning, with Marc’Aurelio Ranzato
Complete learning path for deep learning

Questions in my mind:

Some of the questions that remain in my mind are:

This looks like a huge black box to deal with (something like a scaled-up version of random forests), which might do well in data science competitions but fails to deliver any business understanding. Will it be useful and impactful for the larger community, or will it stay within the labs of data giants working on huge data sets? For example, the model might classify a particular person as a likely defaulter but would not explain why it is doing so.
The methods have shown some flaws that are difficult to explain. According to a recent study, algorithms were able to classify the images on the left in the picture below, but were not able to classify the images on the right, which may seem very similar to human eyes. (Source: KDNuggets, original case study)

Overfitting / choosing the right algorithm: given the nature of these algorithms, you are building multiple middle (hidden) layers into your architecture. These would work well on problems with infinite (or very large) degrees of freedom. However, if you have limited degrees of freedom, you might end up with an overfitted model, in which case it is probably time to go back to traditional methods.
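
The overfitting concern in the last point can be illustrated with a small sketch. This is not a neural network: as a stand-in, it compares a straight line (two parameters) with a degree-5 polynomial that has as many parameters as there are training points. The data, the fixed noise pattern and the function names (fit_line, fit_interpolant, mse) are assumptions invented for this example.

```python
# Six noisy training points from a made-up "true" relationship y = 2x + 1.
train_x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
noise = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]
train_y = [2 * x + 1 + e for x, e in zip(train_x, noise)]

# Noise-free test points that sit between the training points.
test_x = [0.5, 1.5, 2.5, 3.5, 4.5]
test_y = [2 * x + 1 for x in test_x]


def fit_line(xs, ys):
    """Least-squares straight line: only 2 free parameters."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def fit_interpolant(xs, ys):
    """Lagrange polynomial through every point: one parameter per data point."""

    def predict(x):
        total = 0.0
        for i, (xi, yi) in enumerate(zip(xs, ys)):
            term = yi
            for j, xj in enumerate(xs):
                if j != i:
                    term *= (x - xj) / (xi - xj)
            total += term
        return total

    return predict


def mse(model, xs, ys):
    """Mean squared error of a model's predictions."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


line = fit_line(train_x, train_y)
flexible = fit_interpolant(train_x, train_y)

for name, model in [("line", line), ("degree-5 polynomial", flexible)]:
    print(
        f"{name}: train MSE = {mse(model, train_x, train_y):.3f}, "
        f"test MSE = {mse(model, test_x, test_y):.3f}"
    )
```

On this toy data the flexible model reproduces every training point exactly, yet it does worse than the plain line on points it has not seen. That is the pattern to watch for when a model has far more freedom than the problem needs.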

End notes:

I have to admit, I started my research from a place where these algorithms looked more like buzz. But given the attention from data giants, the success in some of the Kaggle competitions and the falling cost of computation, I am starting to believe that the hotness of the field is justified.

Whether it is actually justified or not, only time will tell. In the meantime, I’ll continue with my research and keep you posted on how things are panning out at my end. And to gain expertise in working with neural networks, try out our deep learning practice problem: Identify the Digits.

What do you think about deep learning? Do you think it will change the way people look at machine learning today? Or do you think this might just be another hype? How would people solve some of the challenges I have mentioned in the post? Do let me know your thoughts through the comments below.

If you like what you just read and want to continue your analytics learning, subscribe to our emails, follow us on Twitter or like our Facebook page.

Kunal Jain 24 May 2020

Kunal is a postgraduate from IIT Bombay in Aerospace Engineering. He has spent more than 10 years in the field of data science. His work experience ranges from mature markets like the UK to a developing market like India. During this period he has led teams of various sizes and has worked with various tools like SAS, SPSS, Qlikview, R, Python and Matlab.

Responses From Readers

pradeep 24 Jun, 2014

Thank you Kunal for sharing this and the links related to deep learning. Very interesting. There is so much new stuff to read, understand and apply.

Kumar Chinnakali 24 Jun, 2014

Thumbs up AV team! I am on a machine learning journey with Coursera. I have heard the term deep learning a few times recently, but thought it could be the deepest version of machine learning. Now I am a little clearer thanks to your blog. To conclude, I think deep learning is not hype and it will change the way people look at machine learning today.

srikar 24 Jun, 2014

Great job Kunal. AV is a great source for learning stuff. Keep up the good work. BTW, please send me a note on my gmail. I still plan to meet you in person. Will be in Bombay in August. Srikar

Kunal Jain 28 Jun, 2014

Done!

Rahul 25 Jun, 2014

I think there is still very much to learn about it. Also, deep learning is the early term; the actual name is something else which I'm not able to recall right now. As per the trend nowadays, it's just following up. I think there can be many more such terms which are still too vague to discuss.

Ashish 25 Jun, 2014

Sir can you please tell me how analytics is associated with machine learning?

Kunal Jain 28 Jun, 2014

Ashish, Machine learning is a branch of artificial intelligence (AI) based on two things – mathematical algorithms and automation. The idea is to automate the building of analytic models that use algorithms to “learn” from data in an iterative fashion. So, in a way machine learning tries to automate what an analyst does. Regards, Kunal

ton 29 Jun, 2014

Ya seeing as Ray Kurzweil just got hired in 2009 by Google to do hierarchical hidden models, the same guy that made the IBM Watson and Nuance who sold it to IBM. Ha, I mean wow his book on the singularity and how to create a mind was astonishing.

Shashi Sathyanarayana 23 Jul, 2014

Neural networks figure prominently in the deep learning revolution. If you have ever wondered how they work, here is a tutorial and a celebration of the discovery of an important how-to:

Shashi Sathyanarayana 10 Sep, 2014

Here is my comment again, this time with the link working. Neural networks figure prominently in the deep learning revolution. If you have ever wondered how they work, here is a tutorial and a celebration of the discovery of an important how-to:

Kumar Chinnakali 11 Sep, 2014

Team, once upon a time (three months ago), Analytics Vidhya helped me gain a deeper understanding of what "deep learning" is. Tons of thanks. I am pleased to share with you that while learning more about deep learning, I found that Deeplearning4j is the first commercial-grade deep-learning library written in Java. It is meant to be used in business environments, rather than as a research tool for extensive data exploration. Deeplearning4j is most helpful in solving distinct problems, like identifying faces, voices, spam or e-commerce fraud. For more details...http://deeplearning4j.org/ Yours in learning, Kumar Chinnakali

krikar 08 Dec, 2015

Is deep learning supervised or un-supervised ?
