Pranav Dar — Updated On May 31st, 2020
Beginner Data Science Deep Learning Github Listicle Machine Learning Python


I live GitHub! Not only can you follow the work happening in different domains, but you can also collaborate on multiple open source projects. All tech companies, from Google to Facebook, upload their open source project codes on GitHub so the wider coding / ML community can benefit from it.

But, if you are too busy, or find following GitHub difficult, we bring you a summary of top repositories month on month. You can keep yourself updated with the latest breakthroughs and even replicate the code on your own machine!

This month’s list includes some awesome libraries. From Google Brain’s AstroNet to an artificial neural network visualizer, we have curated a list of unique repositories that will expand your machine learning horizons.

Are you ready? Let’s look at last month’s top 7 then!

You can check out the top 5 repositories that we picked out in January here and February here.


Person Blocker

‘Person Blocker’ is a python library that automatically blocks out entire people in images using a pre-trained neural network. The algorithm uses Mask R-CNN that is pre-trained on the MS COCO dataset. And the cherry on top? No GPU required!

And not just people, the algorithm is able to block out entire objects as well. The algorithm recognizes 80 different types of objects, including vehicles, animals, electronic gadgets, among other things.

You can read more about this library on Analytics Vidhya’s blog here.



                                                                                         Source: Yahoo

Back in December 2017, the Google Brain team revealed it had discovered 2 new planets by applying Astronet – it’s deep neural network model for working on astronomical data. It was a monumental discovery that went to show the far-reaching impacts of machine learning in today’s world.

Now, Google Brain has released the entire code that went into making that technology and they’ve made it available for everyone. The model is based on a convolutional neural network (CNN).

We have you covered on this AVBytes article regarding AstroNet.


ANN Visualizer

ANN Visualizer is a python library that enables us to visualize an Artificial Neural Network using just a single line of code. It is used to work with Keras and makes use of python’s ‘graphviz’ library to create a neat and presentable graph of the neural network you’re building.

Check out Analytics Vidhya’s detailed coverage of this awesome library here.


Fast Pandas

Any python novice will tell you how flexible and powerful the pandas library is. Being a data scientist, you need to be equally flexible and think of different ways to approach a problem. The ‘Fast Pandas’ repository aims to benchmark the different available methods in such situations.

This is a very useful library and one we highly recommend trying out at least once.



TensorFlow.js is an open-source library that you can use to train and build machine learning models in your web browser, using JavaScript and APIs. If you’re familiar with Keras, the high level layers API will seem very familiar to you.

It’s available with GPU acceleration and also automatically supports WebGL. You can import existing pre-trained models and also re-train entire existing ML models within your web browser.

Check out our coverage of this here.



Caffe64 is a simple, small yet incredibly functional neural network library. We all know how onerous it is to install a neural network library. According to the developers, Caffe64 ditches all the hard work and is the “easiest to compile and most lightweight neural network library, period“.

If you’ve used caffe before, this will be a piece of cake for you!


TensorFlow Hub

TensorFlow Hub is a library to foster the publication, discovery, and consumption of reusable parts of machine learning models. In particular, it provides modules, which are pre-trained pieces of TensorFlow models that can be reused on new tasks. By reusing a module on a related task, you can:

  • train a model with a smaller dataset
  • improve generalization
  • significantly speed up training


Have you used any of these libraries before? How was your experience? Let us know in the comments section below!


Participate in the McKinsey Analytics Online Hackathon to win an all-expenses paid trip to an international analytics conference!

About the Author

Pranav Dar
Pranav Dar

Senior Editor at Analytics Vidhya. Data visualization practitioner who loves reading and delving deeper into the data science and machine learning arts. Always looking for new ways to improve processes using ML and AI.

Our Top Authors

Download Analytics Vidhya App for the Latest blog/Article

13 thoughts on "Top 7 Data Science & Machine Learning GitHub Repositories in March 2018"

Frank francisco
Frank francisco says: April 12, 2018 at 11:25 am
Check out etherscan ml for a solid blockchain machine learning repo built on ethereum. Not related but a big fan. Reply
Data Science Training In Pune
Data Science Training In Pune says: April 12, 2018 at 2:54 pm
In this article shows multiple domains...plz moreover information about data science. Reply
Sanil says: April 12, 2018 at 7:30 pm
Analytics Vidhya is doing great job of making this information easily available. Requesting to post more R related stuff too. Thanks. Reply
Jacob says: April 13, 2018 at 1:28 am
How did you select these as the "top 5"? Is this data driven and if so how precisely is it data driven? Reply
Pranav Dar
Pranav Dar says: April 13, 2018 at 10:00 am
Hi Jacob, There are a few factors that go into selecting the top GitHub repositories each month but the primary one is that it should benefit our data science and machine learning community. Then we look at what language was used, what real world cases or uses there are, etc. Reply
Pranav Dar
Pranav Dar says: April 13, 2018 at 10:11 am
Hi Sanil, Most of the breakthroughs and applications in deep learning are happening thanks to python related libraries. However we do cover R as well. Do check out our AVBytes section where we post updates and developments in python, R, SAS, Tableau, etc. Reply
Rahul says: April 13, 2018 at 11:51 am
Is there python library available to analyse high-dimensional hyperspectral data? I know about spectral-python, but it is not that good. Reply
Don Carpenter
Don Carpenter says: April 13, 2018 at 5:50 pm
This is garbage. Sorry but you say it yourself, Person blocker is JUST MaskRCNN with COCO and a filter. It achieves nothing, brings nothing new and is honestly useless as-is. Reply
Pranav Dar
Pranav Dar says: April 16, 2018 at 3:31 pm
Hi Rahul, I am also aware of the 'spectral-python' library only. You can try 't-SNE' for python in case that is of any use in this area. Reply
Pranav Dar
Pranav Dar says: April 16, 2018 at 5:54 pm
Hi Don, Thanks for your feedback. This can be used by someone in the image processing industry, to blur out images, sensor things in a video, etc. Of course it will take fine tuning to make it industry ready but this lays the groundwork for it. When we curate this list, we look at multiple factors like how many stars does the repository have and applications in the field, among other things. The idea is that readers can understand what's trending, and can replicate the code on their own machines. They can improve on the code, understand how it works, and learn from it, regardless of the background they are coming from. Reply
Varsha Kulkarni
Varsha Kulkarni says: April 18, 2018 at 12:32 pm
Nice article Reply
Francois du Toit
Francois du Toit says: July 14, 2018 at 1:57 pm
I find the pic blocker example very useful. Would have not known about this ground work if it was not listed. I will definitely apply this in further research. This gives me a fott in the door to read up what has been done by showing practical example. Reply
Pranav Dar
Pranav Dar says: July 17, 2018 at 9:35 pm
Thanks for reading through, Francois! I found that one to be remarkably useful as well. Let me know if you use it, would love to read about it! Reply

Leave a Reply Your email address will not be published. Required fields are marked *