If some one would ask me to mention 2 most important libraries in Python for data science, I’ll probably name “pandas” and “scikit-learn”. Pandas for the capability to read datasets in DataFrames, exploring and making them ready for modeling / machine learning and Scikit-learn for actually learning from these features created in Pandas.
While there are quite a few cheat sheets to summarize what scikit-learn brings to the table, there isn’t one I have come across for Pandas. Hence, we thought of creating a cheat sheet for common data exploration operations in Python using Pandas. If you think we have missed any thing in the cheat sheet, please feel free to mention it in comments.
The PDF version of the sheet can be downloaded from here (so that you can copy paste codes)
You can keep this cheat sheet handy while performing data exploration. Download the PDF Version here.