jissdeodates — September 8, 2021
Beginner Computer Vision Image Libraries Python

This article was published as a part of the Data Science Blogathon


We all know the phrase: “Every picture can tell us a story”. There could be a lot of information hidden inside an image and we could interpret it in different ways and perspectives. So, what is an image, and how to deal with an image? In simple words, we can say that an image is a visual representation of something and it could be dealt with easily using computer vision (from a machine learning perspective). What about videos? A video could be described as a recording of a moving set of images or continuous frames. But we can deal with videos using computer vision in another blog! 🙂

Some of you may already know about Computer Vision by now and can skip this paragraph, and but for those who do not have an idea about it, Computer Vision, mostly known in the abbreviated form ‘CV’, could be said as a field in Artificial Intelligence that extracts useful information from digital images, videos, etc. ‘CV’ has got a lot of real-world applications in almost all domains in this modern era including the medical field, automotive, manufacturing, agriculture, etc.

Image Processing With OpenCV | computer vision

Image 1


As a human, who is currently reading this post, you would be able to distinguish between the things or elements present in an image. But, what about machines? Could they see and think to distinguish it on its own? NO? then let’s make your machine do it.

In this blog, you would be familiarizing yourself with the basics of computer vision using python.

I believe you have already installed Python Shell/Jupyter Notebook/PyCharm or Visual Studio Code (to name a few) to program in python. Let’s install the widely used package (OpenCV) to get started with and we are going to run the codes in each cell in a Jupyter Notebook.

Installing OpenCV Package for Image Preprocessing

OpenCV is a pre-built, open-source CPU-only library (package) that is widely used for computer vision, machine learning, and image processing applications. It supports a good variety of programming languages including Python.

Install the OpenCV package using:

pip install opencv-python


pip install opencv-contrib-python

Run any of these commands on your terminal or if you are using Anaconda Navigator – Jupyter Notebook, you can change “pip” with the “conda” command and install the same.

Importing the Package

What is a package in Python? A package in Python is a collection of modules that contain pre-written scripts. These packages help us to import modules entirely or even individually. We can import the package by calling the “cv2” module like this:

import cv2

Reading an Image

Digital images could be classified into; colour images, grey-scale images, binary images, and multispectral images. A color image includes the color information for each pixel. Images having shades of grey as their only color are grayscale images while a binary image has exactly two colors, mostly black and white pixels. Multispectral images are images that capture image data ranging across the electromagnetic spectrum within some specific wavelength.

Let’s get back to the coding part and read an image, for example, the image is shown below:

reading image | Image Processing With OpenCV

This is an image of a mandrill. I am currently reading the image from my local directory.

# cv2.imread(path_to_image_with_file_extension, flag)

The usage code look likes this:

img = cv2.imread("mandrill.jpg", 1)

Here we are using the “imread” method of the cv2 package to read the image and the first parameter corresponds to the path of the image with its filename and extension, and the second one is the flag that you can set which tells the way, how to read in the image. If you like, you can replace the absolute path to the image here and try reading it from your local computer or even from the internet! If the image is present in your current working directory, you only need to specify the image name with its extension type.

As far as the second parameter is concerned, if you like to read it as a grayscale image, you can specify the parameter to 0, -1 for reading the image as unchanged (reads the image as alpha or transparency channel if it is there) and by default, it is 1, as a color image. You can also try other parameters from this link:


under the ImreadModes.

Properties of an Image


Every image has a shape. The length of boundaries exhibited by the picture might be referred to as the shape i.e, the height and width. Now that you know how to read an image, how about we check the shape of the image?


is the basic way to print the shape of the image, but we can extract the shape using:

h, w, c = img.shape
print("Dimensions of the image is:nnHeight:", h, "pixelsnWidth:", w, "pixelsnNumber of Channels:", c)

to get a better understanding.

For the color and unchanged mode, it would return 3 values including the height, width, and the number of channels present in the image. If you have used the grayscale mode, the shape would be 2, which will return the height and width of the image, but then you just have to use h and w variables only (exclude using “c” ), else you might be getting a value error saying “not enough values to unpack (expected 3, got 2)”.


We can know the type of the image using the “type” method. Using this method helps us to know how the image data is represented. Run the code as follows:


The result might be like this:

as the image type. It is a multidimensional container of items of the same type and size. You can refer more to the N-dimensional array at the following link:


The data type of the image you have just read:

Since the image is an N-dimensional array, we can check the data type of the image:


Image pixel values:

We can think of an image as a set of small samples. These samples are called pixels. For a better understanding, try zoom in on an image as much as possible. We can see the same divided into different squares. These are the pixels and when they are combined together they form an image.

One of the simple ways to represent an image is, in the form of a matrix. We can even create an image using a matrix and save it! Will show you how, later in this post. Take a look at this picture below:


Image pixel values | Image Processing With OpenCV

Image 2

This picture is rather an example of how the matrix representation of an image looks like. On the left, is an image of Lincoln, in the middle, the pixel values are labeled with numbers from 0 to 255, denoting their intensity or brightness and on the right, the numbers in matrix form themselves. Each value in the matrix corresponds to a pixel, which is the smallest element of information present in an image. Check out the image pixel values by just printing the variable that you loaded the image!


Image Resolution:

Image resolution could be defined as the number of pixels present in an image. The quality of the image increases when the number of pixels increases. We have seen earlier, the shape of the image which gives the number of rows and columns. This could be said as the resolution of that image. Some of the standard resolutions are that almost everyone knows are 320 x 240 pixels (mostly suitable on small screen devices), 1024 x 768 pixels (appropriate to view on standard computer monitors), 720 x 576 pixels(good to view on standard definition TV sets having 4:3 aspect ratio), 1280 x 720 pixels (for viewing on widescreen monitors),1280 x 1024 pixels (good for viewing on the full-screen size on LCD monitors with 5:4 aspect ratio), 1920 x 1080 pixels (for viewing on HD tv’s) and now we even have 4K, 5K, and 8K resolutions which are 3840 x 2160 pixels, 5120 × 2880 pixels and 7,680 x 4,320 pixels respectively supported by ultra high definition monitors and televisions.

When we multiply the number of columns and number of rows, we can obtain the total number of pixels present in the image. For example, in a 320 x 240 image, the total number of pixels present in it is 76,800 pixels.

Viewing the Image

Let us see how to display the image in a window. For that, we have to create a GUI window to display the image on the screen. The first parameter has to be the title of the GUI window screen, specified in string format. We can show the image in a pop-up window using the cv2.imshow() method. But, when you try to close it, you might feel stuck with its window. So to combat that, we can use a simple “waitKey” method. Try out this code part in new a cell:

cv2.imshow('Mandrill', img)
k = cv2.waitKey(0)
if k == 27 or k == ord('q'):

Here, we have specified the parameter ‘0’ in the “waitKey” to keep the window open until we close it. (You can also give the time in milliseconds, instead of 0, specifying how much time it should be opened.) After that, we can assign the variable to act for closing the window when we press the ‘ESC’ key or the key ‘q’. The cv2.destroAllWindows() method is used for closing or deleting the GUI windows from the screen/memory.

Saving the Image

Before saving the image, how about converting the image to grayscale and then save it? Convert the image to grayscale using:

gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

Now we can save the image:

cv2.imwrite('Mandrill_grey.jpg', gray)

and check the image being saved in the current working directory. The first parameter corresponds to the name of the file in which the image is to be saved and the second parameter is the variable that contains the image (pixel information).

Extracting the image bit planes and reconstructing them

We can divide an image into different levels of bit planes. For example, divide an image into 8-bit (0-7) planes, where the last few planes contain the majority of information for an image.

bit plans

Image 3

For doing this, we can import two more packages:

import matplotlib.pyplot as plt
import numpy as np

If you get an error while importing any of the packages, you can install them using:

conda install -c conda-forge matplotlib 
conda install -c anaconda numpy

Now we are defining a function to extract each of the 8 level bit planes of the image.

def extract_bit_plane(cd):
    #  extracting all bit one by one 
    # from 1st to 8th in variable 
    # from c1 to c8 respectively 
    c1 = np.mod(cd, 2)
    c2 = np.mod(np.floor(cd/2), 2)
    c3 = np.mod(np.floor(cd/4), 2)
    c4 = np.mod(np.floor(cd/8), 2)
    c5 = np.mod(np.floor(cd/16), 2)
    c6 = np.mod(np.floor(cd/32), 2)
    c7 = np.mod(np.floor(cd/64), 2)
    c8 = np.mod(np.floor(cd/128), 2)
    # combining image again to form equivalent to original grayscale image 
    cc = 2 * (2 * (2 * c8 + c7) + c6) # reconstructing image  with 3 most significant bit planes
    to_plot = [cd, c1, c2, c3, c4, c5, c6, c7, c8, cc]
    fig, axes = plt.subplots(nrows=2, ncols=5,figsize=(10, 8), subplot_kw={'xticks': [], 'yticks': []})
    fig.subplots_adjust(hspace=0.05, wspace=0.05)
    for ax, i in zip(axes.flat, to_plot):
        ax.imshow(i, cmap='gray')
    return cc

Now we are ready to call the function.

reconstructed_image = extract_bit_plane(gray)

We have reconstructed the image (almost similar) using the last three bit planes i.e, the sixth, seventh, and eighth planes. And the result looks like this:

reconstruct image | Image Processing With OpenCV

How about we construct a small image on our own? Let’s try it now!

Constructing a small synthetic image

We can try to generate a synthetic image containing four concentric squares with four different pixel intensity values,

40, 80, 160, and 220.

con_img = np.zeros([256, 256])
con_img[0:32, :] = 40 # upper row
con_img[:, :32] = 40 #left column
con_img[:, 224:256] = 40 # right column
con_img[224:, :] = 40 # lower row
con_img[32:64, 32:224] = 80 # upper row
con_img[64:224, 32:64] = 80 # left column
con_img[64:224, 192:224] = 80 # right column
con_img[192:224, 32:224] = 80 # lower row
con_img[64:96, 64:192] = 160 # upper row
con_img[96:192, 64:96] = 160 # left column
con_img[96:192, 160:192] = 160 # right column
con_img[160:192, 64:192] = 160 # lower row
con_img[96:160, 96:160] = 220

The resulting image would be looking like this:


We can take look at using various filters and transformations on images in another article.

Thank you for reading the article.


Image 4

Feel free to add your suggestions under the comments and do share if you like this article. The complete Python programming code in Jupyter Notebook, mentioned in this article is available on my Github repository:



  1. Image 1 – https://seevisionc.blogspot.com/2013/09/computer-vision-is-everywhere.html
  2. Image 2-  https://towardsdatascience.com/everything-you-ever-wanted-to-know-about-computer-vision-heres-a-look-why-it-s-so-awesome-e8a58dfb641e
  3. Image 3- https://nptel.ac.in/content/storage2/courses/117104069/chapter_8/8_13.html
  4. Image 4- https://memegenerator.net/instance/21508026/willy-wonka-oh-so-you-created-a-computer-vision-algorithm-you-must-never-lose-the-remote



The media shown in this article are not owned by Analytics Vidhya and are used at the Author’s discretion.

About the Author

Our Top Authors

  • Analytics Vidhya
  • Guest Blog
  • Tavish Srivastava
  • Aishwarya Singh
  • Aniruddha Bhandari
  • Abhishek Sharma
  • Aarshay Jain

Download Analytics Vidhya App for the Latest blog/Article

Leave a Reply Your email address will not be published. Required fields are marked *