How to Deploy a Machine Learning Model on AWS EC2

Raghav Agrawal Last Updated : 12 Oct, 2024

8 min read

This article was published as a part of the Data Science Blogathon.

Introduction

AWS is a cloud computing service that provides on-demand computing resources for storage, networking, Machine learning, etc on a pay-as-you-go pricing model. AWS is a premier cloud computing platform around the globe, and most organization uses AWS for global networking and data storage. The article is for those Machine learning practitioners who know the model building and even they have deployed some projects on other platforms but want to learn how to deploy on major cloud platforms like AWS.

In the article, our main aim is to learn deployment over AWS, but we will walk through each step from the development of the Machine learning model to deployment over AWS from scratch.

Build a Machine learning Model

The very first step is building a model. Our main aim is to learn deployment, so I have taken a simple dataset and applied an SVM algorithm that gives a very good accuracy even after trying Random forest and decision trees. If you want, you can optimize the model accordingly.

Dataset description

We pick a very simple dataset about student placement prediction which is a binary classification problem. The dataset contains input columns such as student CGPA, IQ, and Profile score, and based on 3 input columns you have an output column that indicates whether the student will be placed or not in binary form as 0, 1. You can download the dataset using this GitHub link.

Data Preparation for model building

Data is simple, and it does not contain any missing values or data is not messy so to save time and learn deployment we load the data and prepare it for model creation.

import numpy as np
import pandas as pd
#Read the Data
df = pd.read_csv('https://raw.githubusercontent.com/Raghavagr/Student-Placement-Predictor/main/students_placement.csv')
#look over sample data
print(df.sample(5))
#separate independent and dependent columns
X = df.drop(columns=['placed'])
y = df['placed']
#train-test-split
from sklearn.model_selection import train_test_split
X_train,X_test,y_train,y_test = train_test_split(X,y,test_size=0.2,random_state=2)

Model training and testing

We first import the SVM model using the scikit-learn library, train it on training data, and test it on test data. After that, we apply the accuracy score as a metric to get the performance of the model which is 93 percent.

from sklearn.metrics import accuracy_score
from sklearn.svm import SVC
svc = SVC(kernel='rbf')
svc.fit(X_train,y_train)
y_pred = svc.predict(X_test)
print(accuracy_score(y_test,y_pred))

Export the Model

We have to deploy the model to the website, and for this first, we need to convert the model into a certain file. For converting the model into a file, we can use Pickle or Joblib library. It implements protocols for serializing and de-serializing the python objects so that by de-serialize them, you can use them for different purposes. We are using Pickle at this time to dump the model in a pickle file.

import pickle 
pickle.dump(svc,open('model.pkl','wb'))

First, we have to import the Pickle module, and there is one function named dump where we have to specify two things: objects to be dumped and which file we want to convert.

Build a Flask Website to Serve as a Model

First, we have to create one project folder where you have to keep the Pickle file. You can open the folder in any python code editor of your choice, whether it can be Pycharm, Spider, or VS code. If you do not know anything about Flask, then you can refer to my Basic Flask article on Analytics Vidhya. We will design a form where we will accept student details, and as students click on submit button, our flask app will accept the details at the backend and pass the details to a model where the model will generate a result that we will display to the student at frontend whether he will get the placement or not.

Designing HTML Form

Create a new folder in your project folder as templates and create a new HTML file in it where we will design an HTML form. I hope that you are a little bit familiar with HTML and if not, then you can refer to any beginner article on the web. We will create 3 input tabs to accept details like student CGPA, IQ, and profile score. And create a button to submit a form that states that on submitting the form, the request is POST and define the action where the form will redirect after submitting that we will handle from the flask app.

Student Placement Predictor

    {% if result %}
{{ result }}
    {% endif %}    
        CGPA
     IQ
        Profile Score

Build a Flask App

Create a new python file named an app where we define a new Flask app and load the model dump in the pickle file. As we run the server, the app will serve the “/” URL where we have to display the home page which is our HTML Form. Now after submitting it will create a POST request that we have redirected to a “/predict” URL, and at that time we will accept the form data, pass it to the model, and display output using JINJA.

If you open the command prompt and direct to the current working directory and run the below command, then your server will start running and get a localhost URL that you can open in your browser, and the app is running.

python app.py

Deploy ML Flask Web APP To AWS

Now the app is running well but it is currently running on a local system means we can only use this by running a server so to make it available for public users we have to deploy it on the cloud.

Prepare requirements file

The first thing is you should have a requirements file where we specify all the dependencies or libraries that we have used to develop the project. Create a new text file, and the name should be requirements.txt and copy the content as given below.

flask
numpy
sklearn

Create AWS account

Visit the AWS site and create a free-tier account that is valid for 12 months, and you can use popular AWS services to a certain extent. But after using each service, you should terminate and delete all the services to avoid any kind of bill. To create a free-tier account, you must have Mastercard, or Visa Debit card, or a credit card. When you first create an account, it just cuts 2 INR and returns after 3-5 business days for background verification.

Create EC2 Instance

EC2 stands for Elastic compute cloud that provides scalable compute capacity in AWS Cloud. You can also use AWS Lambda and AWS Elastic Beanstalk for deploying ML models, but EC2 is very old. You can install software on EC2, and easy to deploy your application. It is nothing but you take a server on the cloud on rent. Open EC2 Dashboard in your management console through the search tab or services. As you open the EC2 dashboard, you will observe how many instances are running. Click on the Launch Instance tab to launch a new EC2 Instance. Now by configuring 6-7 steps, you can launch your new Instance.

Step-1) Choose an Amazon Machine Image (AMI)

The first step is to choose an operating system, for instance, where you have many options. We will choose the operating system that is free-tier eligible, so click on Ubuntu and select the Ubuntu operating system which is free-tier eligible. You can also try any supported version of Linux.

Step-2) Choose Instance Type

The step is crucial, and you do not need to select anything in the step because the default option is already selected as t2.micro, which supports a free-tier account.

step-3) Create Key-Pair

To secure your instance, we have to set a password and key-pair a way to achieve this. Click on create new key pair and enter any name of your choice without space. And click on create-key-pair.

Step-4) Network settings / Configure security group

Under this, we have different network settings but keep all the settings as default and move forward. Your AWS EC2 instance is private to you and secretly kept unless you define to accept the network traffic from what kind of source, so we create a new and default security group.

step-5) Configure storage

Free-tier eligible customers can get up to 30 GB EBS, and a minimum of 8 GB is allotted to you. We have set it to 8GB which is the default.

step-6) Review and Launch

At last, you have a summary of all your configurations so you can have a look and launch an EC2 instance. After that, if you visit the instances tab then our EC2 instance is successfully and running.

Download and install Putty and WinSCP

WinSCP is used to upload your project files to the server.

Putty is a remote client where by using the SSH key, you take access to your machine. You can open an EC2 command prompt in Putty and install the project dependencies and libraries. To download the putty, visit the official site and download the installer file as per your OS.

Upload the website

Open the WinSCP, and you need to provide the hostname and the key-value pair we have created above. If the file is in PEM format, it will ask you to convert it to PPK format and click on OK. Click on login, and when you are successfully logged in, then you will have 2 parallel screens, where first displays the files of your local system, and the second one is your EC2 instance in a local system. Open the desired project folder and paste each file on the right side (Server folder).

Before Pasting all the files, we need to specify the port and host in the python app file because currently, the app was running in the local file system, but now it will run on the cloud, so edit the suggested changes.

if __name__ == "__main__":
    app.run(host='0.0.0.0', port=8080)

Install Python and Libraries on Server (EC2 Instance)

We will need Putty for this, so open the Putty using the same key-value pair. so one by one, you have to install each library on the EC2 server.

Install PIP using which we can install all the python libraries on AWS EC2. As we successfully install PIP, we can run the pip command to install all the required libraries specified in the requirements file.

sudo apt-get update && sudo apt-get install python3-pip
pip3 install -r requirements.txt

Start the server and Test the website

In the Putty command prompt, you can run the app file using the command python3 app.py so you will get the URL. Visit the SSH client in Instance Dashboard and copy the AWS EC2 instance address and paste it into the browser. At last, write .8080 port and press enter.

The website will only run till when you run the command in Putty. So to stop or avoid this flaw, you need to run one command, a JNU multiplexer in a complex terminology that copies the current screen on a server. The command is given in the below snippet, so copy and run the command in a terminal, and Now until you press CTRL + C, the server will keep running. In the following article, we will also learn how to download the production environment. For now, I hope this article has provided you with an in-depth overview of model deployment over AWS EC2.

screen -R deploy python3 app.py

Conclusion

In this article, we have learned about deploying a Machine learning model on the AWS cloud using a top-rated AWS EC2 service. Let us discuss the key takeaways from that article that you should remember.

AWS is a cloud-service platform that offers on-demand cloud services on a pay-as-you-go pricing model where you have to only pay for the time and storage of service that you have consumed.
AWS provides a 12-month free-tier account in which you can use some services up to a certain limit of storage.
AWS provides different serverless platforms like AWS Lambda or Beanstalk that can also be used to host a machine learning website.
To develop a Machine learning model or to perform data analytics, AWS EC2 offers separate tools for a specific use case that you can check out under the services tab.
Flask is a micro web framework of python that you can use to build small data apps or static websites.

Thank You Note

I hope it was easy to cope with each step discussed in the article. If you have any queries, please post them in the comment section below or connect with me.
Connect with me on Linkedin.
Check out my other articles on Analytics Vidhya and crazy-techie

The media shown in this article is not owned by Analytics Vidhya and is used at the Author’s discretion.

AWS EC2 blogathon machine learning

Raghav Agrawal

I am a software Engineer with a keen passion towards data science. I love to learn and explore different data-related techniques and technologies. Writing articles provide me with the skill of research and the ability to make others understand what I learned. I aspire to grow as a prominent data architect through my profession and technical content writing as a passion.

AWS Beginner Machine Learning Python

Free Courses

4.7

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

4.5

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

4.6

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

4.8

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

4.7

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

MUID

Used by Microsoft Clarity, to store and track visits across websites.

Expiry: 1 Year

Type: HTTP

_clck

Used by Microsoft Clarity, Persists the Clarity User ID and preferences, unique to that site, on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.

Expiry: 1 Year

Type: HTTP

_clsk

Used by Microsoft Clarity, Connects multiple page views by a user into a single Clarity session recording.

Expiry: 1 Day

Type: HTTP

SRM_I

Collects user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Years

Type: HTTP

SM

Use to measure the use of the website for internal analytics

Expiry: 1 Years

Type: HTTP

CLID

The cookie is set by embedded Microsoft Clarity scripts. The purpose of this cookie is for heatmap and session recording.

Expiry: 1 Year

Type: HTTP

SRM_B

Collected user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Months

Type: HTTP

_gid

This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected includes the number of visitors, the source where they have come from, and the pages visited in an anonymous form.

Expiry: 399 Days

Type: HTTP

_ga_#

Used by Google Analytics, to store and count pageviews.

Expiry: 399 Days

Type: HTTP

_gat_#

Used by Google Analytics to collect data on the number of times a user has visited the website as well as dates for the first and most recent visit.

Expiry: 1 Day

Type: HTTP

collect

Used to send data to Google Analytics about the visitor's device and behavior. Tracks the visitor across devices and marketing channels.

Expiry: Session

Type: PIXEL

AEC

cookies ensure that requests within a browsing session are made by the user, and not by other sites.

Expiry: 6 Months

Type: HTTP

G_ENABLED_IDPS

use the cookie when customers want to make a referral from their gmail contacts; it helps auth the gmail account.

Expiry: 2 Years

Type: HTTP

test_cookie

This cookie is set by DoubleClick (which is owned by Google) to determine if the website visitor's browser supports cookies.

Expiry: 1 Year

Type: HTTP

_we_us

this is used to send push notification using webengage.

Expiry: 1 Year

Type: HTTP

WebKlipperAuth

used by webenage to track auth of webenagage.

Expiry: Session

Type: HTTP

ln_or

Linkedin sets this cookie to registers statistical data on users' behavior on the website for internal analytics.

Expiry: 1 Day

Type: HTTP

JSESSIONID

Use to maintain an anonymous user session by the server.

Expiry: 1 Year

Type: HTTP

li_rm

Used as part of the LinkedIn Remember Me feature and is set when a user clicks Remember Me on the device to make it easier for him or her to sign in to that device.

Expiry: 1 Year

Type: HTTP

AnalyticsSyncHistory

Used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

lms_analytics

Used to store information about the time a sync with the AnalyticsSyncHistory cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

liap

Cookie used for Sign-in with Linkedin and/or to allow for the Linkedin follow feature.

Expiry: 6 Months

Type: HTTP

visit

allow for the Linkedin follow feature.

Expiry: 1 Year

Type: HTTP

li_at

often used to identify you, including your name, interests, and previous activity.

Expiry: 2 Months

Type: HTTP

s_plt

Tracks the time that the previous page took to load

Expiry: Session

Type: HTTP

lang

Used to remember a user's language setting to ensure LinkedIn.com displays in the language selected by the user in their settings

Expiry: Session

Type: HTTP

s_tp

Tracks percent of page viewed

Expiry: Session

Type: HTTP

AMCV_14215E3D5995C57C0A495C55%40AdobeOrg

Indicates the start of a session for Adobe Experience Cloud

Expiry: Session

Type: HTTP

s_pltp

Provides page name value (URL) for use by Adobe Analytics

Expiry: Session

Type: HTTP

s_tslv

Used to retain and fetch time since last visit in Adobe Analytics

Expiry: 6 Months

Type: HTTP

li_theme

Remembers a user's display preference/theme setting

Expiry: 6 Months

Type: HTTP

li_theme_set

Remembers which users have updated their display / theme preferences

Expiry: 6 Months

Type: HTTP

Reading list

Basics of Machine Learning

Machine Learning Lifecycle

Importance of Stats and EDA

Understanding Data

Probability

Exploring Continuous Variable

Exploring Categorical Variables

Missing Values and Outliers

Central Limit theorem

Bivariate Analysis Introduction

Continuous - Continuous Variables

Continuous Categorical

Categorical Categorical

Multivariate Analysis

Different tasks in Machine Learning

Build Your First Predictive Model

Evaluation Metrics

Preprocessing Data

Linear Models

KNN

Selecting the Right Model

Feature Selection Techniques

Decision Tree

Feature Engineering

Naive Bayes

Multiclass and Multilabel

Basics of Ensemble Techniques

Advance Ensemble Techniques

Hyperparameter Tuning

Support Vector Machine

Advance Dimensionality Reduction

Unsupervised Machine Learning Methods

Recommendation Engines

Improving ML models

Working with Large Datasets

Interpretability of Machine Learning Models

Interpretability of Machine Learning Models

Automated Machine Learning

Model Deployment

Deploying ML Models

Embedded Devices

How to Deploy a Machine Learning Model on AWS EC2

Introduction

Build a Machine learning Model

Dataset description

Data Preparation for model building

Model training and testing

Export the Model

Build a Flask Website to Serve as a Model

Student Placement Predictor

Build a Flask App

Deploy ML Flask Web APP To AWS

Prepare requirements file

Create AWS account

Create EC2 Instance

Download and install Putty and WinSCP

Upload the website

Install Python and Libraries on Server (EC2 Instance)

Start the server and Test the website

Conclusion

Free Courses

Generative AI - A Way of Life

Getting Started with Large Language Models

Building LLM Applications using Prompt Engineering

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Microsoft Excel: Formulas & Functions

Recommended Articles

Responses From Readers

Write for us

Congratulations, You Did It!

Analytics Vidhya (4)

brahmaid

csrftoken

Identityid

sessionid

Google (1)

g_state

Microsoft (7)

MUID