Introduction to Artificial Neural Networks

Gourav Singh 05 Apr, 2024 • 10 min read

This article was published as a part of the Data Science Blogathon

Introduction:

Artificial Neural Networks (ANN) are algorithms based on brain function and are used to model complicated patterns and forecast issues. The Artificial Neural Network (ANN) is a deep learning method that arose from the concept of the human brain Biological Neural Networks. The development of ANN was the result of an attempt to replicate the workings of the human brain. The workings of ANN are extremely similar to those of biological neural networks, although they are not identical. ANN algorithm accepts only numeric and structured data.

Convolutional Neural Networks (CNN) and Recursive Neural Networks (RNN) are used to accept unstructured and non-numeric data forms such as Image, Text, and Speech. This article focuses solely on Artificial Neural Networks.

Introduction:
What is Artificial Neural Network(ANN)?
Artificial Neural Networks Architecture
Benefits of Artificial Neural Networks
Types of Artificial Neural Networks
How do Artificial Neural Networks learn?
Application of Artificial Neural Networks
Advantages of Artificial Neural Networks
Disadvantages of Artificial Neural Networks
Create a Simple ANN for the famous Titanic Dataset
- Conclusion
Frequently Asked Questions

What is Artificial Neural Network(ANN)?

An Artificial Neural Network (ANN) is a computational model inspired by the human brain’s neural structure. It consists of interconnected nodes (neurons) organized into layers. Information flows through these nodes, and the network adjusts the connection strengths (weights) during training to learn from data, enabling it to recognize patterns, make predictions, and solve various tasks in machine learning and artificial intelligence.

Artificial Neural Networks Architecture

1. There are three layers in the network architecture: the input layer, the hidden layer (more than one), and the output layer. Because of the numerous layers are sometimes referred to as the MLP (Multi-Layer Perceptron).

2. It is possible to think of the hidden layer as a “distillation layer,” which extracts some of the most relevant patterns from the inputs and sends them on to the next layer for further analysis. It accelerates and improves the efficiency of the network by recognizing just the most important information from the inputs and discarding the redundant information.

3. The activation function is important for two reasons: first, it allows you to turn on your computer.

This model captures the presence of non-linear relationships between the inputs.
It contributes to the conversion of the input into a more usable output.

Activation functions,artificial neural networks

4. Finding the “optimal values of W — weights” that minimize prediction error is critical to building a successful model. The “backpropagation algorithm” does this by converting ANN into a learning algorithm by learning from mistakes.

5. The optimization approach uses a “gradient descent” technique to quantify prediction errors. To find the optimum value for W, small adjustments in W are tried, and the impact on prediction errors is examined. Finally, those W values are chosen as ideal since further W changes do not reduce mistakes.

Benefits of Artificial Neural Networks

ANNs offers many key benefits that make them particularly well-suited to specific issues and situations:

1. ANNs can learn and model non-linear and complicated interactions, which is critical since many of the relationships between inputs and outputs in real life are non-linear and complex.

2. ANNs can generalize – After learning from the original inputs and their associations, the model may infer unknown relationships from anonymous data, allowing it to generalize and predict unknown data.

3. ANN does not impose any constraints on the input variables, unlike many other prediction approaches (like how they should be distributed). Furthermore, numerous studies have demonstrated that ANNs can better simulate heteroskedasticity, or data with high volatility and non-constant variance, because of their capacity to discover latent correlations in the data without imposing any preset associations. This is particularly helpful in financial time series forecasting (for example, stock prices) when significant data volatility.

Types of Artificial Neural Networks

Five Types of Artifical Neural Networks:

Feedforward Neural Networks (FNNs): These are straightforward networks where information flows in one direction, like from the input to the output. They’re used for tasks like identifying patterns in data or making predictions.
Convolutional Neural Networks (CNNs): Think of these as networks designed specifically for understanding images. They’re great at recognizing patterns in pictures, making them perfect for tasks like identifying objects in photos or videos.
Recurrent Neural Networks (RNNs): These networks are good with sequences, like predicting the next word in a sentence or understanding the context of words. They remember previous information, which helps them understand the current data better.
Long Short-Term Memory Networks (LSTMs): LSTMs are a type of RNN that are really good at remembering long sequences of data. They’re often used in tasks where understanding context over time is important, like translating languages or analyzing time-series data.
Generative Adversarial Networks (GANs): These networks are like artists. One part of the network generates new data, like images or music, while the other part critiques it to make sure it looks or sounds realistic. GANs are used for creating new content, enhancing images, or even generating deepfakes.

How do Artificial Neural Networks learn?

Starting Point: Imagine you’re building a robot brain, but initially, it knows nothing. So, you randomly assign some strengths to the connections between its “neurons” (like how our brain’s neurons are connected).
Seeing Data: Now, show the robot some examples of what you want it to learn. For instance, if you’re teaching it to recognize cats, show it lots of pictures of cats.
Guessing and Checking: The robot tries to imagine what it’s seeing based on the strengths of its connections. At first, it’ll make lots of mistakes because it’s just guessing randomly.
Getting Feedback: You tell the robot how wrong its guesses are. For example, you say, “No, that’s not a cat; it’s a dog.” This helps the robot understand where it went wrong.
Adjusting Strengths: The robot tweaks the strengths of its connections based on the feedback. If it guessed wrong, it changes the connections to be a bit stronger or weaker so that next time it might make a better guess.
Practice Makes Perfect: The robot keeps looking at more examples, guessing, getting feedback, and adjusting until it gets better and better at recognizing cats.
Testing Skills: Once the robot has seen lots of examples and adjusted its connections a lot, you give it a new picture it hasn’t seen before to see if it can correctly identify whether it’s a cat or not.

Application of Artificial Neural Networks

ANNs have a wide range of applications because of their unique properties. A few of the important applications of ANNs include:

1. Image Processing and Character recognition:

ANNs play a significant part in picture and character recognition because of their capacity to take in many inputs, process them, and infer hidden and complicated, non-linear correlations. Character recognition, such as handwriting recognition, has many applications in fraud detection (for example, bank fraud) and even national security assessments.

image processiong,artificial neural networks

Image recognition is a rapidly evolving discipline with several applications ranging from social media facial identification to cancer detection in medicine to satellite image processing for agricultural and defense purposes.

Deep neural networks, which form the core of “deep learning,” have now opened up all of the new and transformative advances in computer vision, speech recognition, and natural language processing – notable examples being self-driving vehicles, thanks to ANN research.

2. Forecasting:

It is widely used in everyday company decisions (sales, the financial allocation between goods, and capacity utilization), economic and monetary policy, finance, and the stock market. Forecasting issues are frequently complex; for example, predicting stock prices is complicated with many underlying variables (some known, some unseen).

Traditional forecasting models have flaws when it comes to accounting for these complicated, non-linear interactions. Given its capacity to model and extract previously unknown characteristics and correlations, ANNs can provide a reliable alternative when used correctly. ANN also has no restrictions on the input and residual distributions, unlike conventional models.

Advantages of Artificial Neural Networks

Attribute-value pairs are used to represent problems in ANN.
The output of ANNs can be discrete-valued, real-valued, or a vector of multiple real or discrete-valued characteristics, while the target function can be discrete-valued, real-valued, or a vector of numerous real or discrete-valued attributes.
Noise in the training data is not a problem for ANN learning techniques. There may be mistakes in the training samples, but they will not affect the final result.
It’s utilized when a quick assessment of the taught target function is necessary.
The number of weights in the network, the number of training instances evaluated, and the settings of different learning algorithm parameters can all contribute to extended training periods for ANNs.

Disadvantages of Artificial Neural Networks

1. Hardware Dependence:

The construction of Artificial Neural Networks necessitates the use of parallel processors.
As a result, the equipment’s realization is contingent.

2. Understanding the network’s operation:

This is the most serious issue with ANN.
When ANN provides a probing answer, it does not explain why or how it was chosen.
As a result, the network’s confidence is eroded.

3. Assured network structure:

Any precise rule does not determine the structure of artificial neural networks.
Experience and trial and error are used to develop a suitable network structure.

4. Difficulty in presenting the issue to the network:

ANNs are capable of working with numerical data.
Before being introduced to ANN, problems must be converted into numerical values.
The display method that is chosen will have a direct impact on the network’s performance.
The user’s skill is a factor here.

5. The network’s lifetime is unknown:

When the network’s error on the sample is decreased to a specific amount, the training is complete.
The value does not produce the best outcomes.

Create a Simple ANN for the famous Titanic Dataset

Now that we have discussed the architecture, advantages, and disadvantages it’s time to create an ANN model so that we would know how it works.

For understanding ANN we would be using world-famous titanic survival prediction. you can find the dataset here https://www.kaggle.com/jamesleslie/titanic-neural-network-for-beginners/data?select=train_clean.csv.

let’s start with importing the dependencies.

## import dependencies 
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
from matplotlib.pyplot import rcParams
%matplotlib inline
rcParams['figure.figsize'] = 10,8
sns.set(style='whitegrid', palette='muted',
        rc={'figure.figsize': (15,10)})
import os
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.model_selection import GridSearchCV
from keras.wrappers.scikit_learn import KerasClassifier
from keras.models import Sequential
from keras.layers import Dense, Activation, Dropout
from numpy.random import seed
from tensorflow import set_random_seed

Once you have all the preprocessing and modeling libraries imported, we will read the training and testing data.

# Load data as Pandas dataframe train = pd.read_csv('./train_clean.csv', ) test = pd.read_csv('./test_clean.csv') df = pd.concat([train, test], axis=0, sort=True) df.head()

We have concatenated both training and testing CSV in order to apply the same preprocessing method on both of them. once created the dataset we would start preprocessing the dataset since it has multiple columns that are non-numbers. Starting with the column name ‘sex’ in the dataset, we would be converting it to binary variables.

# convert to cateogry dtype
df['Sex'] = df['Sex'].astype('category')
# convert to category codes
df['Sex'] = df['Sex'].cat.codes

After this, we need to convert the rest of the variables:

# subset all categorical variables which need to be encoded
categorical = ['Embarked', 'Title']
for var in categorical:
    df = pd.concat([df, 
                    pd.get_dummies(df[var], prefix=var)], axis=1)
    del df[var]
# drop the variables we won't be using
df.drop(['Cabin', 'Name', 'Ticket', 'PassengerId'], axis=1, inplace=True)
df.head()

## scale continuous variable
continuous = ['Age', 'Fare', 'Parch', 'Pclass', 'SibSp', 'Family_Size']
scaler = StandardScaler()
for var in continuous:
    df[var] = df[var].astype('float64')
    df[var] = scaler.fit_transform(df[var].values.reshape(-1, 1))

Once preprocessing is done we need to split the train and test the dataset again, for that you can use the following code.

X_train = df[pd.notnull(df['Survived'])].drop(['Survived'], axis=1)
y_train = df[pd.notnull(df['Survived'])]['Survived']
X_test = df[pd.isnull(df['Survived'])].drop(['Survived'], axis=1)

Now is the time to define the hyperparameters and define the architecture of the ANN model.

lyrs=[8]
act='linear' 
opt='Adam'
dr=0.0
# set random seed for reproducibility
seed(42)
set_random_seed(42)
model = Sequential()
# create first hidden layer
model.add(Dense(lyrs[0], input_dim=X_train.shape[1], activation=act))
# create additional hidden layers
for i in range(1,len(lyrs)):
    model.add(Dense(lyrs[i], activation=act))
# add dropout, default is none
model.add(Dropout(dr))
# create output layer
model.add(Dense(1, activation='sigmoid'))  # output layer
model.compile(loss='binary_crossentropy', optimizer=opt, metrics=['accuracy'])
model = create_model()
print(model.summary())

model summary | Artificial Neural Networks — Source: Local

after model definition, we will fit the model on our training data and would get the model insight.

# train model on full train set, with 80/20 CV split
training = model.fit(X_train, y_train, epochs=100, batch_size=32, validation_split=0.2, verbose=0)
val_acc = np.mean(training.history['val_acc'])
print("n%s: %.2f%%" % ('val_acc', val_acc*100))
# summarize history for accuracy
plt.plot(training.history['acc'])
plt.plot(training.history['val_acc'])
plt.title('model accuracy')
plt.ylabel('accuracy')
plt.xlabel('epoch')
plt.legend(['train', 'validation'], loc='upper left')
plt.show()

Now you can use the model for predictions on test data, using the following code chunk:

# calculate predictions
test['Survived'] = model.predict(X_test)
test['Survived'] = test['Survived'].apply(lambda x: round(x,0)).astype('int')
solution = test[['PassengerId', 'Survived']]

print(solution)

predicctions | Artificial Neural Networks

Source: Local

Conclusion

Artificial neural networks (ANNs) are powerful models that can be applied in many scenarios in artificial intelligence. Several noteworthy uses of ANNs have been mentioned above, although they have applications in various industries, including medical, security/finance, government, agricultural, and defense. ANNs are particularly effective in tasks such as image recognition, natural language processing, and predictive analytics. They have the ability to learn complex patterns and relationships from data, making them invaluable tools for solving a wide range of problems in different domains.

Frequently Asked Questions

Q1. What is Artificial Neural network(ANN)?

A. An Artificial Neural Network (ANN) is a machine learning model inspired by the human brain’s neural structure. It comprises interconnected nodes (neurons) organized into layers. Data flows through these nodes, adjusting the weights of connections to learn patterns and make predictions. ANNs excel in tasks like image recognition, language processing, and decision-making, revolutionizing various fields.

Q2. What is the main function of artificial neural networks?

A. The primary function of artificial neural networks (ANNs) is to process and learn from data in a way that enables them to recognize patterns, make predictions, and solve complex problems. ANNs mimic the human brain’s neural connections, adjusting the connections’ strengths (weights) during training to improve their ability to generalize and perform tasks such as image recognition, language processing, and decision-making.

Q3.What is the difference between CNN and ANN?

In artificial intelligence, neural networks play a vital role. The two main types are Artificial Neural Networks (ANNs) and Convolutional Neural Networks (CNNs). ANNs are versatile, with interconnected nodes, while CNNs are specialized for grid-like data, particularly images, making them ideal for tasks like image classification and object detection.

Q4.Is ANN deep learning?

Yes, Artificial Neural Networks (ANNs) are a fundamental component of deep learning. Deep learning refers to a subset of machine learning methods that use neural networks with multiple layers to extract high-level features from data. ANNs with multiple hidden layers are what make deep learning possible, enabling the network to learn complex patterns and representations from input data. Therefore, ANNs are indeed a form of deep learning.

References:

https://www.kaggle.com

Image 1 -https://www.analyticsvidhya.com
Image 2- https://medium.com
Image 3 – https://medium.com
Image 4 – https://medium.com

Thanks for reading this article do like if you have learned something new, feel free to comment See you next time !!! ❤️

The media shown in this article are not owned by Analytics Vidhya and are used at the Author’s discretion.

Gourav Singh 05 Apr 2024

Applied Machine Learning Engineer skilled in Computer Vision/Deep Learning Pipeline Development, creating machine learning models, retraining systems and transforming data science prototypes to production-grade solutions. Consistently optimizes and improves real-time systems by evaluating strategies and testing on real world scenarios.

Beginner Classification Deep Learning Python Structured Data

Introduction to Artificial Neural Networks

Introduction:

Table of contents

What is Artificial Neural Network(ANN)?

Artificial Neural Networks Architecture

Benefits of Artificial Neural Networks

Types of Artificial Neural Networks

How do Artificial Neural Networks learn?

Application of Artificial Neural Networks

1. Image Processing and Character recognition:

2. Forecasting:

Advantages of Artificial Neural Networks

Disadvantages of Artificial Neural Networks

1. Hardware Dependence:

2. Understanding the network’s operation:

3. Assured network structure:

4. Difficulty in presenting the issue to the network:

5. The network’s lifetime is unknown:

Create a Simple ANN for the famous Titanic Dataset

Conclusion

Frequently Asked Questions

References:

Frequently Asked Questions

Responses From Readers

Related Courses

Introduction to Neural Networks

Free

Getting Started with Neural Networks

Free

Introduction to Python

Free

Write for us