etl tools

KOGOT

COM424E-CAT2.pdf

Home >Computer Science homework help >etl tools

Kenneth 1

Name: Kenneth Otieno Ouko

Reg no.: COM/1010/17

Course: Neural Networks (COM 424E)

Lecturer: Madam Edna Milgo.

Date: 2/5/2021

CAT 2: 40 marks

1. Compare the similarities of artificial neural network to a biological neural system in

the human brain. (5 marks)

 Both biological neural network and artificial neural network systems process information

in parallel.

 To improve their level of performance, both the networks learn through experience of the

past.

 Learning in both the networks involve adjustment of synaptic connections and weights in

biological neural network and artificial neural network respectively. Weights in ANN is

similar to synaptic connections in the Biological neural system.

 Information transmission in both the networks involve electrical signal.

Kenneth 2

 Storage of information of biological neural network and artificial neural network is at the

synapses and weight matrix respectively.

2. Why is multilayer perception considered suitable to solve non linear problems (the

so called XOR equations)?

When we consider the example of a simple XOR problem. And comparing the abilities of

perceptron and multilayer perceptron in solving the problem,

The diagrams below ascertain the capability of MLP in solving a nonlinear equation and

also give account.

This is an XOR with two inputs x1 and x2.

Pattern space of the XOR

Kenneth 3

Its solution require two lines as shown in the figure below.

Kenneth 4

From this diagram above we can see that the a single line of a perception cannot be used

to solve the problem hence double line is required.

Why is it important to separate training from testing data in supervised models?(5 marks)

This is to avoid overlifting. This is a problem where a model is finely tuned to data it has been

given but fail to give accurate prediction on the data which it has never been trained on.

3. What is a Confusion Matrix.

A confusion matrix is a table that is used to describe the performance of a classification

model (or "classifier") on a set of test data for which the true values are known.

What is Type 1 Error and Type 2 Error in a confusion matrix. Using an example

explain the each of the values in a confusion matrix.(5 marks).

Type I Error (False Positive Error)-is asserting something as true when in reality it is

false. Example, predicting that someone is pregnant she is not. The actual outcome

became false when the prediction was positive.

Type II Error ( False negative): This is when a result of testing shows that a condition

failed while its actual was successfully. This error is committed when we fail to believe a

true condition. Consider the pregnancy example, When we predict that some one isn't

pregnant when in actual sense she is pregnant.

Kenneth 5

4. What is an ROC curve.

Is a graphical representation of the ability of binary classifier, in diagnosis, while its

discrimination threshold varied. Originally, ROC was used in the theory of signal

detection, however its application has extended to medicine, natural hazards, radiology

and machine learning.

Give five advantages of using ROC curve in healthcare data.(5 marks)

 Several diagnostic tasks on a particular subjects could be compared at the same

time in a ROC space.

 By visualizing the curve, sensitivity at a specific FPF can be obtained easily.

 ROC curve analysis can be used to determine optimal cut-off value.

 In contrast to single measures of sensitivity and specificity, the diagnostic

accuracy, such as AUC driven from this analysis is not affected by decision

criterion and it is also independent of prevalence of disease since it is based on

sensitivity and specificity.

5. What is overfitting? What is Underfitting? What happens to an over fitted model

when it is deployed in the real world application.(5 marks)

Overfitting is a situation where a model cannot generalize or fit well on datesets which

are unseen. A sign of overfitting is when the error on the testing dataset is greater than the

error on the training dataset.

Kenneth 6

Underfitting refers to a model that can neither model the training dataset nor generalize

to new dataset.. A Machine learning with under-fitting is not appropriate because it has

poor performance on the training datasets.

6. What is an outliers?

If a data point if considerably far from other data points then its an outlier. Take an

example of a classroom where every student is of an average height except three

extremely tall ones, the three data points, in this case, become outliers.

What do you do when you have skewed data and outliers?

Below are some of the approaches to undertake.

Setup a filter in the testing tool to eliminate skewed and outliers.

Change the values of outliers or skewed data to according to your data.

Remove or change value of outliers during post testing analysis.

Why is it important to normalize data before feeding to a supervised NN.

Normalization of data ensures that, anomalies that may complicate the data analysis are

eliminated. These anomalies may come as a result of deleting data, updating existing

information, and creating more new information.

7. What is deep learning?

Kenneth 7

According to machinelearningmastery.com, deep learning is a subfield of machine

learning concerned with algorithms inspired by the structure and function of the brain

called artificial neural networks.

Why has deep learning gained popularity in the recent years?

Deep learning is superior in giving accurate results on very huge datasets, when trained

Give one example where you would apply deep learning to solve the problem and

why.

Automated Driving: as an automotive researcher,i would use deep learning to detect stop

signs, traffic light and pedestrians on the road.

I would use deep learning because it is superior in predicting outcome of huge

datasets.The system will possibly train itself to give accurate prediction as automation on

road is very crucial.

8. What is convolutional neural network?

Is a kind of algorithm of deep learning that recognizes images by taking different image

inputs, assign weights and bias in various aspects of those images.

How is it different from deep learning and neural network?

How CNN differ from NN- convolutional-neural-network is a subclass of neural-

networks which have at least one convolution layer while CNN, has one or more layers

of convolution units. A convolution unit receives its input from multiple units from the

previous layer which together create a proximity.

Kenneth 8

How CNN differ from deep learning- a convolutional neural network is a class of

deep neural network,commonly applied to analyze visual imagery while deep

learning have several other classes.

Type I Error (False Positive Error)-is asserting something as true when in reality it is false. Example, predicting that someone is pregnant she is not. The actual outcome became false when the prediction was positive.