I am working on a machine learning project and using 3 classification methods, namely:
and while modeling I needed to apply a feature-scaling technique, StandardScaler,
to improve the performance of the models.
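As a point of reference, here is a minimal sketch of how StandardScaler is typically used (the data and variable names are illustrative, not from the original post): the scaler is fit on the training data only, and the same statistics are reused to transform the test data.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Toy data: two features on very different scales
X_train = np.array([[1.0, 200.0],
                    [2.0, 400.0],
                    [3.0, 600.0]])
X_test = np.array([[2.0, 300.0]])

scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)  # fit on training data only
X_test_scaled = scaler.transform(X_test)        # reuse the same mean/std

# Each training column now has mean 0 and unit variance
print(X_train_scaled.mean(axis=0))  # ≈ [0. 0.]
print(X_train_scaled.std(axis=0))   # ≈ [1. 1.]
```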
I am getting the following results:
Are these results appropriate? And can model performance get worse after standardization, as happened here with the SVM?
Your results are sound and make sense. To understand them in a little more depth, it is worth a quick overview of how these algorithms work:
1- An MLP computes linear combinations of the inputs, which are then passed through non-linear activation functions. If some of your features are on a larger numerical scale than the others, they dominate those pre-activation sums. Standardization helps the MLP learn from all features, since they enter the network on an equal footing.
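A small numerical sketch of that point (the data is synthetic, made up for illustration): with equal weights, a large-scale feature contributes almost all of the pre-activation sum, and standardizing the columns restores the balance.

```python
import numpy as np

rng = np.random.default_rng(0)
# Feature 0 roughly in [0, 1], feature 1 roughly in [0, 1000]
X = np.column_stack([rng.random(100), rng.random(100) * 1000])
w = np.array([1.0, 1.0])  # equal weights

# Per-feature contribution to the linear combination w . x
contrib = np.abs(X * w).sum(axis=0)
print(contrib / contrib.sum())   # feature 1 dominates (~99.9%)

# Standardize each column: zero mean, unit variance
Xs = (X - X.mean(axis=0)) / X.std(axis=0)
contrib_s = np.abs(Xs * w).sum(axis=0)
print(contrib_s / contrib_s.sum())  # now roughly balanced
```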
2- The KNN algorithm is non-parametric: classification depends on how similar the current data point is to already-labeled points in feature space. Similarity is usually measured as a distance in that space (Euclidean distance, for instance). Without standardization, features with large numerical scales dominate the distance; standardizing makes every feature contribute comparably, which usually improves overall performance.
3- The SVM algorithm tries to find the hyperplane that best separates the classes. Standardization changes the geometry of the feature space, so the margin-maximizing hyperplane found after scaling is generally a different one; on some datasets the data points end up more closely packed and the new boundary misclassifies more of them, which is why SVM performance can occasionally drop.
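The comparison above can be reproduced as a sketch (synthetic data, not the asker's dataset; the direction of the difference depends entirely on the data, so no outcome is guaranteed): train the same SVM with and without a StandardScaler step and compare test accuracy.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic binary classification problem; exaggerate one feature's scale
X, y = make_classification(n_samples=400, n_features=5, random_state=0)
X[:, 0] *= 1000
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Same SVM, with and without standardization
raw = SVC(kernel="rbf").fit(X_tr, y_tr)
scaled = make_pipeline(StandardScaler(), SVC(kernel="rbf")).fit(X_tr, y_tr)

acc_raw = raw.score(X_te, y_te)
acc_scaled = scaled.score(X_te, y_te)
print("raw:   ", acc_raw)
print("scaled:", acc_scaled)
```

Which variant wins depends on the dataset; the point is only that scaling changes the decision boundary the SVM finds, so the two accuracies will generally differ.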