Standard Deviation Rescaling
Easy:
Standard deviation rescaling is a technique used in deep learning to make sure that the numbers in a dataset are all on the same scale. This matters because different features are measured in different units and ranges, and if they are not on the same scale, it can be difficult for a model to learn from the data.
Imagine you are trying to teach a child to recognize different animals. You show them pictures of a cat, a dog, and a bird, and you ask them to identify which one is which. If the pictures are all different sizes, it might be hard for the child to recognize the animals because they are not all on the same scale. But if you make sure that all the pictures are the same size, it becomes much easier for the child to recognize the animals.
In deep learning, we do something similar. We take the numbers in our dataset and put them all on the same scale by subtracting the average value and dividing by the standard deviation, a measure of how spread out the numbers are. This process is called standard deviation rescaling.
For example, let’s say we have a dataset of temperatures in a city: 20, 25, 30, 35, and 40 degrees Celsius. To put these temperatures on the same scale, we subtract the average temperature (which is 30) and divide by the standard deviation of the temperatures (which is about 7.07 here). This gives us roughly the following numbers:
-1.41, -0.71, 0, 0.71, 1.41
Now all the temperatures are on the same scale, and it becomes easier for a model to learn from the data.
So, standard deviation rescaling is a way to make sure that all the numbers in a dataset are on the same scale, which helps deep learning models learn more effectively from the data.
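The temperature example above can be sketched in a few lines of plain Python (the numbers are the ones from the example; the variable names are my own):

```python
# Temperatures from the example above, in degrees Celsius.
temps = [20, 25, 30, 35, 40]

# Average (mean) of the temperatures.
mean = sum(temps) / len(temps)  # 30.0

# Standard deviation: how spread out the temperatures are.
std = (sum((t - mean) ** 2 for t in temps) / len(temps)) ** 0.5  # about 7.07

# Subtract the mean and divide by the standard deviation.
rescaled = [(t - mean) / std for t in temps]
print([round(z, 2) for z in rescaled])  # [-1.41, -0.71, 0.0, 0.71, 1.41]
```

Note that the rescaled values are centered on 0 and expressed in units of the spread, rather than in degrees Celsius.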
Moderate:
Imagine you and your friends are playing a game where you measure how far each of you can throw a ball. You write down the distances, but some of your friends are much stronger and throw really far, while others throw shorter distances.
To make the game fairer, you decide to adjust the scores so that everyone’s throws can be compared more easily. This is where the idea of “Standard Deviation Rescaling” comes in.
First, let’s understand two simple things:
Average (Mean): This is like finding out how far, on average, everyone throws the ball.
Spread (Standard Deviation): This tells us how spread out everyone’s throws are from the average. If everyone throws the ball roughly the same distance, the spread is small. If some throw very far and others very short, the spread is large.
Now, to rescale the scores using Standard Deviation Rescaling, you do the following:
Subtract the Average: For each throw, you subtract the average distance from it. This helps to see how each throw compares to the middle point.
Divide by the Spread: Then, you divide the result by the spread. This makes sure that whether someone threw the ball much farther or shorter, it’s adjusted to a common scale.
In simple terms, you’re taking each throw, seeing how it compares to everyone’s throws (average), and then adjusting it based on how spread out all the throws are (standard deviation).
This way, you can compare throws more fairly, no matter how strong or weak someone is. It’s like leveling the playing field so that everyone’s effort can be fairly compared.
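The two steps above can be written out directly in Python (the throw distances are invented for illustration):

```python
# Hypothetical throw distances, in metres.
throws = [5.0, 12.0, 8.0, 20.0, 15.0]

# The average (mean) throw.
mean = sum(throws) / len(throws)  # 12.0

# The spread (standard deviation) of the throws.
spread = (sum((d - mean) ** 2 for d in throws) / len(throws)) ** 0.5

# Step 1: subtract the average. Step 2: divide by the spread.
fair_scores = [(d - mean) / spread for d in throws]

# Each score now says how far a throw is from average, in units of spread.
print([round(s, 2) for s in fair_scores])
```

A score of 0 means an exactly average throw, positive scores are above average, and negative scores are below, regardless of how far anyone actually threw.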
Hard:
Standard deviation rescaling, also known as standardization or z-score normalization, is a technique used in deep learning to preprocess input data before feeding it into a neural network. The main purpose of this technique is to ensure that the features (input variables) are on a similar scale, which can improve the training process and convergence of the neural network.
In deep learning models, the input data often consists of features with different scales or units. For example, in a dataset containing information about people, one feature might be age (ranging from 0 to 100), while another feature could be income (ranging from thousands to millions). Without rescaling, the feature with a larger scale (e.g., income) would have a much larger influence on the model’s output than the feature with a smaller scale (e.g., age), which can lead to suboptimal performance.
Standard deviation rescaling is performed by subtracting the mean (average) value of a feature from each data point and then dividing the result by the standard deviation of that feature. The standard deviation is a measure of how spread out the data is from the mean value. This process is applied to each feature independently.
Mathematically, the standard deviation rescaling of a feature x can be expressed as:
z = (x - μ) / σ
where:
- z is the rescaled feature value
- x is the original feature value
- μ (mu) is the mean (average) of the feature
- σ (sigma) is the standard deviation of the feature
After standard deviation rescaling, each feature will have a mean of 0 and a standard deviation of 1. This ensures that all features are on a similar scale; if the data is roughly normally distributed, most rescaled values will fall between -3 and 3. This can help the neural network learn patterns more effectively and converge faster during training.
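A minimal NumPy sketch of the formula above, applied column-wise to a small two-feature matrix (the age and income values are invented for illustration):

```python
import numpy as np

# Hypothetical data: column 0 = age (years), column 1 = income (dollars).
X = np.array([[25.0,  40_000.0],
              [35.0,  85_000.0],
              [45.0, 120_000.0],
              [55.0,  95_000.0]])

mu = X.mean(axis=0)    # per-feature mean
sigma = X.std(axis=0)  # per-feature standard deviation

# z = (x - mu) / sigma, applied to each feature independently.
Z = (X - mu) / sigma

# After rescaling, each column has mean ~0 and standard deviation 1.
print(Z.mean(axis=0))
print(Z.std(axis=0))
```

Broadcasting handles the per-column arithmetic, so the same two lines work for any number of features.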
Standard deviation rescaling is often used alongside other preprocessing techniques, such as min-max scaling or outlier removal, to further improve the performance of deep learning models. It is an important step in data preparation and can significantly affect the accuracy and convergence speed of a model.
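In practice this transform is rarely coded by hand; for example, scikit-learn's StandardScaler implements it directly. A short sketch on invented data (fitting on the training set and reusing the same statistics on new data avoids leaking information from the validation set):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Invented training data: two features on very different scales.
X_train = np.array([[25.0,  40_000.0],
                    [35.0,  85_000.0],
                    [45.0, 120_000.0]])

scaler = StandardScaler()
X_scaled = scaler.fit_transform(X_train)  # learns mu and sigma, then rescales

# The same learned statistics are reused for new data.
X_new = np.array([[30.0, 60_000.0]])
X_new_scaled = scaler.transform(X_new)

print(X_scaled.mean(axis=0))  # ~[0, 0]
print(X_scaled.std(axis=0))   # [1, 1]
```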
If you want you can support me: https://buymeacoffee.com/abhi83540
If you want such articles in your email inbox you can subscribe to my newsletter: https://abhishekkumarpandey.substack.com/
A few books on deep learning that I am reading: