Introduction to the Descriptive Statistics

Introduction to the Descriptive Statistics

Measures of Central Tendency

When there are extreme values in a dataset, the mean does not represent the total dataset very well.

Trimming the extreme values is a common technique in statistics and also in data science.

So, the median is not sensitive to extreme values.

If all the data appears only once in the data, there is no mode.

Measures of Variation

Image for post

Image for post

The more the values vary in the dataset, the larger the standard deviation.

Image for post

Source: Wikipedia

Five Number Summary

Image for post

IQR can be very useful in determining extreme values or outliers, we talked about in calculating mean and median.

Shape of Data

Image for post

For symmetrically shaped data, the mean and median are the same

Image for post

In left-skewed shape, the tail lies on the left side.

Image for post

In right-skewed shape, the tail lies on the right side

Conclusion

 

#statistics #datascience #dataanalysis 

Leave a Reply

Close Menu