{"id":1587,"date":"2019-03-20T05:18:13","date_gmt":"2019-03-20T05:18:13","guid":{"rendered":"http:\/\/kusuaks7\/?p=1192"},"modified":"2023-06-29T09:49:33","modified_gmt":"2023-06-29T09:49:33","slug":"data-types-in-statistics","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/ai-ml\/data-types-in-statistics\/","title":{"rendered":"Data Types in Statistics"},"content":{"rendered":"<p><big><strong>Data Types are an important concept of statistics, which needs to be understood, to correctly apply statistical measurements to your data and therefore to correctly conclude certain assumptions about it.<\/strong><\/big><\/p>\n<p style=\"text-align: center;\"><img decoding=\"async\" src=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-07-um-09-50-442.png?w=1400\" style=\"width: 700px; height: 330px;\" \/><\/p>\n<p>This blog post will introduce you to the different data types you need to know, to do proper exploratory data analysis (EDA) on your dataset, which is one of the most underestimated parts of a machine learning project.<\/p>\n<p><b>Table of Contents:<\/b><\/p>\n<ul>\n<li>Introduction to Data Types<\/li>\n<li>Categorical Data\n<ul>\n<li>Nominal<\/li>\n<li>Ordinal<\/li>\n<\/ul>\n<\/li>\n<li>Numerical Data\n<ul>\n<li>Discrete<\/li>\n<li>Continuous&nbsp;\n<ul>\n<li>Interval<\/li>\n<li>Ratio<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<li>Why Data Types are important?<\/li>\n<li>Statistical methods<\/li>\n<li>Summary<\/li>\n<\/ul>\n<h2>&nbsp;<\/h2>\n<h2><b>Introduction to Data Types<\/b><\/h2>\n<p>Having a good understanding of the different data types, also called measurement scales, is a crucial prerequisite for doing Exploratory Data Analysis (EDA), since you can use certain statistical measurements only for specific data types.<\/p>\n<p>You also need to know which data type you are dealing with to choose the right visualization method. Think of data types as a way to categorize different types of variables. We will discuss the main types of variables and look at an example for each. We will sometimes refer to them as measurement scales.<\/p>\n<h2><b>Categorical Data<\/b><\/h2>\n<p>Categorical data represents characteristics. Therefore it can represent things like a person&rsquo;s gender, language etc. Categorical data can also take on numerical values (Example: 1 for female and 0 for male). Note that those numbers don&rsquo;t have mathematical meaning.&nbsp;<\/p>\n<h3><b>Nominal Data<\/b><\/h3>\n<p>Nominal values represent discrete units and are used to label variables, that have no quantitative value. Just think of them as &bdquo;labels&ldquo;. Note that nominal data that has no order. Therefore if you would change the order of its values, the meaning would not change.&nbsp;You can see two examples of nominal features below:&nbsp;<\/p>\n<p style=\"text-align: center;\"><img fetchpriority=\"high\" decoding=\"async\" alt=\"Bildschirmfoto 2018-03-06 um 08.36.08.png\" data-attachment-id=\"450\" data-comments-opened=\"1\" data-image-description=\"\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Bildschirmfoto 2018-03-06 um 08.36.08\" data-large-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=458&amp;h=207?w=736\" data-medium-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=458&amp;h=207?w=300\" data-orig-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=458&amp;h=207\" data-orig-size=\"2976,1342\" data-permalink=\"https:\/\/machinelearning-blog.com\/2018\/03\/07\/data-types-in-statistics\/bildschirmfoto-2018-03-06-um-08-36-08\/\" height=\"207\" sizes=\"(max-width: 458px) 100vw, 458px\" src=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=458&amp;h=207\" srcset=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=458&amp;h=207 458w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=916&amp;h=414 916w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=150&amp;h=68 150w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=300&amp;h=135 300w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-36-08.png?w=768&amp;h=346 768w\" width=\"458\" \/><\/p>\n<p>The left feature that describes a persons gender would be called &bdquo;dichotomous&ldquo;, which is a type of nominal scales that contains only two categories.<\/p>\n<h3><b>Ordinal Data<\/b><\/h3>\n<p>Ordinal values represent discrete&nbsp;and ordered units. It is therefore nearly the same as nominal data, except that it&rsquo;s ordering matters.&nbsp;You can see an example below:<\/p>\n<p style=\"text-align: center;\"><img decoding=\"async\" alt=\"Bildschirmfoto 2018-03-06 um 08.42.01\" data-attachment-id=\"451\" data-comments-opened=\"1\" data-image-description=\"\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Bildschirmfoto 2018-03-06 um 08.42.01\" data-large-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=411&amp;h=225?w=736\" data-medium-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=411&amp;h=225?w=300\" data-orig-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=411&amp;h=225\" data-orig-size=\"1238,678\" data-permalink=\"https:\/\/machinelearning-blog.com\/2018\/03\/07\/data-types-in-statistics\/bildschirmfoto-2018-03-06-um-08-42-01\/\" height=\"225\" sizes=\"(max-width: 411px) 100vw, 411px\" src=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=411&amp;h=225\" srcset=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=411&amp;h=225 411w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=822&amp;h=450 822w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=150&amp;h=82 150w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=300&amp;h=164 300w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-08-42-01.png?w=768&amp;h=421 768w\" width=\"411\" \/><\/p>\n<p>Note that the difference between Elementary and High School is different than the difference between High School and College. This is the main limitation of ordinal data, the differences between the values is not really known. Because of that, ordinal scales are usually used to measure non-numeric features like happiness, customer satisfaction and so on.<\/p>\n<h2>Numerical Data<\/h2>\n<h3><b>1. Discrete Data<\/b><\/h3>\n<p>We speak of discrete data if its values are distinct and separate. In other words: We speak of discrete data if the data can only take on certain values. This type of data can&rsquo;t be measured but it can be counted. It basically represents information that can be categorized into a classification. An example is the number of heads in 100 coin flips.<\/p>\n<p>You can check by asking the following two questions whether you are dealing with discrete data or not:&nbsp;&nbsp;Can you count it and can it be divided up into smaller and smaller parts?&nbsp;On the contrary, if the data could be measured but not counted, we would speak of continuous data<\/p>\n<h3><b>2. Continuous Data<\/b><\/h3>\n<p>Continuous Data represents measurements and therefore their values can&rsquo;t be counted but they can be measured. An example would be the height of a person. You can only describe them by using intervals on the real number line.&nbsp;<\/p>\n<p><b>Interval Data<\/b><\/p>\n<p>Interval values represent ordered units that have the same difference. Therefore we speak of interval data when we have a variable that contains numeric values that are ordered and where we know the exact differences between the values.&nbsp;A good example would be a feature that contains temperature of a given place like you can see below:<\/p>\n<p style=\"text-align: center;\"><img decoding=\"async\" alt=\"Bildschirmfoto 2018-03-06 um 09.02.29.png\" data-attachment-id=\"452\" data-comments-opened=\"1\" data-image-description=\"\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Bildschirmfoto 2018-03-06 um 09.02.29\" data-large-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=572&amp;h=266?w=736\" data-medium-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=572&amp;h=266?w=300\" data-orig-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=572&amp;h=266\" data-orig-size=\"1984,922\" data-permalink=\"https:\/\/machinelearning-blog.com\/2018\/03\/07\/data-types-in-statistics\/bildschirmfoto-2018-03-06-um-09-02-29\/\" height=\"266\" sizes=\"(max-width: 572px) 100vw, 572px\" src=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=572&amp;h=266\" srcset=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=572&amp;h=266 572w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=1144&amp;h=532 1144w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=150&amp;h=70 150w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=300&amp;h=139 300w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=768&amp;h=357 768w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-02-29.png?w=1024&amp;h=476 1024w\" width=\"572\" \/><\/p>\n<p>The problem with interval values data is that they don&rsquo;t have a &bdquo;true zero&ldquo;. That means in regards to our example, that there is no such thing as no temperature. With interval data, we can add and subtract, but we cannot multiply, divide or calculate ratios. Because there is no true zero, a lot of&nbsp;<a href=\"http:\/\/www.mymarketresearchmethods.com\/descriptive-inferential-statistics-difference\/\" rel=\"noopener\">descriptive and inferential statistics<\/a>&nbsp;can&rsquo;t be applied.<\/p>\n<p><b>Ratio Data<\/b><\/p>\n<p>Ratio values are ordered units with intermediate values. Ratio values are the same as interval values, with the difference that they do have an absolute zero. Good examples are height, weight, length etc.<\/p>\n<p style=\"text-align: center;\"><img loading=\"lazy\" decoding=\"async\" alt=\"Bildschirmfoto 2018-03-06 um 09.09.10\" data-attachment-id=\"453\" data-comments-opened=\"1\" data-image-description=\"\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Bildschirmfoto 2018-03-06 um 09.09.10\" data-large-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=552&amp;h=203?w=736\" data-medium-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=552&amp;h=203?w=300\" data-orig-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=552&amp;h=203\" data-orig-size=\"1822,670\" data-permalink=\"https:\/\/machinelearning-blog.com\/2018\/03\/07\/data-types-in-statistics\/bildschirmfoto-2018-03-06-um-09-09-10\/\" height=\"203\" sizes=\"(max-width: 552px) 100vw, 552px\" src=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=552&amp;h=203\" srcset=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=552&amp;h=203 552w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=1104&amp;h=406 1104w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=150&amp;h=55 150w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=300&amp;h=110 300w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=768&amp;h=282 768w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-09-10.png?w=1024&amp;h=377 1024w\" width=\"552\" \/><\/p>\n<p>&nbsp;<\/p>\n<h2><b>Why Data Types are important?<\/b><\/h2>\n<p>Datatypes are an important concept because statistical methods can only be used with certain data types. You have to analyze continuous data differently than categorical data&nbsp;otherwise it would result in a wrong analysis. Therefore knowing the types of data you are dealing with, enables you to choose the correct method of analysis.<\/p>\n<p>We will now go over every data type again but this time in regards to what statistical methods can be applied. To understand properly what we will now discuss, you have to understand the basics of descriptive statistics. If you don&rsquo;t know them, you can read my blog post (9min read) about it:&nbsp;<a href=\"https:\/\/towardsdatascience.com\/intro-to-descriptive-statistics-252e9c464ac9\" rel=\"noopener\">https:\/\/towardsdatascience.com\/intro-to-descriptive-statistics-252e9c464ac9<\/a>. Other tools that aren&rsquo;t discussed in this blog post, will be explained here.<\/p>\n<p>&nbsp;<\/p>\n<h2><b>Statistical methods<\/b><\/h2>\n<h3><b>Nominal Data<\/b><\/h3>\n<p>When you are dealing with nominal data, you collect information through:<\/p>\n<p>Frequencies:&nbsp;The Frequency is the rate at which something occurs over a period of time or within a dataset.<\/p>\n<p>Proportion:&nbsp;You can easily calculate the proportion by dividing the frequency by the total number of events. (e.g how often something happened divided by how often it could happen)<\/p>\n<p>Percentage:&nbsp;I think this one doesn&rsquo;t need an explanation.<\/p>\n<p>Visualization Methods:&nbsp;To visualize nominal data you can use a pie chart or a bar chart.<\/p>\n<p style=\"text-align: center;\"><img loading=\"lazy\" decoding=\"async\" alt=\"Bildschirmfoto 2018-03-06 um 09.28.28.png\" data-attachment-id=\"454\" data-comments-opened=\"1\" data-image-description=\"\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Bildschirmfoto 2018-03-06 um 09.28.28\" data-large-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=543&amp;h=277?w=736\" data-medium-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=543&amp;h=277?w=300\" data-orig-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=543&amp;h=277\" data-orig-size=\"3322,1694\" data-permalink=\"https:\/\/machinelearning-blog.com\/2018\/03\/07\/data-types-in-statistics\/bildschirmfoto-2018-03-06-um-09-28-28\/\" height=\"277\" sizes=\"(max-width: 543px) 100vw, 543px\" src=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=543&amp;h=277\" srcset=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=543&amp;h=277 543w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=1086&amp;h=554 1086w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=150&amp;h=76 150w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=300&amp;h=153 300w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=768&amp;h=392 768w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-06-um-09-28-28.png?w=1024&amp;h=522 1024w\" width=\"543\" \/><\/p>\n<h3><b>Ordinal Data<\/b><\/h3>\n<p>When you are dealing with ordinal data, you can use the same methods like with nominal data, but you also have access to some additional tools.&nbsp;Therefore you can summarize your ordinal data with frequencies, proportions, percentages. And you can visualize it&nbsp;with pie and bar charts.&nbsp;Additionally, you can use percentiles, median, mode and the interquartile range to summarize your data.<\/p>\n<h3><b>Continuous Data<\/b><\/h3>\n<p>When you are dealing with continuous data, you can use the most methods to describe your data.&nbsp;You can summarize your data using percentiles, median, interquartile range, mean, mode, median, standard deviation, and range.<\/p>\n<p>Visualization Methods:<\/p>\n<p>To visualize continuous data, you can use a histogram or a box-plot. With a histogram, you can check the central tendency, variability, modality, and kurtosis of a distribution. Note that a histogram can&rsquo;t show you if you have any outliers. This is why we also use box-plots.<\/p>\n<p style=\"text-align: center;\"><img decoding=\"async\" alt=\"Bildschirmfoto 2018-03-19 um 08.01.07.png\" data-attachment-id=\"508\" data-comments-opened=\"1\" data-image-description=\"\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Bildschirmfoto 2018-03-19 um 08.01.07\" data-large-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=736?w=736\" data-medium-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=736?w=300\" data-orig-file=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=736\" data-orig-size=\"2862,1286\" data-permalink=\"https:\/\/machinelearning-blog.com\/2018\/03\/07\/data-types-in-statistics\/bildschirmfoto-2018-03-19-um-08-01-07\/\" sizes=\"(max-width: 736px) 100vw, 736px\" src=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=736\" srcset=\"https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=736 736w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=1472 1472w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=150 150w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=300 300w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=768 768w, https:\/\/machinelearningblogcom.files.wordpress.com\/2018\/03\/bildschirmfoto-2018-03-19-um-08-01-07.png?w=1024 1024w\" \/><\/p>\n<p>&nbsp;<\/p>\n<h2><b>Summary<\/b><\/h2>\n<p>In this post, you discovered the different data types that are used throughout statistics. You learned the difference between discrete &amp; continuous data and learned what nominal, ordinal interval and ratio measurement scales are. Furthermore, you now know what statistical measurements you can use at which datatype and which are the right visualization methods. This enables you to create a big part of an exploratory analysis on a given dataset<\/p>\n<p>&nbsp;<\/p>\n<h2><b>Resources<\/b><\/h2>\n<ul>\n<li><a href=\"https:\/\/en.wikipedia.org\/wiki\/Statistical_data_type\" rel=\"noopener\">https:\/\/en.wikipedia.org\/wiki\/Statistical_data_type<\/a><\/li>\n<li><a href=\"https:\/\/www.youtube.com\/watch?v=hZxnzfnt5v8\" rel=\"noopener\">https:\/\/www.youtube.com\/watch?v=hZxnzfnt5v8<\/a><\/li>\n<li><a href=\"http:\/\/www.dummies.com\/education\/math\/statistics\/types-of-statistical-data-numerical-categorical-and-ordinal\/\" rel=\"noopener\">http:\/\/www.dummies.com\/education\/math\/statistics\/types-of-statistical-data-numerical-categorical-and-ordinal\/<\/a><\/li>\n<li><a href=\"https:\/\/www.isixsigma.com\/dictionary\/discrete-data\/\" rel=\"nofollow noopener\">https:\/\/www.isixsigma.com\/dictionary\/discrete-data\/<\/a><\/li>\n<li><a href=\"https:\/\/www.youtube.com\/watch?v=zHcQPKP6NpM&amp;t=247s\" rel=\"noopener\">https:\/\/www.youtube.com\/watch?v=zHcQPKP6NpM&amp;t=247s<\/a><\/li>\n<li><a href=\"http:\/\/www.mymarketresearchmethods.com\/types-of-data-nominal-ordinal-interval-ratio\/\" rel=\"noopener\">http:\/\/www.mymarketresearchmethods.com\/types-of-data-nominal-ordinal-interval-ratio\/<\/a><\/li>\n<li><a href=\"https:\/\/study.com\/academy\/lesson\/what-is-discrete-data-in-math-definition-examples.html\" rel=\"nofollow noopener\">https:\/\/study.com\/academy\/lesson\/what-is-discrete-data-in-math-definition-examples.html<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Data Types are an important concept of statistics, which needs to be understood, to correctly apply statistical measurements to your data and therefore to correctly conclude certain assumptions about it. In this post, discover the different data types that are used throughout statistics. Learn the difference between discrete &amp; continuous data and learn what nominal, ordinal interval and ratio measurement scales are. &nbsp;Know what statistical measurements you can use at which datatype and which are the right visualization methods. This enables you to create a big part of an exploratory analysis on a given dataset<\/p>\n","protected":false},"author":413,"featured_media":4200,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[183],"tags":[92],"ppma_author":[2327],"class_list":["post-1587","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-ml","tag-machine-learning"],"authors":[{"term_id":2327,"user_id":413,"is_guest":0,"slug":"niklas-donges","display_name":"Niklas Donges","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/?s=96&d=mm&r=g","user_url":"","last_name":"Donges","first_name":"Niklas","job_title":"","description":"<a href=\"https:\/\/www.linkedin.com\/in\/niklas-donges\/\">Niklas Donges<\/a>&nbsp;is Machine Learning Engineer at SAP. He is a Technical Blogger for the &#039;Towards Data Science&#039; publication"}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1587","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/413"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=1587"}],"version-history":[{"count":2,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1587\/revisions"}],"predecessor-version":[{"id":28949,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/1587\/revisions\/28949"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/4200"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=1587"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=1587"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=1587"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=1587"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}