Calculate the following:
• mean
• standard deviation
• skew
• 5-number summary
• interquartile range (IQR) for each of the variables
Create a box-plot for the AnnualSales variable and answer the following questions:
• Does it look symmetric?
• Would you prefer the IQR instead of the standard deviation to describe this variable’s dispersion? Why or why not?
Create a histogram for the Sales/SqFt variable and answer the following questions:
• Is the distribution symmetric? If not, what is the skew?
• Are there any outliers? If so, which one(s)?
• What is the SqFt area of the outlier(s)? Is the outlier(s) smaller or larger than the average restaurant in the database? What can you conclude from this
observation?
• What measure of central tendency is more appropriate to describe Sales/SqFt? Why?