Instructions on How to Use The Statistics Spreadsheet

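This app is designed to calculate all of the statistics that are encountered in a standard elementary statistics class.  It includes one variable statistics, confidence intervals and hypothesis tests, regression analysis, Chi-Squared tests, ANOVA, and probability computations, each described in its own section of these instructions.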

Starting Out

The first time you use this, you will need to go to the link:
     https://docs.google.com/spreadsheets/d/1zacJap2OThc0D9s_fAGon-sHtaqPbHczqGrPRp9ZDLc/edit?usp=sharing
Then click on File -> Make a Copy.  Give it a name and click OK.  The first time you try to do anything with your copy, you will have to click to allow the program to run.

Please email drLarryGreen@gmail.com if you find any errors that need to be fixed or have any suggestions for additions and improvements.

One Variable Statistics

This is the DescStats tab.  When you first open the tab, you will see sample data in column "A".  Highlight this sample data and hit the backspace to clear it out.  Enter your own data in column "A" and the spreadsheet will calculate the mean, sample standard deviation, minimum, first quartile (Q1), median, third quartile (Q3), and maximum.  Below these will be several other less commonly used statistics.  You will also see a histogram of the data.

If you go to the menu item Stats -> BoxPlot, the corresponding box plot will be generated.  The default for the box plot is to include the outliers, but you can change that by changing the "Yes" in the D11 cell to "No".  The box plot will then be created with the outliers removed first.  A value is an outlier if it lies at least 1.5 times the IQR above Q3 or below Q1.
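
If you want to check by hand which values count as outliers, the 1.5 times IQR rule above can be sketched in Python.  This is only an illustration and not the spreadsheet's own code.  The function name find_outliers is made up for this example, and the quartiles use Python's "inclusive" method, which is an assumption and may not match the quartile convention the spreadsheet uses.

```python
import statistics

def find_outliers(data, k=1.5):
    """Return (lower_fence, upper_fence, outliers) using the k * IQR rule."""
    # Assumption: "inclusive" quartiles; the spreadsheet may compute
    # Q1 and Q3 with a different convention.
    q1, _median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - k * iqr
    upper_fence = q3 + k * iqr
    # "At least" 1.5 * IQR away, so values exactly on a fence count too.
    outliers = [x for x in data if x <= lower_fence or x >= upper_fence]
    return lower_fence, upper_fence, outliers

sample = [2, 4, 5, 6, 7, 8, 9, 30]
lower, upper, outliers = find_outliers(sample)
print(lower, upper, outliers)  # -0.5 13.5 [30]
```

Here Q1 = 4.75 and Q3 = 8.25, so the IQR is 3.5 and only the value 30 falls outside the fences.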

If you go to the menu item Stats -> Stem and Leaf, a stem and leaf diagram will be created.  The default is for the stems to represent the tens (10^1) digit and the leaves to represent the ones digit.  You can change this by changing the D10 cell.  For example, if you want the stems to represent the ones digit and the leaves to represent the tenths digit, change the D10 cell to "0".
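
To see how the stem setting changes the diagram, here is a minimal Python sketch of a stem and leaf grouping.  It is an illustration only and not the spreadsheet's code.  The parameter stem_power is a made-up name that plays the role of the D10 cell (1 means the stems are the tens digit, 0 means the stems are the ones digit).  The sketch assumes non-negative data and rounds each value to the nearest leaf unit.

```python
from collections import defaultdict

def stem_and_leaf(data, stem_power=1):
    """Map each stem to its sorted list of leaf digits."""
    # Place value of the leaf digit: one power of ten below the stem.
    leaf_unit = 10 ** (stem_power - 1)
    rows = defaultdict(list)
    for x in sorted(data):
        # Assumption: non-negative data, rounded to the nearest leaf unit.
        scaled = int(round(x / leaf_unit))
        stem, leaf = divmod(scaled, 10)
        rows[stem].append(leaf)
    return dict(rows)

tens_plot = stem_and_leaf([12, 15, 21, 27, 27, 34])          # stems are tens
ones_plot = stem_and_leaf([1.2, 1.5, 2.3], stem_power=0)     # stems are ones

for stem, leaves in tens_plot.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
```

Stems with no data are simply left out in this sketch, which a full diagram would normally show as empty rows.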

Video on How to Use the Program to Calculate One Variable Statistics

Confidence Intervals and Hypothesis Tests for a Difference

This is the 2VarStats tab.  When you first open the tab, sample data will be in the "First" and "Second" columns.  Highlight this data and hit the backspace to clear it out.  Then enter the data for the first variable under the "First" column starting on the cell A2 and the data for the second variable under the "Second" column starting on the cell B2.  The computer will generate statistics, confidence intervals, and hypothesis tests for each individual data set, for the difference between the two variables treating them as independent data sets, and for the difference between the variables treating them as dependent (paired) samples.  Also, if you put the values into the "First" column and the frequencies or probabilities into the "Second" column, then the "Stats as Frequency Table or Expected Value" output on the far right will show the statistics treating the columns as either a frequency table or a probability distribution table.
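
The difference between treating the two columns as paired and treating them as independent can be sketched in Python as follows.  This is an illustration under stated assumptions, not the spreadsheet's code.  All function names are made up for this example, and the independent test statistic uses the unpooled (Welch) standard error, which may or may not be the version the spreadsheet reports.  The last function shows the frequency table or expected value mean.

```python
import math
import statistics

def paired_t(first, second):
    """t statistic for dependent (paired) samples: test the differences."""
    diffs = [a - b for a, b in zip(first, second)]
    standard_error = statistics.stdev(diffs) / math.sqrt(len(diffs))
    return statistics.mean(diffs) / standard_error

def independent_t(first, second):
    """t statistic for independent samples (assumption: unpooled variances)."""
    standard_error = math.sqrt(
        statistics.variance(first) / len(first)
        + statistics.variance(second) / len(second)
    )
    return (statistics.mean(first) - statistics.mean(second)) / standard_error

def weighted_mean(values, weights):
    """Mean of a frequency table, or expected value if weights are probabilities."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

first = [10, 12, 14, 16]
second = [9, 10, 13, 14]
t_paired = paired_t(first, second)
t_independent = independent_t(first, second)
table_mean = weighted_mean([1, 2, 3], [2, 5, 3])
print(round(t_paired, 3), round(t_independent, 3), table_mean)
```

With this data the paired statistic is much larger than the independent one, because pairing removes the variation between subjects.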

Video on How to Use the Program for Differences

Regression Analysis

This is the Regression tab.  When you first open the tab, sample data will be in the "x" and "y" columns.  Highlight this data and hit the backspace to clear it out.  Then enter the data for the first variable under the "x" column starting on the cell A2 and the data for the second variable under the "y" column starting on the cell B2.  The computer will display the full regression analysis including the correlation, r^2, s, the equation of the regression line, the p-Values for left, right, and two-tailed hypothesis tests, a prediction for y given x (type in a value for x to find the predicted value for y), and the scatterplot with the regression line displayed on it.
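
The core of the regression output (the line, the correlation, r^2, and a prediction) can be sketched in Python with the usual least squares formulas.  This is only an illustration with made-up function names.  It leaves out s and the p-Values, which need more than this short example covers.

```python
import math

def linear_regression(xs, ys):
    """Return (slope, intercept, r) for the least squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r

def predict(x, slope, intercept):
    """Predicted y for a given x on the regression line."""
    return intercept + slope * x

xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
slope, intercept, r = linear_regression(xs, ys)
r_squared = r ** 2
predicted = predict(6, slope, intercept)
print(round(slope, 3), round(intercept, 3), round(r, 4), round(predicted, 3))
```

For this data the line is y = 2.2 + 0.6x, so the prediction for x = 6 is 5.8.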

Video on How to Use the Program to Perform Regression Analysis

Chi-Squared Tests

This is the Chi Square tab.  You can select "Goodness of Fit", "Homogeneity", or "Independence" by changing the item on the B1 cell.  Then clear out the data that is in columns E, F, and G.  Then put in your own data.  If you are conducting a Goodness of Fit test, put in the observed values in column E and the expected values in column F.  For Homogeneity and Independence, you can use up to 22 columns.  The computation is taxing on the computer, so be patient and wait a few seconds while the computer gives the message "loading", and then you will see the test statistic, Chi Square, and the p-Value.
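
The test statistic for these tests can be sketched in Python as follows.  This is an illustration with made-up function names, not the spreadsheet's code.  It assumes no continuity correction, and it stops at the test statistic because the p-Value needs a Chi-Square distribution function.

```python
def chi_square_goodness_of_fit(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def expected_counts(table):
    """Expected counts for a two-way table: row total * column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

def chi_square_two_way(table):
    """Statistic for Homogeneity or Independence (same arithmetic for both)."""
    expected = expected_counts(table)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )

gof_stat = chi_square_goodness_of_fit([18, 22, 20], [20, 20, 20])
table = [[10, 20], [20, 10]]
two_way_stat = chi_square_two_way(table)
# Degrees of freedom for a two-way table: (rows - 1) * (columns - 1).
degrees_of_freedom = (len(table) - 1) * (len(table[0]) - 1)
print(round(gof_stat, 4), round(two_way_stat, 4), degrees_of_freedom)
```

In the two-way example every expected count is 15, so each of the four cells contributes 25/15 to the statistic.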

Video on How to Use the Program to Perform a Goodness of Fit Test

Video on How to Use the Program to Perform a Test for Homogeneity

Video on How to Use the Program to Perform a Test for Independence

ANOVA

This is the ANOVA tab.  First clear out the data in columns C, D, E, and F and then enter your data in columns C, D, E, F, ....  You can use up to 25 columns.  The computer will find the test statistic, F, and the p-Value.  It will also compute other statistics that you will most likely be interested in.
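
The F test statistic for a one-way ANOVA can be sketched in Python as follows, with each inner list standing for one column of data.  This is an illustration with a made-up function name, not the spreadsheet's code.  The p-Value is left out because it needs an F distribution function.

```python
import statistics

def anova_f(groups):
    """One-way ANOVA F statistic: mean square between / mean square within."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [statistics.mean(g) for g in groups]
    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Variation of each value around its own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between / ms_within

groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat = anova_f(groups)
print(round(f_stat, 4))  # 13.0
```

Here the sum of squares between groups is 26 with 2 degrees of freedom and the sum of squares within groups is 6 with 6 degrees of freedom, giving F = 13.  The groups do not need to have the same number of values.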

Video on How to Use the Program to Perform an ANOVA Test

Confidence Intervals and Hypothesis Tests from Statistics

Use the "Dataless" tab if you have statistics such as the mean and standard deviation and want to compute a confidence interval or conduct a hypothesis test.  Also use this tab for confidence intervals and hypothesis tests for a proportion or a difference between proportions.  Choose the type you want and enter the information.  You will need to first clear out the sample information that is there.
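
As one example of working from summary statistics alone, a confidence interval and hypothesis test for a single proportion can be sketched in Python.  This is an illustration with made-up function names.  It assumes the standard normal approximation, which may differ in detail from what the spreadsheet does.

```python
import math
from statistics import NormalDist

def proportion_confidence_interval(successes, n, confidence=0.95):
    """Normal-approximation confidence interval for a proportion."""
    p_hat = successes / n
    z_star = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z_star * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin

def proportion_z_test(successes, n, p0):
    """Return (z, two_tailed_p_value) for H0: p = p0."""
    p_hat = successes / n
    z = (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

low, high = proportion_confidence_interval(60, 100)
z, p_value = proportion_z_test(60, 100, 0.5)
print(round(low, 3), round(high, 3), round(z, 2), round(p_value, 4))
```

With 60 successes in 100 trials, the 95% interval runs from about 0.504 to about 0.696, and the test against p = 0.5 gives z = 2 with a two-tailed p-Value of about 0.0455.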

Video on How to Use the Program to Perform Hypothesis Tests and Find Confidence Intervals When Statistics are Provided

Probability Computations

Use the "Compute" tab to compute probabilities from the binomial, normal, t, Chi-Square, or F distribution.
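
For comparison, binomial and normal probabilities can be sketched with Python's standard library.  This is an illustration with made-up function names.  The t, Chi-Square, and F distributions are not covered here because the standard library has no functions for them.

```python
import math
from statistics import NormalDist

def binomial_pmf(k, n, p):
    """P(X = k) for a binomial distribution with n trials and success probability p."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)

def binomial_cdf(k, n, p):
    """P(X <= k), found by adding up the individual probabilities."""
    return sum(binomial_pmf(i, n, p) for i in range(k + 1))

def normal_between(a, b, mean=0.0, sd=1.0):
    """P(a < X < b) for a normal distribution."""
    dist = NormalDist(mean, sd)
    return dist.cdf(b) - dist.cdf(a)

exactly_three = binomial_pmf(3, 10, 0.5)
at_most_three = binomial_cdf(3, 10, 0.5)
within_one_sd = normal_between(-1, 1)
print(exactly_three, at_most_three, round(within_one_sd, 4))
```

For 10 trials with p = 0.5, the probability of exactly 3 successes is 120/1024 and the probability of at most 3 successes is 176/1024.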

Video on How to Use the Program to Calculate Probabilities

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License