Binomial test and 95% confidence interval (CI) using SPSS Statistics

Introduction

The binomial test, also known as the one-sample proportion test or test of one proportion, can be used to determine whether the proportion of cases (e.g., "patients", "potential customers", "houses", "coins") in one of only two possible categories (e.g., patients at "high" or "low" risk of heart disease, potential customers who "likely" or "not likely" to purchase, houses with "subsidence" or "no evidence of subsidence", the "heads" or "tails" showing after a coin is thrown) is equal to a pre-specified proportion (e.g., a proportion of 0.17 of patients having a low risk of heart disease).

This pre-specified proportion can be either: (a) a hypothesised value (e.g., 0.5), selected for theoretical reasons, for example (e.g., there is theoretically an "equal chance" of either category being selected, such as a "heads" or "tails" when a coin is tossed); or (b) a known value, based on current knowledge, for example (e.g., 10% of patients, which is 0.1 as a proportion, were previously diagnosed as being at "high" risk of heart disease). In addition to the binomial test, a corresponding 95% confidence interval (CI) can be calculated, such as the exact Clopper-Pearson 95% CI.

In the two tabs below, we include one example to demonstrate when the pre-specified proportion is a hypothesised value and another example to demonstrate when the pre-specified proportion is a known value.

A HYPOTHESISED VALUE (e.g., selected for "theoretical reasons")
A KNOWN VALUE (e.g., selected based on "current knowledge")

A HYPOTHESISED VALUE
(e.g., selected for "theoretical reasons")

You can use a binomial test and corresponding 95% confidence interval (CI) to determine whether there is a preference for one of two options/categories, based on a hypothesised value.

For example, a restaurant is launching a new menu, which will include adding a "bread and butter pudding" to the dessert menu. The chef creates two versions – one based on a traditional recipe and another on a more modern interpretation – with the chef preferring to serve the more modern interpretation. However, the restaurant would like to know which recipe its diners (i.e., customers) would like to see added to the menu: "traditional" or "modern".

In order to decide, the restaurant randomly selects 30 diners who try the bread and butter pudding that uses the "traditional" recipe and another bread and butter pudding that uses the "modern" recipe. The 30 diners, who form the sample for this study, are asked to indicate their preference (i.e., "modern" or "traditional").

The restaurant will calculate the proportion of diners who prefer the "modern" recipe. A preference for the modern recipe would mean that more than half of the diners would prefer it. In other words, the proportion preferring the modern recipe would be greater than 0.5 (or more than 50%, when expressed as a percentage). This is the hypothesised value, with the restaurant not knowing which recipe diners will prefer.

The restaurant only has access to one sample of 30 diners, so it uses the binomial test to make an inference from the sample of 30 diners to all diners who would eat at their restaurant (i.e., where "all diners" represents the population that the restaurant is interested in).

After collating the preferences from its 30 diners, it is shown that 78% of diners prefer the "modern" recipe. However, since this is an estimate and there is uncertainty when making inferences, the restaurant also calculates a corresponding 95% confidence interval (CI). This suggests that the percentage of all diners in the population who would prefer the "modern" recipe could plausibly be as low as 63% and as high as 85%.

On the evidence presented by the binomial test and 95% CI, the restaurant decides to serve the "modern" bread and butter pudding recipe, which it believes that diners as a whole will prefer.

A KNOWN VALUE
(e.g., selected based on "current knowledge")

You can use a binomial test and corresponding 95% confidence interval (CI) to whether the proportion of one thing/category is greater than another thing/category, based on a known value.

For example, a researcher wants to determine whether a new drug to treat a specific illness is more effective than the existing drug that is being prescribed to patients. For the purpose of this research, a drug is considered to be a "success" if patients are "symptom-free" after taking the drug. The existing drug has a success rate of 72% in the population (i.e., all patients with this specific illness). Therefore, 72% is the known value against which the effectiveness of the new drug will be evaluated.

To investigate whether the new drug is more effective than the existing drug, a random sample of 80 patients are given the new drug. The symptoms of the 80 patients, who form the sample for this study, are evaluated after taking the new drug. If the patients are "symptom-free", the new drug is considered to be a "success", but if they are "not" symptom-free, the new drug is considered to be a "failure".

The researcher will calculate the proportion of patients who are "symptom-free" after taking the new drug and compare this to the effectiveness of the existing drug, where 72% of all patients are symptom-free after taking the drug (i.e., .72 of all patients as a proportion).

The researcher only has access to one sample of 80 patients, so he uses the binomial test to make an inference from the sample of 80 patients to all patients who have this specific illness (i.e., where "all patients" represents the population that the researcher is interested in).

After taking the new drug, it is found that 60 out of the 80 patients are symptom-free. In other words, 75% of the 80 patients are successfully treated by the new drug (i.e., (60 ÷ 80) x 100 = 75%). This is a greater success rate than the existing drug, which has a success rate of 72% in the population. However, since this is an estimate and there is uncertainty when making inferences, the researcher also calculates a corresponding 95% confidence interval (CI). This suggests that the percentage of all patients in the population who are symptom-free after taking the new drug could plausibly be as low as 68% and as high as 79%.

On the evidence presented by the binomial test and 95% CI, the researcher considers there to be insufficient evidence that the new drug would be more effective than the existing drug in the population of patients with this specific illness. After all, whilst it appears that the success rate of the new drug is greater than 72%, it could plausibly be as low as 68% (i.e., less successful than the existing drug).

In this introductory guide to the binomial test and corresponding 95% confidence interval (CI), we first set out the basic requirements and assumptions of the the binomial test and corresponding 95% CI, which your study design must meet. Making sure that your study design meets these assumptions is critical because if it does not, the binomial test and corresponding 95% CI is likely to be the incorrect. In the second section, we set out the example we use to illustrate how to carry out a binomial test and corresponding 95% CI using SPSS Statistics. Of the two types of example that we set out above, we demonstrate how a binomial test and corresponding 95% CI can be used to determine whether there is a preference for one of two options/categories, based on a hypothesised value. In other words, if one option/category is preferred over another option/category, it will have a proportion greater than 0.5. In the Data Setup section that follows, we show how to set up your data in the Variable View and Data View of SPSS Statistics to carry out these analyses. Next, we set out the simple 10-step procedure in SPSS Statistics to carry out a binomial test and corresponding 95% CI in the Procedure section. Since there are different types of binomial test and corresponding 95% CI that can be used, with the choice of analysis based on a range of factors (e.g., Newcombe, 2013), we demonstrate the use of the exact binomial test and corresponding exact Clopper-Pearson 95% CI (Clopper and Pearson, 1934), which can be carried out in SPSS Statistics. Next, we explain how to interpret the main results of the binomial test and corresponding 95% CI, where you will determine whether there is a preference for one of two options/categories, based on a hypothesised value. In the final section, Reporting, we explain the information you should include when reporting your results. A short Bibliography section is included at the end for further reading. Therefore, to continue with this introductory guide, go to the next section.

SPSS Statistics

Basic requirements and assumptions of the binomial test and corresponding 95% confidence interval (CI)

The first step before analysing your data using a binomial test is to check whether it is appropriate to use this statistical test. After all, the binomial test will only give you valid/accurate results if your study design meets five assumptions that underpin a binomial test. If any of these five assumptions are not met, you cannot use a binomial test, but you may be able to use another statistical test instead. Therefore, before carrying out a binomial test, you need to check that your study design meets the following five assumptions:

Assumption #1: You have a dichotomous response variable (also referred to as a binary variable). A dichotomous variable can be nominal or ordinal. Examples of dichotomous variables that are nominal include gender (male or female), ethnicity (African American or Hispanic), transport type (bus or car), and degree type (undergraduate or postgraduate). Alternatively, examples of dichotomous variables that are ordinal include exam result (pass or fail), BMI (normal or obese), exercise intensity (low or high), and history of heart disease (Yes or No).
Assumption #2: The outcome of each trial can be specified as a success or a failure. The category for which you are determining the proportion is called the “success" category whilst the other category is called the “failure" category. Sometimes the idea of a “success" category and a “failure" category will make sense for the dichotomous variable in your study (i.e., the success category, "patient got better" versus the failure category, "patient got worse"), but at other times it will not (e.g., "chocolate ice cream" versus "strawberry ice cream"). The point is that you have to pick a category to act as the "success" category in order to calculate a proportion, even if neither category readily stands out as a “success".
Assumption #3: The probability of a success, denoted by p, remains constant from trial to trial. For example, a respondent can choose between two options in a series of trials, but they cannot have three options in some trials and only two options in other trials.
Assumption #4: The n trials are independent. This means that one trial cannot affect the result of another trial. For example, getting feedback from each trial might influence the following trials.
Assumption #5: The sample you have collected is representative of the population (e.g., you have used simple random sampling).

When you are confident that your data has met all five assumptions described above, you can analyse your data using a binomial test. In the sections that follow we show you how to do this using SPSS Statistics, based on the example we set out in the next section: Example used in this guide.

Book	Agresti, A. (2019). An introduction to categorical data analysis (3rd ed.). Hoboken, NJ: Wiley.
Journal Article	Clopper, C., & Pearson, E. S. (1934). The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika, 26(4), 404-413. https://doi.org/10.1093/biomet/26.4.404
Journal Article	Cummings, G., & Finch, S. (2005). Inference by eye: confidence intervals and how to read pictures of data. American Psychologist, 60(2), 170-180. https://doi.org/10.1037/0003-066X.60.2.170
Book	Newcombe, R. G. (2013). Confidence intervals for proportions and related measures of effect size. Boca Raton, FL: CRC Press.
Journal Article	Norman, G. R., & Steiner, D. L. (2012). Do CIs give you confidence? Chest, 141(1), 17-19. https://doi.org/10.1378/chest.11-2193

Binomial test and 95% confidence interval (CI) using SPSS Statistics

Introduction

SPSS Statistics

Basic requirements and assumptions of the binomial test and corresponding 95% confidence interval (CI)

SPSS Statistics

Example used in this guide

SPSS Statistics

Data setup in SPSS Statistics when carrying out a binomial test and corresponding 95% confidence interval (CI)

The "Variable View" in SPSS Statistics

The Data View in SPSS Statistics

SPSS Statistics

SPSS Statistics procedure to carry out a binomial test and corresponding 95% confidence interval (CI)

SPSS Statistics

Interpreting the results of a binomial test analysis and corresponding 95% confidence interval (CI)

The "Hypothesis Test Summary" table

The "One-Sample Binomial Test" table

The "Confidence Interval Summary" table

SPSS Statistics

Reporting the results from a binomial test analysis and corresponding 95% confidence interval (CI)

SPSS Statistics

Short Bibliography

SPSS Statistics

Reference this article