Statistical Considerations for Performing Multiple Tests in a Single Experiment. 2. Comparisons Among Several Therapies

doi:10.1016/S0025-6196(12)62363-5

Mayo Clinic Proceedings

Volume 63, Issue 8, August 1988, Pages 816-820

https://doi.org/10.1016/S0025-6196(12)62363-5 Get rights and content

Section snippets

Overall Preliminary Test.

With use of an overall preliminary test, a null hypothesis is established specifying that no difference exists among any of the groups. If 10 drugs are being studied, for example, a test statistic (referred to as F, analogous to t in the t test for comparing two therapies) is derived by dividing a measure of the variability among the 10 group means by a measure of the variability expected by chance. One obtains the corresponding P value from suitable tables (the larger the F statistic, the

EXAMPLE

We will consider a double-blind crossover study in which nine marketed analgesics and a placebo were evaluated in 57 patients with definite pain problems as a result of unresectable cancer.⁴ Although all possible pairwise comparisons were of interest, only comparisons with aspirin and placebo were reported. Thus, the total number of comparisons was 17. If no difference in efficacy was detected among any of the preparations, separate paired t tests performed at the 0.05 level would probably show

ALTERNATIVE EXAMPLE

Notice that the purpose of the study in the example determined the precise formulation of the study questions and the corresponding data analysis used to answer them. Suppose, however, that the same study had been performed for a different purpose and had addressed somewhat different questions. Specifically, suppose that the manufacturer of propoxyphene (Darvon) had performed this study and that the specific aims of the study had been to evaluate propoxyphene (1) relative to aspirin, (2)

COMMENTS

At this point, readers may ask which of the many methods of analysis that have been described is the best. Perhaps the most important point to be made is that no one method is always the “best” method.

The questions that a study is intended to answer must be clearly stated beforehand. Failure to consider this basic principle often may lead to an overreliance on per-experiment error rates. As an illustration, suppose one investigator conducts an experiment to answer the following question: “Are

ACKNOWLEDGMENT

We thank Charles G. Moertel, M.D., and his colleagues for the use of their data on pain relievers.

First page preview

Click to open first page preview

View PDF

REFERENCES (7)

TA Bancroft
CW Dunnett
A multiple comparison procedure for comparing several treatments with a control
J Am Stat Assoc
(1955)
CW Dunnett
New tables for multiple comparisons with a control
Biometrics
(1964)

There are more references available in the full text version of this article.

Cited by (37)

Quality of life in young patients with chronic myelocytic leukaemia during intensive treatment including interferon
1997, Leukemia Research
The aim of this study was to evaluate to what extent the quality of life (QOL) of young patients with chronic myelocytic leukemia (CML) was affected by treatment with interferon (IF) and intensive chemotherapy. In a main study performed by The Swedish CML Group, aiming at reduction of the malignant pH⁺ cell clone by treatment with hydroxyurea and IF followed by ABMT, QOL was evaluated with VAS scales and the Life Ingredient Profile in 44% of the patients. The intensive treatment did not lead to intolerable suffering or protracted reduction in QOL. However, 80% of the patients were on sick leave during the first year of treatment.
An introduction to the use of interim data analyses in clinical trials
1993, Annals of Emergency Medicine
During a controlled clinical trial, data accumulate that contain information on the relative efficacy of the two treatments, yet these data often are not inspected or analyzed until the planned sample size has been reached and the trial has been terminated. This type of fixed-sample-size trial design, with a single terminal data analysis, has the ethical disadvantage that more patients than are necessary to obtain a reliable result may be randomized to the less efficacious treatment, whichever one that turns out to be. Thus, it often is desirable to schedule one or more analyses of the data, to be conducted before planned termination, to see if a reliable conclusion may be drawn from the data and the trial terminated early. Such analyses are called interim analyses. When interim analyses occur after each of several relatively large groups of patients, the trial is called a group-sequential trial. Interim data analyses must be planned in advance to avoid increasing the risk of committing a Type I error and to achieve adequate power. This article introduces the statistical issues involved in the planning of interim data analyses and the design of group-sequential clinical trials.
Subepithelial collagen table thickness in colon specimens from patients with microscopic colitis and collagenous colitis
1992, Gastroenterology
Microscopic colitis and collagenous colitis are similar conditions that are differentiated by the presence or absence of subepithelial collagen table thickening. To better understand the relationship between these two disorders and the role of collagen table thickening in the pathogenesis of diarrhea, colonic mucosal biopsy specimens from 24 patients with microscopic or collagenous colitis and 9 control subjects were analyzed using a computerassisted morphometric method to evaluate the average thickness of the subepithelial collagen table. The collagen table thickness in colitis patients taken together formed a multimodal rather than a unimodal distribution. There was no tendency for collagen table thickening to increase with age or with duration of symptoms. In general, the types and distribution of inflammatory cells were similar in patients with normal and thickened collagen tables. Stool weight correlated with lamina propria cellularity but not with collagen table thickening. The multimodal distribution of collagen table thickening and the lack of correlation with age, duration of symptoms, or inflammation suggest that microscopic colitis and collagenous colitis are discrete conditions, although the inflammatory changes in the two conditions are similar. Moreover, because stool weight correlates with lamina propria cellularity but not with collagen table thickening, diarrhea probably is caused by the inflammatory changes and not by collagen table thickening per se.
Probucol reduces plasma lipid peroxides in man
1992, Atherosclerosis
Although primarily used as a lipid lowering drug, probucol also possesses anti-oxidant activity and has been shown in animal models to inhibit or delay the progression of atherosclerosis. It has been suggested that this anti-atherosclerosic effect may occur through inhibition of free radical oxidation of low density lipoprotein. The aim of this study was to investigate the effects of probucol on free radical activity in hyperlipidaemic patients. Plasma lipid peroxides were measured before probucol treatment, at 4 and 12 weeks treatment and then 4 weeks after stopping probucol. Lipid peroxide concentrations were significantly reduced during and 4 weeks after stopping treatment with probucol, when compared with baseline values. There were no changes in plasma vitamin E concentrations. The results of this study indicate that probucol reduces lipid peroxidation in patients, an effect which may occur through a free radical scavenging action.
Statistical concepts and methods for the reader of clinical studies in emergency medicine
1991, Journal of Emergency Medicine
An understanding of statistical concepts and methods is essential for the clinician who wishes to interpret the results of clinical studies. In this article the concepts of descriptive statistics, classical hypothesis testing, P values, a priori information, and Type I and Type II errors are discussed with examples to illustrate their application to the interpretation of clinical trials. In addition, descriptions of Student's t test, the chi-squared test, Fisher's exact test, the rank sum test, and sequential methods are given.
Statistical concepts for research in emergency medical services
2021, Emergency Medical Services: Clinical Practice and Systems Oversight: Third Edition

View all citing articles on Scopus

: Individual reprints of this article are not available. The entire six-part series will be available for purchase as a bound booklet from the Proceedings Circulation Office in December.

View full text

Statistical Considerations for Performing Multiple Tests in a Single Experiment. 2. Comparisons Among Several Therapies

Section snippets

Overall Preliminary Test.

EXAMPLE

ALTERNATIVE EXAMPLE

COMMENTS

ACKNOWLEDGMENT

First page preview

A multiple comparison procedure for comparing several treatments with a control

J Am Stat Assoc

New tables for multiple comparisons with a control

Biometrics