Parametric and non parametric test in biostatistics
The document provides an overview of parametric and non-parametric tests in biostatistics, explaining their definitions, applications, and differences. It details various statistical methods used to analyze health-related data, including t-tests, ANOVA, and non-parametric tests, along with their assumptions and conditions for use. The document underscores the importance of biostatistics in improving health outcomes and evaluating medical interventions.
An introduction to biostatistics by Kapil Gautam at the Institute of Medicine, discussing its history, applications, and testing methods.
Overview of the origins of biostatistics, emphasizing Francis Galton as the father and his use of correlation.
Statistics is the discipline of data analysis; biostatistics applies these methods in health science to improve health outcomes.
Biostatistics determines risk factors for diseases, evaluates drugs, and assesses community health programs.
Differences between parametric tests, which assume data quality, and non-parametric tests, which do not require such assumptions.
Key assumptions of parametric tests and their advantages, including power in detecting true differences. Overview of various parametric tests (e.g., t-tests, ANOVA) for analyzing different data types and comparison goals.
Definition, situations for use, advantages and disadvantages of non-parametric tests in statistical analysis.
Different non-parametric tests (e.g., Mann-Whitney, Wilcoxon tests) with examples illustrating their applications.
Non-parametric tests like Kruskal-Wallis for rank analysis and Chi-Square for analyzing frequency data.
Further exploration of non-parametric methods, including McNemar’s test and Fisher’s exact test for binary outcomes.Methods for assessing correlation: scatter diagrams, Pearson's coefficient, and Spearman's rank correlation.Discussion on the limitations of parametric and non-parametric tests and the necessity of statistical knowledge.
A summary of parametric and non-parametric tests highlighting their applications and significance in research.
Final remarks and thanks from the presenter; a brief conclusion on the importance of biostatistics.
Parametric and non parametric test in biostatistics
1.
PARAMETRIC AND NON-
PARAMETRICTEST IN
BIOSTATISTICS
KAPIL GAUTAM
INSTITUTE OF MEDICINE, MAHARAGJUNJ, KTM NEPAL
TEACHING HOSPITAL
2.
Presentation layout
• Biostatistics–introduction/history
- Statistics & differences
- Uses and scope
• Parametric test
• Non-parametric test
• Limitations
• Summary
• References
3.
History
• Father ofbiostatistics – Francis Galton
• Concept- correlation
• He used questionnaires
4.
Statistics – Statisticsis a branch of mathematics that deals
with the methods of
• Collection
• compilation
• Analysis
• Presentation, and interpretation of Data
Biostatistics
• It is defined as application of statistical methods to medical,
Biological and Public health related problems
5.
Bio-statistics
• Is theapplication of statistical techniques to scientific
research in health-related fields, including medicine,
public health , pharmacy, nursing and biology.
• Biostatistics has became an indispensable tool in
improving health and reducing illness
6.
Biostatics can beapplied to
• Determine of major risk factors for heart disease, lung
cancer and related disease
• Test of new drugs combat e.g. HIV/AIDS
• Evaluate the potential environmental factors harmful to
human health, such as tobacco smoke, vehicle smoke etc.
7.
Uses of biostatistics
•Biostatistics helps to define what is normal and it also gives
limits of normality. E.g.: Hemoglobin level.
• It allows us to compare information from one city or region
to that of other .
• Find out correlation between two variables
• Provides useful information for Planning and
Implementation of various health program, Monitoring and
Evaluation of such program.
8.
Uses in preventivemedicine
• To provide the magnitude of any health problem in the
community.
• To find out the basic factors underlying the ill-
health.
• To evaluate the health programs which was
introduced in the community (success/failure).
9.
Parametric vs Non-parametrictests
Parametric test - Assume the data is of sufficient “quality”
• The results can be misleading if assumptions are wrong
• Quality is defined in terms of certain properties of
data
Non-Parametric tests – Can be used when the data is not of
sufficient quality to satisfy the assumptions of
parametric test
• Parametric tests are preferred when the assumptions
are met because they are more sensitive.
10.
Parametric test
Assumption
• Randomindependent samples
• Interval or ratio level of measurement
• Normally distributed
• No-outliers
• Homogeneity of variance
• Sample size larger than minimum for many non-parametric test
Tests
• More power- more like to detect a difference than truly exists
• Less likely than non-parametric tests to make a type-II error
11.
Types of parametrictests
1. Large sample tests
• Z-test
2. Small sample tests
• T-test
• Independent/unpaired t-test
• Paired t-test
• ANOVA (Analysis of variance)
• One way ANOVA
• Two way ANOVA
3. F-test
12.
Z-test
• A Z-testis used for testing the mean of a
population versus a standard.
• OR
• Comparing the means of two populations.
(With large (n≥ 30) samples whether we know the
population standard deviation or not)
13.
It is alsoused for testing the proportion of some
characteristic versus a standard proportion, or comparing
the proportions of two populations.
E.g. Comparing the average engineering salaries of men
versus women.
E.g. Comparing the fraction defectives from two production
lines.
14.
T- test
• Derivedby W S Gosset in 1908.
Properties of t distribution:
i. It has mean 0
ii. It has variance greater than one
iii. It is bell shaped symmetrical distribution about mean
Assumption for t test:
i. Sample must be random, observations independent
ii. Standard deviation is not known
iii. Normal distribution of population
15.
Uses of ttest
i. The mean of the sample
ii. The difference between means or to compare two
samples
iii. Correlation coefficient
• Types of t test:
a. Paired t test
b. Unpaired t test
16.
Paired t test
Consistsof a sample of matched pairs of similar units, or
one group of units that has been tested twice (a
"repeated measures" t-test )
• Ex. where subjects are tested prior to a treatment, say
for high blood pressure, and the same subjects are
tested again after treatment with a blood-pressure
lowering medication.
17.
Unpaired t test:
•When two separate sets of independent and identically
distributed samples are obtained, one from each of the
two populations being compared.
• Ex: 1. compare the height of girls and boys.
2. compare 2 stress reduction interventions
• when one group practiced mindfulness meditation while
the other learned progressive muscle relaxation
18.
Analysis of variance(ANOVA)
Analysis of variance (ANOVA) is a collection of statistical
models used to analyze the differences between group
means and their associated procedures (such as
"variation" among and between groups),
• Compares multiple groups at one time
• Developed by R.A. Fisher
• Two types: i. One way ANOVA
ii. Two way ANOVA
19.
One way ANOVA
Itcompares three or more different groups when data
are categorized in one way E.g.
1. Compare control group with three different doses of
aspirin in rats
2. Effect of supplementation of vit C in each subject
before, during and after the treatment
20.
Two way ANOVA:
•Used to determine the effect of two nominal predictor
variables on a continuous outcome variable.
• A two-way ANOVA test analyzes the effect of the
independent variables on the expected outcome along
with their relationship to the outcome itself
21.
Difference between one& two way
ANOVA
• E.g. In one-way ANOVA - if we want to determine if
there is a difference in the mean height of stalks of three
different types of seeds.
Since there is more than one mean, we can use a one-way
ANOVA since there is only one factor that could be making
the heights different.
22.
Now, if wetake these three different types of seeds, and
then add the possibility that three different types of
fertilizer is used, then we would want to use a two-way
ANOVA.
• The mean height of the stalks could be different for a
combination of several reasons
23.
• The typesof seed could cause the change, the types of
fertilizer could cause the change, and/or there is an
interaction between the type of seed and the type of
fertilizer.
• There are two factors here (type of seed and type of
fertilizer), so, if the assumptions hold, then we can use a two-
way ANOVA.
24.
F’ test
• Astatistical test is used to determine whether two
populations having normal distribution have the same
variances or standard deviation
• Same as ANOVA test or just another name
Estimate of σ2 from means
F=
Estimate of σ2 from individuals
25.
• Summary ofparametric tests applied for different type
of data Sly no Type of Group Parametric test 1.
Comparison of two paired groups Paired ‘t’ test 2.
Comparison of two unpaired groups Unpaired ‘t’ test 3.
Comparison of three or more matched groups Two way
ANOVA 4. Comparison of three or more matched
groups One way ANOVA 5. Correlation between two
variables Pearson correlation
Unmatched
26.
Non-parametric test
• Testare based on certain probability distribution such as
normality of the population distribution and randomness of
the sample.
• Dot not use the parameter of the distribution.
• Test without a model i.e. Called distribution free test or
non-parametric tests
• Can be use for ordinal data, Skewed data etc.
• When the continuous data are not normally distributed, we
select the nonparametric test.
27.
• The distributioncan be checked whether it is normal or
abnormal by using
• Histogram plot
• Normal plot
• Kolmogorov- Smironve test and Shaprio
Wilk test
28.
Situation for thenon-parametric test
• When quick data analysis is required to have a rough idea
about how the things are working with data.
• When only comparative rather than absolute magnitudes are
available as in the case of clinical data.
E.g. The patients can be categorized as better, unchanged and
worse
• When data is available in non parametric nature
• When data does not satisfy the assumption of a parametric
procedure.
29.
Advantages
• Can beapplied for small sample size
• Most suitable for ranked data
• Can be used for observations drawn from
different populations
• Much easier to learn than parametric tests
• Can be calculated in very quick time
30.
Disadvantages
• Don’t usethe actual measurements
• Effects of variable may not find out in non-
parametric test
• Less powerful than the parametric test
31.
Types of Non-parametrictest
1. Run test
2. Sign test
3. Ranks & Median test
4. Paired Wilcoxon Signed Rank
5. Mann-Whitney test (or Wilconxon Rank Sum Test)
6. Kruskal Wallis Test
7. Fisher’s exact test
32.
Run test
• Mostfrequently to test the randomness ( or lack of
randomness) of data.
• A sequence of data that possess a common property.
• The test statistic in this test is V, the number of runs
observed
• Used to decide if a data set is from a random process.
• Run is defined as a series of increasing values or a
series of decreasing values
33.
• The no.of increasing or decreasing, values is the length of
the run
• In a random data set, the probability that the (L+1)th value
is larger or smaller than the Lth value follows a binomial
distribution which forms the basis of the runs
• A run is a sequence of similar events
• E.g. In flipping coins, the number of “heads” in a row
• In a series of patients, the number of “female Patients” in a
row
34.
Median test
• Isthe value above and below which 50% of the data lie.
• If the data is ranked in order, it is the middle value
• In symmetric distribution the mean and median are the
same,
• In skewed distributions, median more appropriate
35.
E.g.
• Let ussuppose the blood pressure of 7 patients are
given below
• BP: 135, 138, 140, 141, 142, 143, 144
Median - 141
In parametric test, mean is compared to test the null
hypothesis whereas in non - Parametric test median is
used
Sign test
• Oneof oldest & easiest non-parametric test in statistics
• Based on the direction of plus or minus sings of
observation rather than numerical values in a samples.
• Frequently used method instead of t-test
38.
• The IQSCORES OF 10 mentally retarded women are given
• Let us assume the population median = 5
• Null Hypothesis (Ho): the population median is 5= {p(+) = p(-)}
=0.5
Test statistic: the test statistics for the sign test is either the
observed no. of plus signs or the observed no. of minus signs
Women Scores Women Scores
1 4 6 9
2 5 7 10
3 8 8 7
4 8 9 6
5 9 10 6
39.
• Distribution oftest statistic: the plus sign are assigned if
observations are greater than population median and
minus sign are assigned if observations are less than the
population median
• Scores above + and below – hypothesized median based
on the data
• No of positive sign = 8 No. of negative sign =1 and no. of zero = 1
Women 1 2 3 4 5 6 7 8 9 1
0
Score relative to hypothesized median - 0 + ++ + + + + +
40.
Use binomial probabilitydistribution to know the probability
of negative or positive sign
For probability of observing fewer minus sign is = 0.0195
Which is less than 0.5 we reject the null hypothesis
We conclude that the median score is not 5
41.
Two Independed sample:The Mann-Whitney test
• Mann Whitney test is also known as Wilcoxon rank sum
test.
• Popular test amongst the rank sum tests
• Used to determine whether two independent samples
have been drawn from the same populations having
same median or not
42.
Assumptions
• Two samplesare drawn randomly and independently
• Two population have same median
• The measurement scale of dependent variable is at
least ordinal
43.
Steps
• First arrangethe both group data in ordered from
(ascending or descending)
• Give the rank from low to high according to the magnitude.
• For tied observation, give the mean rank to all tied
observations.
44.
• Find thesum of ranks assigned to the values of the first
sample (R1) and also the sum of the ranks assigned to the
values of the second samples(R2)
• Then use test statistics U, which is a measurement of the
difference between the ranked observations of the two
samples
45.
n1& n2 arethe sample sizes and R1 &R2 are the sum of ranks
assigned to the values of the first & second samples
respectively.
• A researcherwishes to find the assess the effects of prolonged
inhalation of cadmium oxide. He had taken 11 animals as experimental
subjects and 8 similar animals served as controls. The variable of
interest was hemoglobin level.
• How conclude that prolonged inhalation of cadmium oxide reduces
hemoglobin level ?
• Experimental Group: 14.0 15.3 16.7 13.7 15.3 15.7 15.6 14.1 15.6
15.1 15.9 16.6 14.1
• Control group: 16.2 17.1 17.4 17.5 15.0 16.0 16.9 15.0
48.
• Ho- prolongedinhalation of cadmium oxide doesn’t reduce
hemoglobin level
• H1: prolonged inhalation of cadmium oxide reduces
hemoglobin level
• To find the test statistics, we combine the two samples and
rank all observations from smallest to largest. Tied
observations are assigned a mean rank to all observations
Here, n1= 11,and n=8
R1= 84 and R2= 106, so take the larger sum ran, in this care
R2 is taken
Hence, test statistic
• Tabulated value of U is as U0.05(18),11,8 = 19
• Since calculation value (18) is less than the tabulated value
(19), Ho is rejected
Conclusion
• We can conclude that prolonged inhalation cadmium oxide
does not reduce hemoglobin
51.
Wilcoxon Matched pairstest ( signed rank
test): for dependent samples
• It is equivalent to paired t-test.
• Alternative method to test the paired data when the
observations do not follow the criteria of normality
• Used to test paired data such as: before and after studies,
studies of twins or other relatives, two different medicines
used on the same group at different time period etc.
52.
• Each subjectproduces two scores, one for each condition
• Test is done to show whether there is a statistically
significant difference between the two conditions
Step
• First find the differences(d) of the scores of the two
matched samples
• Assign the rank to the differences ( ignoring the sign)
• Positive ranks are summed
• Negative ranks are summed
• T is the smaller sum of ranks
• n is the number of matched pairs
53.
Conditions of test
•If n>15, T is approximately normally distribution,
and a Z test is used
• If n<15, a special “small sample” procedure is
followed
• The paired data are randomly selected
• The underlying distributions are symmetrical
54.
Hypothesis
Null hypothesis (Ho): the two population are
identical
Alternative Hypothesis H1: the two population are
not identical
55.
• The datain the table below the duration of tolerance of
pain by 11 subject before and after the administration of a
drug (0.04mg/20g) does the data provides sufficient
evidence in support that drug increases the duration of
endurance of pain
• Calculation of Wilcoxon’s Signed Rank Test (small sample)
• Sum ofnegative ranks = -1-6-3-4 = -14
• Sum of positive ranks = 8+11+7+9+5+2+10= 52
• The null hypothesis is tested using the smaller value of
the sums of negative ranks (T), In this case, the sum of
negative ranks (T) = 14
• Tabulated value (critical value) T at 5% level of
significance at 11 pairs = 7
58.
Note: if thedifference of the pair is zero, that pair should
be subtracted from the total pairs
Conclusion
• Experimental value of T is 14, while tabled value of T is
7, it means that null hypothesis can be rejected. So, two
population are not identical
59.
Kruskal Wallis oneway analysis of
variance by rank
• One-way analysis of variance by ranks is a non-
parametric method for testing whether samples
originate from the same distribution
• Used for comparing two or more samples that are
independent,
• And that may have different sample sizes, and
extended the Mann - Whitney U test to more than two
groups
60.
• Just likeone way ANOVA it is applied to
populations from which the samples drawn are not
normally distributed with equal variances or when
the data for analysis consists of only ranks
61.
Example
In study ofcerebrovascular disease, the patients from 16
socioeconomic background were thoroughly investigated.
One characteristic measured was diastolic blood pressure in
mm/hg. Is there any reason to believe that three groups
differ with respect to this characteristic?
• Study of cerebrovascular disease in 3 socioeconomic
backgrounds
62.
Group A GroupB Group c
100 92 81
1031 97 102
89 88 86
78 84 83
105 90 99
95
n=5 n=6 n=5
Total (n) = 5+6+5 =16
63.
Null hypothesis (Ho):there is no difference in the
diastolic pressure of the three groups.
Alternative hypothesis (Ho): there is difference in the
diastolic pressures of the three groups.
64.
Calculation
• Arranged allthe data in ordered from
obs. 78 81 83 84 86 88 89 90 92 95 97 99 100 102 103 105
Ran
k
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15
Group A Group B Group c
13
15
7
1
16
9
11
6
4
8
10
2
14
5
3
12
R1= 52 R2=48 R3= 36
65.
Observed H =1.2
• Since H is distributed with chi square with (3-1) = 2d.f.
and tabulated value of chi square at 2 d.f. = 5.99.
• Therefore, the null hypothesis is not rejected
Conclusion: It means that there is no difference in the
diastolic pressures of the three groups.
66.
Chi - Squaretest =
• The most commonly used nonparametric test in the biological
experiments
• It is computed on the basis of frequencies in a sample and is
applied only for qualitative data (nominal and ordinal scale data)
E.g.. Health, response to drug etc.
• Test is used as a test of significance when data is expressed in
frequencies or in terms of percentages.
• Enable us to determine the degree of deviation between
observed frequencies and expected frequencies and to conclude
whether the deviation between observed frequencies and
expected frequencies is due to error sampling or due to chance.
67.
• Formula
Where O=observed frequency in a class and E = expected
frequency in a class.
68.
• Application oftests
• To test the goodness of fit
• To test independence of attribute
• To test the homogeneity of the attribute in
respect of a particular characteristic or it
may be used to test the population variance
69.
• Criterion forusing test
• Data should be categorical or qualitative
• One or more categories
• Observations must be independent
• Adequate sampling size (50 or more than 50)
• Sample should be taken randomly
• Data are in frequency form
• The expected frequency of any item of cell should not
be less than 5.
if it is less than 5, then frequencies taking from the
preceding or succeeding frequency be pooled together
in order to make it 5 or more than 5
70.
Step involved inthis case are as
1. formulate the hypothesis
2. Test statistic
3. Level of significant
4. Decision
5. Conclusion
71.
For example, totest the hypothesis that a random sample
of 1000 people has been drawn from a population in
which men and women are equal in frequency, the
observed number of men and women would be compared
to the theoretical frequencies of 50 men and 50 women. If
there were 44 men in the sample and 56 women, then
72.
Hypothesis
• Ho: sexration is equal
• H1: sex ration is significantly different
Test statistic
Expected value is 50:50
O = 44(M) or 56(F) E= 50
By solving we got 1.44
73.
• Degree offreedom = k-1 = 2-1 = 1
• Level of significance = 5% = 0.05
• Tabulated value of chi square at 1 d.f. and 5% level of
significance = 3.84
Comparison = calculated chi square (1.44) is less than
tabulated chi square (3.84). So we accept the null
hypothesis.
Conclusion: since null hypothesis is accepted, we conclude
that the sex ratio is equal
74.
McNemar’s test
• Testis a statistical test used on paired Nominal data
• A non parametric chi-square procedure that compares
proportions obtained from a 2*2 contingency table.
• Used on paired nominal data.
• Applied to 2*2 contingency tables with a dichotomous trail,
with matched pairs of subjects, to determine whether the
row and column marginal frequencies are equal.
75.
1- Pair-matched datacan come from
• Case-control studies where each case has a matching
control (matched on age, gender, race etc.)
Twins studies – the matched pairs are twins
• Before & after data
The outcome is present (+) or absence (-) of some
characteristic measured on the same individual at two
time points
76.
Example
• Brest cancerpatients receiving mastectomy followed by
chemotherapy were matched to each other on age and
cancer stage.
Pair-matched Data for case-control Study: outcome is
exposure to some risk factor
77.
• The countsin the table for a before-after study are numbers
of pairs and no. of individuals.
• It is used to test the two types of diagnosis test or two types
of medicine whether they give the same result or not
• The Null hypothesis is Ho: Pb= Pc
• The alternative hypothesis is H1 : Pb ≠ Pc
Fisher’s exact test
•Comparing binary outputs produced by two methods
• The significance of the deviation can be calculated
exactly
• Null hypothesis : Output difference between two
methods is zero
80.
Methods of studyingof correlation
• Correlation analysis measures the degree of association
of two variables
1. Scatter diagram( Graphical method of
representation of relationship)
2. Karl Pearson’s correlation coefficient (for
quantitative data)
3. Spearman’s rank correlation coefficient (ordinal
data)
81.
Scatter diagram (scatterplot) method
• Simples method of studying relationship between
two variables by graphically.
• Fist step of showing the relationship between
variables.
• Give the direction correlation but fail to give the
degree of relationship
82.
Fig. Independent variable( X) variable is plot along with the
X-axis (horizontal) and dependent variable (Y)
83.
• Merits
• Simpleand non mathematical method for studying
correlation
• Easy to understand and easy to interpret
• First step to study the relation
• Demerits
• It gives just an idea about the direction correlation.
It does not establish the exact degree of correlation
• Just qualitative method of showing the relationship
between two variables
84.
Karl Pearson’s coefficientof correlation
• Mathematical method to measure the degree of
relationship between two quantitative variable.
• Denoted by r
• Is a parametric method of finding the relationship
between variables
• k/n bivariate analysis
• The value of correlation coefficient lies in between -1 to
+1
85.
Interpretation of correlationcoefficient
• If r= -1, there is perfect negative correlation between X & Y
• If r=+1, there is perfect positive correlation between X & Y
• If r=0, there is no correlation between X & Y
86.
Merits
• It givesthe exact measure of degree of correlation between
two variables.
• It gives whether the correlation is positive or negative
Demerits
• Affected by extreme values
• Gives only linear relationship
• Tedious calculation
• Uses only in quantitative measurement
87.
Spearman rank correlation
•The data obtained from bi-variate population
which is not in normal then the previous Karl
Pearson coefficient correlation is not applied
• Instead, we give the ranks for each variable
• Used to find the relationship
• We use this method when the variables are taken
from qualitative nature such as intelligence, honesty,
ability, beauty, color etc..
88.
• The spearman’srank correlation is also called non-
parametric test or distribution free test
• Denoted by rs
• Lies in between -1 to +1
89.
• Spearman’s Rankcorrelation (R) =
r= spearman’s rank correlation
D= difference between two ranks
n = Number of pairs of observations
90.
limitation
• Does notdeal individual data
• Technique deals with the quantitative data only. It ignores
qualitative aspects like beauty, goodness, intelligence, gender,
pain, knowledge etc..
• Laws are not exact like mathematical like mathematical laws
• They are based on the average
• Sometimes it gives absurd result
• The greatest limitation of biostatistics is that only who has a
sound knowledge of statistical methods can efficiently handle
statistical data, Person with poor expertise knowingly or
unknowingly can draw faulty conclusion
91.
Parametric test Non-parametrictest
1. Large sample tests
Z-test
2. Small sample tests
T-test
Independent/unpaired
t-test
Paired t-test
ANOVA (Analysis of
variance)
One way ANOVA
Two way ANOVA
3. F test
1. Run test
2. Sign test
3. Ranks & Median test
4. Paired Wilcoxon Signed
Rank
5. Mann-Whitney test (or
Wilconxon Rank Sum Test)
6. Kruskal Wallis Test
7. Fisher’s exact test
Summary
92.
• Simplifies complexity
•Collects the information scientific methods
• Analyzes the data
• Helps in formulation of suitable polices
• Facilitates comparison
• Helps in forecasting
Cond..