WIELD OF
STATISTICS IN
DECISION MAKING
Dr.A.Saberunnisa
Assistant Professor of Statistics
The Madura College
Madurai
TOPICS COVERED
 Data
 Collection of Data
 Organization of Data
 Presentation of Data
 Hypothesis testing
 Types of errors
DATA
 Data is a collection of raw, unprocessed facts and figures
that can be used as a basis for calculation, reasoning, or
discussion. It can include abstract ideas, concrete
measurements, statistics, and more. Data is often collected
through experiments, observations, or measurements, and
it lacks context and interpretation.
EXAMPLES
 The amount of Food that goes waste in India every year
was more than 40% valued at 58,000 core.
 In India, 96.5% kids got to School: Survey
1. There is a lot food that goes waste in India every
year
2. The Population of India is Growing Rapidly
DATA
QUALITATIVE
DATA
QUANTITATIV
E DATA
 is interpretation-based, descriptive, and relating to language.
 can help us to understand why, how, or what happened behind certain
behaviors.
 subjective and unique.
 methods are interviewing and observing.
 is analyzed by grouping the data into categories and themes.
 is numbers-based, countable, or measurable.
 tells us how many, how much, or how often in calculations.
 is fixed and universal.
 is subjective and unique.
 methods are measuring and counting.
 is analyzed using statistical analysis.
QUALITATIVE DATA
QUANTITATIVE DATA
COLLECTION OF
DATA
DATA COLLECTION
It’s the process of gathering and measuring information on
variables of interest, in an established systematic fashion
that enables one to answer stated research questions, test
hypotheses, and evaluate outcomes
SOURCES OF DATA
PRIMARY SOURCES
INTERVIEW
QUESTIONNAIR
E
INVESTIGATION
SECONDARY
SOURCES
PUBLISHED
UNPUBLISHED
PRIMARY DATA
 Primary data is a type of information that is obtained from
first – hand sources by means of surveys, observation or
experimentation. It is data that has not been previously
published and is derived from a new or original research
study and collected at the source such as in marketing.
SOURCES OF COLLECTION OF
PRIMARY DATA
 Direct Personal Interview:
Data is personally collected by the interviewer.
 Telephonic Interviews:
 Data is collected through an interview over the
telephone with the interviewer.
SOURCES OF COLLECTION OF
PRIMARY DATA
 Indirect Oral Investigation:
Data is collected from third parties who have
information about subject of enquiry.
 Information from Correspondents:
 Data is collected from agents appointed in
the area of investigation.
SOURCES OF COLLECTION OF
PRIMARY DATA
 Mailed Questionnaire:
Data is collected through questionnaire mailed to
the information.
 Questionnaire filled by enumerators:
 Data is collected by trained enumerators
who fill questionnaires.
ESSENTIALS OF A GOOD
QUESTIONNAIRE
 A covering letter with objectives and
scope of survey.
 Minimum number of questions.
 Avoid personal questions.
 Questions should be clear and simple.
 Questions should be logically arranged.
HOW TO COLLECT PRIMARY DATA?
These are the ways to collect primary data:
1. Sampling: It is a process through which we choose
a smaller group to collect data that can be the best
representative of the population.
2. Survey: it can be done in face to face
mode(interviews) or indirect mode(Telephone,
internet etc.,)
3. Census: It is method in which data is collected
from every unit of population.
CENSUS METHOD SAMPLING
METHOD
1.Every unit of
population studied
2.Reliable and
accurate results
3.Expensive method
4.Suitable when
population is of
homogenous
nature.
1.Few unit of
population are
studied
2.Less Reliable and
accurate results
3.Less expensive
method
4. Suitable when
population is
SECONDARY DATA
Secondary data is data that is not collected by the
person who is doing research. An example of
secondary data is a community assessment done
by another organization but used to substantiate
another organizations research.
SOURCES OF SECONDARY DATA
Published Source
Government publications, Semi-government
publications etc.,
Unpublished Source
Census of India, National Sample Survey organization
[They are collected by the organizations for their own
record]
PROCESS OF DATA COLLECTION
Data
Collection
Data
Analysis
Drawing
Inferences
Populatio
n
Sample
ORGANIZATION
OF DATA
Data organization is the way to arrange the raw data in an
understandable order. Organizing data include classification,
frequency distribution table.
Example: The marks scored out of 50 in a Maths
exam taken by 15 students are as follows:
26,15, 40, 18, 26, 24, 48, 40, 39, 26, 23, 37, 38, 40, 45, 48.
PRESENTATION
OF DATA
The data collected should be presented in a
suitable, concise form for further analysis. The
collected data may be presented in the form of
tabular or diagrammatic or graphical form.
HYPOTHESIS TESTING
WHAT IS A HYPOTHESIS?
A hypothesis is an assumption about the
population parameter.
A parameter is characteristic of the population,
like its mean or variance.
The parameter must be identified before
analysis.
TESTING OF HYPOTHESIS
A hypothesis is an assumption about the population parameter (say
population mean) which is to be tested.
For that we collect sample data, then we calculate sample
statistics(say sample mean) and then use this information to
judge/decide whether hypothesized value of population parameter
is correct or not.
 To test the validity of assumed or hypothetical value of
population, we gather sample data and determine the
difference between hypothesized value and actual value of
the sample mean.
 Then we judge whether the difference is significant or not.
 The smaller the difference, the greater the likelihood that
our hypothesized value for the mean is correct. The larger
the difference, the smaller the likelihood.
 In hypothesis testing the first step is to state the assumed
or hypothesized (numerical) value of the population
parameter.
 The assumption we wish/ want to test is called the null
hypothesis. It is denoted with
THE NULL HYPOTHESIS,
 State the Assumption to be tested.
 The average marks of the second year
students( in sem3) is 75
 Begin with the assumption that the null
hypothesis is true.
The Alternative Hypothesis,
 The Alternate Hypothesis is the logical opposite
of the null hypothesis.
 The average marks of the second year
students( in sem3) is not 75. (:µ≠75)
PROCEDURE OF HYPOTHESIS
TESTING
The Hypothesis Testing comprises the following steps:
Step 1:
Set up a hypothesis.
Step 2:
Set up a suitable significance level.
The confidence with which an experimenter rejects or
accepts Null hypothesis depends on the significance level
adopted.
Level of significance is the rejection region. It is denoted
with α.
SELECTING A SIGNIFICANCE LEVEL
 Level of significance can be adopted either 5% or 1%.
 5% level of significance ie., α=0.05, then there are about 5
chances out of 100 that we would reject the null
hypothesis.
 There is 95% chance to accept the null hypothesis.
 It means that there is about 95% confidence that the
decision taken is right.
If the sample statistic(calculated value) falls in non shaded
region( acceptance region), then it simply means that there
is no evidence to reject the null hypothesis.
It proves that null hypothesis is true. Otherwise it will be
rejected.
Step 3: Determination of suitable test statistics
Step 4: Determine the critical value from the table.
Step 5: After doing computation , check the sample result.
Compare the calculated value(sample result) with the value obtained
from the table(tabulated or critical value)
Step 6: Making decisions either accepting or rejecting the null
hypothesis. If the computed value is more than the tabulated or
critical value, then it falls in the critical region. In that case, reject
null hypothesis, otherwise accept.
TYPE OF ERRORS
TYPE I AND II ERRORS
When a statistical hypothesis is tested, there are 4 possible
results:
1. The hypothesis is true but our test accepts it.
2. The hypothesis is false but our test rejects it.
3. The hypothesis is true but our test rejects it.
4. The hypothesis is false but our test accept it.
Rejecting a null hypothesis when it is true is called Type I
error.
Accepting a null hypothesis when it is false is called Type II
error.
Example:
In a court room, a defendant is considered not guilty as
long as his guilt is not proven. The prosecutor tries to prove
the guilt of the defendant. Only when there is enough
charging evidence the defendant is condemned. In the start
of the procedure, there are two hypotheses
WIELD OF STATISTICS IN DECISION - MAKING
WIELD OF STATISTICS IN DECISION - MAKING
WIELD OF STATISTICS IN DECISION - MAKING

WIELD OF STATISTICS IN DECISION - MAKING

  • 1.
    WIELD OF STATISTICS IN DECISIONMAKING Dr.A.Saberunnisa Assistant Professor of Statistics The Madura College Madurai
  • 2.
    TOPICS COVERED  Data Collection of Data  Organization of Data  Presentation of Data  Hypothesis testing  Types of errors
  • 3.
    DATA  Data isa collection of raw, unprocessed facts and figures that can be used as a basis for calculation, reasoning, or discussion. It can include abstract ideas, concrete measurements, statistics, and more. Data is often collected through experiments, observations, or measurements, and it lacks context and interpretation.
  • 4.
    EXAMPLES  The amountof Food that goes waste in India every year was more than 40% valued at 58,000 core.  In India, 96.5% kids got to School: Survey 1. There is a lot food that goes waste in India every year 2. The Population of India is Growing Rapidly
  • 5.
  • 6.
     is interpretation-based,descriptive, and relating to language.  can help us to understand why, how, or what happened behind certain behaviors.  subjective and unique.  methods are interviewing and observing.  is analyzed by grouping the data into categories and themes.  is numbers-based, countable, or measurable.  tells us how many, how much, or how often in calculations.  is fixed and universal.  is subjective and unique.  methods are measuring and counting.  is analyzed using statistical analysis. QUALITATIVE DATA QUANTITATIVE DATA
  • 8.
  • 9.
    DATA COLLECTION It’s theprocess of gathering and measuring information on variables of interest, in an established systematic fashion that enables one to answer stated research questions, test hypotheses, and evaluate outcomes
  • 10.
    SOURCES OF DATA PRIMARYSOURCES INTERVIEW QUESTIONNAIR E INVESTIGATION SECONDARY SOURCES PUBLISHED UNPUBLISHED
  • 11.
    PRIMARY DATA  Primarydata is a type of information that is obtained from first – hand sources by means of surveys, observation or experimentation. It is data that has not been previously published and is derived from a new or original research study and collected at the source such as in marketing.
  • 12.
    SOURCES OF COLLECTIONOF PRIMARY DATA  Direct Personal Interview: Data is personally collected by the interviewer.  Telephonic Interviews:  Data is collected through an interview over the telephone with the interviewer.
  • 13.
    SOURCES OF COLLECTIONOF PRIMARY DATA  Indirect Oral Investigation: Data is collected from third parties who have information about subject of enquiry.  Information from Correspondents:  Data is collected from agents appointed in the area of investigation.
  • 14.
    SOURCES OF COLLECTIONOF PRIMARY DATA  Mailed Questionnaire: Data is collected through questionnaire mailed to the information.  Questionnaire filled by enumerators:  Data is collected by trained enumerators who fill questionnaires.
  • 15.
    ESSENTIALS OF AGOOD QUESTIONNAIRE  A covering letter with objectives and scope of survey.  Minimum number of questions.  Avoid personal questions.  Questions should be clear and simple.  Questions should be logically arranged.
  • 16.
    HOW TO COLLECTPRIMARY DATA? These are the ways to collect primary data: 1. Sampling: It is a process through which we choose a smaller group to collect data that can be the best representative of the population. 2. Survey: it can be done in face to face mode(interviews) or indirect mode(Telephone, internet etc.,) 3. Census: It is method in which data is collected from every unit of population.
  • 17.
    CENSUS METHOD SAMPLING METHOD 1.Everyunit of population studied 2.Reliable and accurate results 3.Expensive method 4.Suitable when population is of homogenous nature. 1.Few unit of population are studied 2.Less Reliable and accurate results 3.Less expensive method 4. Suitable when population is
  • 18.
    SECONDARY DATA Secondary datais data that is not collected by the person who is doing research. An example of secondary data is a community assessment done by another organization but used to substantiate another organizations research.
  • 19.
    SOURCES OF SECONDARYDATA Published Source Government publications, Semi-government publications etc., Unpublished Source Census of India, National Sample Survey organization [They are collected by the organizations for their own record]
  • 20.
    PROCESS OF DATACOLLECTION Data Collection Data Analysis Drawing Inferences Populatio n Sample
  • 21.
  • 22.
    Data organization isthe way to arrange the raw data in an understandable order. Organizing data include classification, frequency distribution table. Example: The marks scored out of 50 in a Maths exam taken by 15 students are as follows: 26,15, 40, 18, 26, 24, 48, 40, 39, 26, 23, 37, 38, 40, 45, 48.
  • 23.
  • 24.
    The data collectedshould be presented in a suitable, concise form for further analysis. The collected data may be presented in the form of tabular or diagrammatic or graphical form.
  • 25.
  • 26.
    WHAT IS AHYPOTHESIS? A hypothesis is an assumption about the population parameter. A parameter is characteristic of the population, like its mean or variance. The parameter must be identified before analysis.
  • 27.
    TESTING OF HYPOTHESIS Ahypothesis is an assumption about the population parameter (say population mean) which is to be tested. For that we collect sample data, then we calculate sample statistics(say sample mean) and then use this information to judge/decide whether hypothesized value of population parameter is correct or not.
  • 28.
     To testthe validity of assumed or hypothetical value of population, we gather sample data and determine the difference between hypothesized value and actual value of the sample mean.  Then we judge whether the difference is significant or not.  The smaller the difference, the greater the likelihood that our hypothesized value for the mean is correct. The larger the difference, the smaller the likelihood.
  • 29.
     In hypothesistesting the first step is to state the assumed or hypothesized (numerical) value of the population parameter.  The assumption we wish/ want to test is called the null hypothesis. It is denoted with
  • 30.
    THE NULL HYPOTHESIS, State the Assumption to be tested.  The average marks of the second year students( in sem3) is 75  Begin with the assumption that the null hypothesis is true.
  • 31.
    The Alternative Hypothesis, The Alternate Hypothesis is the logical opposite of the null hypothesis.  The average marks of the second year students( in sem3) is not 75. (:µ≠75)
  • 32.
    PROCEDURE OF HYPOTHESIS TESTING TheHypothesis Testing comprises the following steps: Step 1: Set up a hypothesis. Step 2: Set up a suitable significance level. The confidence with which an experimenter rejects or accepts Null hypothesis depends on the significance level adopted. Level of significance is the rejection region. It is denoted with α.
  • 33.
    SELECTING A SIGNIFICANCELEVEL  Level of significance can be adopted either 5% or 1%.  5% level of significance ie., α=0.05, then there are about 5 chances out of 100 that we would reject the null hypothesis.  There is 95% chance to accept the null hypothesis.  It means that there is about 95% confidence that the decision taken is right.
  • 34.
    If the samplestatistic(calculated value) falls in non shaded region( acceptance region), then it simply means that there is no evidence to reject the null hypothesis. It proves that null hypothesis is true. Otherwise it will be rejected.
  • 35.
    Step 3: Determinationof suitable test statistics Step 4: Determine the critical value from the table. Step 5: After doing computation , check the sample result. Compare the calculated value(sample result) with the value obtained from the table(tabulated or critical value) Step 6: Making decisions either accepting or rejecting the null hypothesis. If the computed value is more than the tabulated or critical value, then it falls in the critical region. In that case, reject null hypothesis, otherwise accept.
  • 36.
  • 37.
    TYPE I ANDII ERRORS When a statistical hypothesis is tested, there are 4 possible results: 1. The hypothesis is true but our test accepts it. 2. The hypothesis is false but our test rejects it. 3. The hypothesis is true but our test rejects it. 4. The hypothesis is false but our test accept it. Rejecting a null hypothesis when it is true is called Type I error. Accepting a null hypothesis when it is false is called Type II error.
  • 38.
    Example: In a courtroom, a defendant is considered not guilty as long as his guilt is not proven. The prosecutor tries to prove the guilt of the defendant. Only when there is enough charging evidence the defendant is condemned. In the start of the procedure, there are two hypotheses