SlideShare ist ein Scribd-Unternehmen logo
1 von 49
Statistics: First Steps
Andrew Martin
PS 372
University of Kentucky
Variance
Variance is a measure of dispersion of data
points about the mean for interval- and ratio-level
data.
Variance is a fundamental concept that social
scientists seek to explain in the dependent
variable.
Standard Deviation
Standard deviation is a measure of dispersion of
data points about the mean for interval- and ratio-
level data.
Like the mean, standard deviation is sensitive to
extreme values.
Standard deviation is calculated as the square
root of the variance.
Normal Distribution

The bulk of observations lie in the center,
where there is a single peak.

In a normal distribution half (50 percent) of the
observations lie above the mean and half lie
below it.

The mean, median and mode have the same
statistical values.

Fewer and fewer observations fall in the tails.

The spread of the distribution is symmetric.
Normal Distribution

Mathematical theory allows us to know what
percentage of observations lie within one
(68%), two (95%) or three (98%) standard
deviations of the mean.

If data are not perfectly normally distributed, the
percentages will only be approximations.

Many naturally occurring variables do have
nearly normal distributions.

Some can be transformed using logarithms.
Frequency Distribution
What about categorical variables?
Example
Calculate the ID and IQV for a former PS 372
class grades using the following frequencies or
proportions:
Grade Freq. Prop.
A 4 (.12)
B 7 (.21)
C 4 (.12)
D 7 (.21)
E 12 (.34)
Index of Diversity
ID = 1 – (p2
a
+ p2
b
+ p2
c
+p2
d
+p2
e
)
ID = 1 - (.122
+ .212
+ .122
+ .212
+ .342
)
ID = 1 - (.0144 + .0441 + .0144 + .0441 + .1156)
ID = 1 - (.2326)
ID = .7674
Index of Qualitative Variation
1 – (p2
a
+ p2
b
+ p2
c
+p2
d
+p2
e
)
1 - (1/K)
Index of Qualitative Variation
.7674
(1 – 1/5)
.9592
Data Matrix
A data matrix is an array of rows and columns
that stores the values of a set of variables for all
the cases in a data set.
This is frequently referred to as a dataset.
Data Matrix from JRM
Properties of Good Graphs
Should answer several of the following questions:
(JRM 384)
1. Where does the center of the distribution lie?
2. How spread out or bunched up are the
observations?
3. Does it have a single peak or more than one?
4. Approximately what proportion of observations
in in the ends of the distributions?
Properties of Good Graphs
5. Do observations tend to pile up at one end of
the measurement scale, with relatively few
observations at the other end?
6. Are there values that, compared with most,
seem very large or very small?
7. How does one distribution compare to another
in terms of shape, spread, and central tendency?
8. Do values of one variable seem related to
another variable?
Statistical Concepts
Let's quickly review some concepts.
Population
A population refers to any well-defined set of
objects such as people, countries, states,
organizations, and so on. The term doesn't simply
mean the population of the United States or some
other geographical area.
Population

A sample is a subset of the population.

Samples are drawn in some known manner and
each case is chosen independently of the other.

From here on out, when the book uses the term
sample, random sample or simple random
sample, it's making reference to the same
concept, which is a sample chosen at random.
Populations

Parameters are numerical features of a
population.

A sample statistic is an estimator that
corresponds to a population parameter of
interest and is used to estimate the population
value.

Y is the sample mean, (μ) is the population
mean.

^ is a “hat”, caret or circumflex
Two Kinds of Inference
Hypothesis Testing
Point and interval estimation
Hypothesis Testing
Many claims can be translated into specific
statements about a population that can be
confirmed or disconfirmed with the aid of
probability theory.
Ex: There is no ideological difference between the
voting patterns between the voting patterns of
Republican and Democrat justices on the U.S.
Supreme Court.
Point and Interval Estimation
The goal here is to estimate unknown population
parameters from samples and to surround those
estimates with confidence intervals. Confidence
intervals suggest the estimates reliability or
precision.
Hypothesis Testing
Start with a specific verbal claim or proposition.
Ex: The chances of getting heads or tails when
flipping the coin is are roughly the same.
Ex: The chances of the United States electing a
Republican or Democrat president are roughly the
same.
Hypothesis Testing
Hypothesis Testing
Next, the researcher constructs a null hypothesis.
A null hypothesis is a statement that a
population parameter equals a specific value.
Hypothesis Testing
Following up on the coin example, the null
hypothesis would equal .5.
Stated more formally: H0
: P = .5
Where P stands for the probability that the coin
will be heads when tossed.
H0
is typically used to denote a null hypothesis.
Hypothesis Testing

Next, specify an alternative hypothesis.

An alternative hypothesis is a statement
about the value or values of a population
parameter. It is proposed as an alternative to
the null hypothesis.

An alternative hypothesis can merely state that
the population does not equal the null
hypothesis, or is greater than or less than the
null hypothesis.
Hypothesis Testing
Suppose you believe the coin is unfair, but have
no intuition about whether it is too prone to come
up heads or tails.
Stated formally, the alternative hypothesis is:
HA
: P ≠ .5
Hypothesis Testing
Perhaps you believe the coin is more likely to
come up heads than tails. You would formulate
the following alternative hypothesis:
HA
: P > .5
Conversely, if you believe the coin is less likely to
come up heads than tails, you would formulate
the alternative hypothesis in the opposite
direction:
HA
: P < .5
Hypothesis Testing

After specifying the null and alternative
hypothesis, identify the sample estimator that
corresponds to the parameter in question.

The sample must come from the data, which in
this case is generated by flipping a coin.
Hypothesis Testing

Next, determine how the sample statistic is
distributed in repeated random samples. That
is, specify the sampling distribution of the
estimator.

For example, what are the chances of getting
10 heads in 10 flips (p = 1.)? What about 9
heads in 10 flips (p = .9)? 8 flips (p = .8)?
Hypothesis Testing

Make a decision rule based on some criterion
of probability or likelihood.

In social sciences, a result that occurs with a
probability of .05 (that is, 1 chance in 20) is
considered unusual and consequently is
grounds for rejecting a null hypothesis.

Other common thresholds (.01, .001) are also
common..

Make the decision rule before collecting data.
Hypothesis Testing

In light of the decision rule, define a critical
region. The critical region consists of those
outcomes so unlikely to occur that one has
cause to reject the null hypothesis should they
occur.

So there are areas of “rejection” (critical areas)
and nonrejection.
Hypothesis Testing

Collect a random sample and calculate the
sample estimator.

Calculate the observed test statistic. A test
statistic converts the sample result into a
number that can be compared with the critical
values specified by your decision rule and
critical values.

Examine the observed test statistic to see if it
falls in the critical region.

Make practical or theoretical interpretation of
the findings.
Statistics 091208004734-phpapp01 (1)

Weitere ähnliche Inhalte

Was ist angesagt?

Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"jemille6
 
[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)
[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)
[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)Saúl Pillaca Yupanqu
 
Mayo: Day #2 slides
Mayo: Day #2 slidesMayo: Day #2 slides
Mayo: Day #2 slidesjemille6
 
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)jemille6
 
Hypothesis testing, error and bias
Hypothesis testing, error and biasHypothesis testing, error and bias
Hypothesis testing, error and biasDr.Jatin Chhaya
 
Review & Hypothesis Testing
Review & Hypothesis TestingReview & Hypothesis Testing
Review & Hypothesis TestingSr Edith Bogue
 
Byrd statistical considerations of the histomorphometric test protocol (1)
Byrd statistical considerations of the histomorphometric test protocol (1)Byrd statistical considerations of the histomorphometric test protocol (1)
Byrd statistical considerations of the histomorphometric test protocol (1)jemille6
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statisticsMaria Theresa
 
Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma jemille6
 
Concept of Inferential statistics
Concept of Inferential statisticsConcept of Inferential statistics
Concept of Inferential statisticsSarfraz Ahmad
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingpraveen3030
 
Four steps to hypothesis testing
Four steps to hypothesis testingFour steps to hypothesis testing
Four steps to hypothesis testingHasnain Baber
 
HYPOTHESIS TESTING
HYPOTHESIS TESTINGHYPOTHESIS TESTING
HYPOTHESIS TESTINGAmna Sheikh
 
Hypothesis testing ppt final
Hypothesis testing ppt finalHypothesis testing ppt final
Hypothesis testing ppt finalpiyushdhaker
 
Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13jemille6
 

Was ist angesagt? (20)

Introductory Statistics
Introductory StatisticsIntroductory Statistics
Introductory Statistics
 
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
 
[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)
[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)
[David m. kreps]_game_theory_and_economic_modellin(b-ok.org)
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Mayo: Day #2 slides
Mayo: Day #2 slidesMayo: Day #2 slides
Mayo: Day #2 slides
 
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
Frequentist Statistics as a Theory of Inductive Inference (2/27/14)
 
Hypothesis testing, error and bias
Hypothesis testing, error and biasHypothesis testing, error and bias
Hypothesis testing, error and bias
 
Review & Hypothesis Testing
Review & Hypothesis TestingReview & Hypothesis Testing
Review & Hypothesis Testing
 
Byrd statistical considerations of the histomorphometric test protocol (1)
Byrd statistical considerations of the histomorphometric test protocol (1)Byrd statistical considerations of the histomorphometric test protocol (1)
Byrd statistical considerations of the histomorphometric test protocol (1)
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statistics
 
Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma Statistical Flukes, the Higgs Discovery, and 5 Sigma
Statistical Flukes, the Higgs Discovery, and 5 Sigma
 
Concept of Inferential statistics
Concept of Inferential statisticsConcept of Inferential statistics
Concept of Inferential statistics
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Four steps to hypothesis testing
Four steps to hypothesis testingFour steps to hypothesis testing
Four steps to hypothesis testing
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
HYPOTHESIS TESTING
HYPOTHESIS TESTINGHYPOTHESIS TESTING
HYPOTHESIS TESTING
 
Hypothesis testing ppt final
Hypothesis testing ppt finalHypothesis testing ppt final
Hypothesis testing ppt final
 
P value
P valueP value
P value
 
Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13Phil6334 day#4slidesfeb13
Phil6334 day#4slidesfeb13
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 

Andere mochten auch (20)

Presidency
PresidencyPresidency
Presidency
 
Week 7 - sampling
Week 7  - samplingWeek 7  - sampling
Week 7 - sampling
 
Week 7 Sampling
Week 7   SamplingWeek 7   Sampling
Week 7 Sampling
 
Stats Intro Ps 372
Stats Intro Ps 372Stats Intro Ps 372
Stats Intro Ps 372
 
Judiciary Part 2
Judiciary Part 2Judiciary Part 2
Judiciary Part 2
 
Measurement pt. 2
Measurement pt. 2Measurement pt. 2
Measurement pt. 2
 
Constitution2
Constitution2Constitution2
Constitution2
 
Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01
 
Berry et al
Berry et alBerry et al
Berry et al
 
Bureaucracy
BureaucracyBureaucracy
Bureaucracy
 
Statisticalrelationships
StatisticalrelationshipsStatisticalrelationships
Statisticalrelationships
 
Civil Rights
Civil RightsCivil Rights
Civil Rights
 
Measurement
MeasurementMeasurement
Measurement
 
Media
MediaMedia
Media
 
Politicalculture
PoliticalculturePoliticalculture
Politicalculture
 
Civil Liberties
Civil LibertiesCivil Liberties
Civil Liberties
 
Elections
ElectionsElections
Elections
 
Constitution1
Constitution1Constitution1
Constitution1
 
Democratic Theory
Democratic TheoryDemocratic Theory
Democratic Theory
 
Chapter 11 Psrm
Chapter 11 PsrmChapter 11 Psrm
Chapter 11 Psrm
 

Ähnlich wie Statistics 091208004734-phpapp01 (1)

0hypothesis testing.pdf
0hypothesis testing.pdf0hypothesis testing.pdf
0hypothesis testing.pdfAyushPandey175
 
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docxTopic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docxAASTHA76
 
Steps in hypothesis.pptx
Steps in hypothesis.pptxSteps in hypothesis.pptx
Steps in hypothesis.pptxYashwanth Rm
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxkarlhennesey
 
Chi-square IMP.ppt
Chi-square IMP.pptChi-square IMP.ppt
Chi-square IMP.pptShivraj Nile
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminologyaswhite
 
hypothesis testing overview
hypothesis testing overviewhypothesis testing overview
hypothesis testing overviewi i
 
How to read a paper
How to read a paperHow to read a paper
How to read a paperfaheta
 
Analyzing quantitative data
Analyzing quantitative dataAnalyzing quantitative data
Analyzing quantitative datamostafasharafiye
 
Statistics for management
Statistics for managementStatistics for management
Statistics for managementJohn Prarthan
 
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...Daniel Katz
 

Ähnlich wie Statistics 091208004734-phpapp01 (1) (20)

Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
0hypothesis testing.pdf
0hypothesis testing.pdf0hypothesis testing.pdf
0hypothesis testing.pdf
 
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docxTopic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
 
Steps in hypothesis.pptx
Steps in hypothesis.pptxSteps in hypothesis.pptx
Steps in hypothesis.pptx
 
Important terminologies
Important terminologiesImportant terminologies
Important terminologies
 
HYPOTHESIS
HYPOTHESISHYPOTHESIS
HYPOTHESIS
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
 
Data analysis
Data analysis Data analysis
Data analysis
 
Chi-square IMP.ppt
Chi-square IMP.pptChi-square IMP.ppt
Chi-square IMP.ppt
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
 
More Statistics
More StatisticsMore Statistics
More Statistics
 
hypothesis testing overview
hypothesis testing overviewhypothesis testing overview
hypothesis testing overview
 
How to read a paper
How to read a paperHow to read a paper
How to read a paper
 
Analyzing quantitative data
Analyzing quantitative dataAnalyzing quantitative data
Analyzing quantitative data
 
Statistics for management
Statistics for managementStatistics for management
Statistics for management
 
Rm 3 Hypothesis
Rm   3   HypothesisRm   3   Hypothesis
Rm 3 Hypothesis
 
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
Quantitative Methods for Lawyers - Class #14 - Power Laws, Hypothesis Testing...
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Chapter 1 - AP Psychology
Chapter 1 - AP PsychologyChapter 1 - AP Psychology
Chapter 1 - AP Psychology
 
Chapter_9.pptx
Chapter_9.pptxChapter_9.pptx
Chapter_9.pptx
 

Mehr von mandrewmartin (15)

Regression
RegressionRegression
Regression
 
Diffmeans
DiffmeansDiffmeans
Diffmeans
 
More tabs
More tabsMore tabs
More tabs
 
Crosstabs
CrosstabsCrosstabs
Crosstabs
 
Research design pt. 2
Research design pt. 2Research design pt. 2
Research design pt. 2
 
Research design
Research designResearch design
Research design
 
Introduction
IntroductionIntroduction
Introduction
 
Building blocks of scientific research
Building blocks of scientific researchBuilding blocks of scientific research
Building blocks of scientific research
 
Studying politics scientifically
Studying politics scientificallyStudying politics scientifically
Studying politics scientifically
 
Media
MediaMedia
Media
 
Political Parties
Political PartiesPolitical Parties
Political Parties
 
Judiciary
JudiciaryJudiciary
Judiciary
 
Congress Part 2
Congress Part 2Congress Part 2
Congress Part 2
 
Congress
CongressCongress
Congress
 
Presidency Part 2
Presidency Part 2Presidency Part 2
Presidency Part 2
 

Kürzlich hochgeladen

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Kürzlich hochgeladen (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Statistics 091208004734-phpapp01 (1)

  • 1. Statistics: First Steps Andrew Martin PS 372 University of Kentucky
  • 2. Variance Variance is a measure of dispersion of data points about the mean for interval- and ratio-level data. Variance is a fundamental concept that social scientists seek to explain in the dependent variable.
  • 3.
  • 4. Standard Deviation Standard deviation is a measure of dispersion of data points about the mean for interval- and ratio- level data. Like the mean, standard deviation is sensitive to extreme values. Standard deviation is calculated as the square root of the variance.
  • 5.
  • 6.
  • 7. Normal Distribution  The bulk of observations lie in the center, where there is a single peak.  In a normal distribution half (50 percent) of the observations lie above the mean and half lie below it.  The mean, median and mode have the same statistical values.  Fewer and fewer observations fall in the tails.  The spread of the distribution is symmetric.
  • 8. Normal Distribution  Mathematical theory allows us to know what percentage of observations lie within one (68%), two (95%) or three (98%) standard deviations of the mean.  If data are not perfectly normally distributed, the percentages will only be approximations.  Many naturally occurring variables do have nearly normal distributions.  Some can be transformed using logarithms.
  • 11.
  • 12. Example Calculate the ID and IQV for a former PS 372 class grades using the following frequencies or proportions: Grade Freq. Prop. A 4 (.12) B 7 (.21) C 4 (.12) D 7 (.21) E 12 (.34)
  • 13. Index of Diversity ID = 1 – (p2 a + p2 b + p2 c +p2 d +p2 e ) ID = 1 - (.122 + .212 + .122 + .212 + .342 ) ID = 1 - (.0144 + .0441 + .0144 + .0441 + .1156) ID = 1 - (.2326) ID = .7674
  • 14. Index of Qualitative Variation 1 – (p2 a + p2 b + p2 c +p2 d +p2 e ) 1 - (1/K)
  • 15. Index of Qualitative Variation .7674 (1 – 1/5) .9592
  • 16.
  • 17. Data Matrix A data matrix is an array of rows and columns that stores the values of a set of variables for all the cases in a data set. This is frequently referred to as a dataset.
  • 18.
  • 19.
  • 21. Properties of Good Graphs Should answer several of the following questions: (JRM 384) 1. Where does the center of the distribution lie? 2. How spread out or bunched up are the observations? 3. Does it have a single peak or more than one? 4. Approximately what proportion of observations in in the ends of the distributions?
  • 22. Properties of Good Graphs 5. Do observations tend to pile up at one end of the measurement scale, with relatively few observations at the other end? 6. Are there values that, compared with most, seem very large or very small? 7. How does one distribution compare to another in terms of shape, spread, and central tendency? 8. Do values of one variable seem related to another variable?
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28. Statistical Concepts Let's quickly review some concepts.
  • 29. Population A population refers to any well-defined set of objects such as people, countries, states, organizations, and so on. The term doesn't simply mean the population of the United States or some other geographical area.
  • 30. Population  A sample is a subset of the population.  Samples are drawn in some known manner and each case is chosen independently of the other.  From here on out, when the book uses the term sample, random sample or simple random sample, it's making reference to the same concept, which is a sample chosen at random.
  • 31. Populations  Parameters are numerical features of a population.  A sample statistic is an estimator that corresponds to a population parameter of interest and is used to estimate the population value.  Y is the sample mean, (μ) is the population mean.  ^ is a “hat”, caret or circumflex
  • 32. Two Kinds of Inference Hypothesis Testing Point and interval estimation
  • 33. Hypothesis Testing Many claims can be translated into specific statements about a population that can be confirmed or disconfirmed with the aid of probability theory. Ex: There is no ideological difference between the voting patterns between the voting patterns of Republican and Democrat justices on the U.S. Supreme Court.
  • 34. Point and Interval Estimation The goal here is to estimate unknown population parameters from samples and to surround those estimates with confidence intervals. Confidence intervals suggest the estimates reliability or precision.
  • 35. Hypothesis Testing Start with a specific verbal claim or proposition. Ex: The chances of getting heads or tails when flipping the coin is are roughly the same. Ex: The chances of the United States electing a Republican or Democrat president are roughly the same.
  • 37. Hypothesis Testing Next, the researcher constructs a null hypothesis. A null hypothesis is a statement that a population parameter equals a specific value.
  • 38. Hypothesis Testing Following up on the coin example, the null hypothesis would equal .5. Stated more formally: H0 : P = .5 Where P stands for the probability that the coin will be heads when tossed. H0 is typically used to denote a null hypothesis.
  • 39. Hypothesis Testing  Next, specify an alternative hypothesis.  An alternative hypothesis is a statement about the value or values of a population parameter. It is proposed as an alternative to the null hypothesis.  An alternative hypothesis can merely state that the population does not equal the null hypothesis, or is greater than or less than the null hypothesis.
  • 40. Hypothesis Testing Suppose you believe the coin is unfair, but have no intuition about whether it is too prone to come up heads or tails. Stated formally, the alternative hypothesis is: HA : P ≠ .5
  • 41. Hypothesis Testing Perhaps you believe the coin is more likely to come up heads than tails. You would formulate the following alternative hypothesis: HA : P > .5 Conversely, if you believe the coin is less likely to come up heads than tails, you would formulate the alternative hypothesis in the opposite direction: HA : P < .5
  • 42. Hypothesis Testing  After specifying the null and alternative hypothesis, identify the sample estimator that corresponds to the parameter in question.  The sample must come from the data, which in this case is generated by flipping a coin.
  • 43. Hypothesis Testing  Next, determine how the sample statistic is distributed in repeated random samples. That is, specify the sampling distribution of the estimator.  For example, what are the chances of getting 10 heads in 10 flips (p = 1.)? What about 9 heads in 10 flips (p = .9)? 8 flips (p = .8)?
  • 44.
  • 45. Hypothesis Testing  Make a decision rule based on some criterion of probability or likelihood.  In social sciences, a result that occurs with a probability of .05 (that is, 1 chance in 20) is considered unusual and consequently is grounds for rejecting a null hypothesis.  Other common thresholds (.01, .001) are also common..  Make the decision rule before collecting data.
  • 46. Hypothesis Testing  In light of the decision rule, define a critical region. The critical region consists of those outcomes so unlikely to occur that one has cause to reject the null hypothesis should they occur.  So there are areas of “rejection” (critical areas) and nonrejection.
  • 47.
  • 48. Hypothesis Testing  Collect a random sample and calculate the sample estimator.  Calculate the observed test statistic. A test statistic converts the sample result into a number that can be compared with the critical values specified by your decision rule and critical values.  Examine the observed test statistic to see if it falls in the critical region.  Make practical or theoretical interpretation of the findings.