SlideShare ist ein Scribd-Unternehmen logo
1 von 72
Application Software

Statistical Software
MSc. (Medical Administration) Program 2014
Post Graduate Institute of Medicine,
University of Colombo ,Sri Lanka

Dr.B.D.W.Jayamanne
M.B.B.S.,MSc.(Biostatistics),MSc.(Biomedical Informatics)
17 - 02 -2014
Outline -1

•
•
•
•
•

Statistics - overview
Data processing
Data types in computing
Data representation in computers
Data Analysis with computers
o

•

o

Statistical Software/Package - overview
o

•

Variable types
Choose of test

o

Stand alone -FOSS / Proprietary
Online resources

Data entering options
o
o
o

Spreadsheet
Database
Statistical software

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
•
•

Outline - 2
PSPP software
How to construct PSPP data file
o

•
•
•
•

o

How to import other format files
How to recode variables
Processing data
How to analyse - Parametric /Non Parametric
o
o

•

Text variables
Numeric Variables

o

Frequency
Bivariate Analysis 
Cross tab

Correlation

t test
Sub group selection

Introduction of R software

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Session 01
Overview
The research process – 8 step model
Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Statistics
What is Statistics(ස ස ‍ ස ස ?
ࡃ ࡃ ࡃ )
ࡃ

The science of collection, analysis, and making
inference / conclusion of data.

•
•
•

Collection
Analysis
Making Inference

(* the word statistic(ස ස ‍ ස ස ) has a different
ࡃ ࡃ ࡃ ࡃ
ࡃ
meaning)
Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable
Variable:
A quantity that vary from one unit to another ,the quantity
referred as a variable.

Eg: Height ,Weight,Blood Pressure, Crop yield -one value is
no sufficient
Discrete - Fixed number of possibilities (Blood Group)
Continuous - Infinite number of possibilities (BP) -even within
a finite interval

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Constant
Constant:
Opposite of a variable .If the quantity is not vary from one
unit to another that quantity is referred as a constant.
Eg. Density of an element - one value is sufficient

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data Processing - Steps
Raw Data

Interviews
Questionnaires
Observations
Interview guides
Secondary sources

Editing

Coding

<Codebook>
Coding the data
Verifying the coded data

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com

Analysis

Develop frame of analysis
Analysis
Data editing

•

Scrutinizing the completed research
instruments (identify and minimize )
Errors
o Incompleteness
o Misclassification
o Information gaps
o

•

Two ways
o
o

One variable at a time
One Questionnaire at a time

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data Types with computers

•

Boolen

•

Text/Character/String
o
o

•

Numeric
o
o

•

Single
Multiple

Integer
Decimal

Date /time

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Levels of measurement in statistics
1. Nominal scale
a.
b.

Only indicates category
Eg.Religion -Buddhism ,Christianity,Hindu

2. Ordinal scale
a.
b.

in addition to the category,allows cases to be ordered by degree
according to the measurement
Eg: very poor,Poor,OK,Good,Excellent

3. Interval scale
a.
b.
c.

Has units measuring intervals of equal distance between values measured in linear scale
No true zero
Eg: temperature in Celsius ,Date ,Latitude

4. Ratio scale
a.
b.

Has true zero
Not measured in linear scale

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data type & Scale of measurement
Data type
Measurement

•
•

Boolean
Text

•

Nominal

•

Numeric

•

Ordinal

•

Date/Time

•

Interval & Ratio scale

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data type & Scale of measurement

•
•

Identification of correct data type for the
scale of measurement is very important
before data entry
If wrongly applied
o
o
o
o

Can’t‍do‍appropriate‍analysis
Wrong conclusions
Can recode and correct the issues
Or can re-enter

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Coding of Questions

•

Open ended
o

•

o

Structured (close ended)
o

o

•

Text eg .Name
Number eg.age
Single Answer
 Yes / No - True/False
 Likert scale ( Agree -> Strongly disagree )
 Multiple Options/List - One Answer
More than one answer
 Multiple Options/List

Combined

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Coding of Questions….
1.Age :
2. Have you obtained any Postgraduate qualifications
1.Yes
2.No
3.We do not have to worry because Sri Lanka is not much
affected by climate change ?
1.Strongly agree 2.Agree 3.No opinion 4.Disagree 5.Strongly
disagree
4. You obtain information
1.TV 2.Radio
3.Newspapers
5.Journals 6.Books
Š bdwjayamanne@gmail.com/djayamanne@yahoo.com

4.Internet
Good Data File Should...
Correct coding
of Questions

Correct Data
type

Correct scale of
measurement

Good Data File

Good Analysis

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
If not….

•
•
•
•
•
•

Error correction
Cleaning of data
Recoding
Import/export
Re-Enter
Hand calculations ??

Time and resource wasting????
Distress ??
Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Statistical Software

•
•
•

Proprietary
Free
Online

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Proprietary Software list (familiar ) -2014

•
•
•
•
•
•
•
•

SPSS
MiniTAB
SAS
STATA
LISREL
MedCalc
STATISTICA
etc

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com

US $ 5,500
US $ 1,400
US $ 1,440
US $ 620
Free Software list (?? unfamiliar )

•
•

PSPP - Analog for SPSS
R and supportive packages
o

•
•
•
•

o

R Commander
Red R

Epi Info (7)
Epi Data
Win PEPI
Openepi -online Free

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data entering options

•
•

Not necessarily be a statistical software
Spreadsheet
o
o

•

o

Databases
o

•

Openoffice Calc
MS Excel
Google Spreadsheet

o

MS Access
etc

Statistical package
o
o
o
o

Epi Data
Epi Info
PSPP / SPSS
etc

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Session 02
Using Statistical Software
PSPP
Working with PSPP

•
•
•

Ver 0.8.x

Perfect Statistics Professionally Presented!
Probabilities Sometimes Prevent Problems!
People Should Prefer PSPP!!

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Introduction to PSPP

•

How to Download and install ?
o
o
o
o

•

o

Similar features with SPSS
o
o

•

Free download
Easy to install
Light weight (Small in size)
http://pspp.awardspace.com/
or simple google search download PSPP

o

Layout
Menu Commands
Scripts

Datafile & Script compatibility with SPSS

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
•
•
•
•

Advantages of PSPP
Free download / No subscription fees
Compatible with SPSS data files (similar)
Compatible with SPSS scripts
Multiplatform compatible - Has Linux versions

(Inter platform portability )

•
•
•

Faster than SPSS
> 1 billion variables(SPSS 2.15 billion,Excel
16,000)
> 1 billion cases (SPSS 2.15 billion,Excel 1million)

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Windows in PSPP
1.Data Editor(default)
a. Data view
b. Variable view
2. Output Window
3. Syntax editor

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data Editor

•
•

Provides a convenient, spreadsheet-like method for
creating and editing data files.
This window opens automatically when you start a
session.

Switch windows

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Toolbar - Data View
Save File

Jump to case
Jump to variable

OpenFile (Data/Syntax (Script),etc)

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data view
•Data View. This view displays the actual data values or
defined value labels.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
•

Data view

Rows are cases.
Each row represents a case or an observation. For
example, each individual respondent to a questionnaire is a
case.

•

Columns are variables.
Each column represents a variable or characteristic that is being
measured. For example, each item on a questionnaire is a
variable.

•

Cells contain values. Each cell contains a single value of a
variable for a case. The cell is where the case and the variable
intersect. Cells contain only data values.
**Unlike spreadsheet programs, cells in the Data Editor cannot

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
View Data labels

Menu

Toolbar

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable view
Variable View. This view displays variable definition
information, including defined variable and value
labels, data type (for example, string, date, or
numeric), measurement level (nominal, ordinal, or
scale), and user-defined missing values.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable view
•Variable View contains descriptions of the attributes of each variable in the

data file. In Variable View:
•‍Rows‍are‍variables.
•‍Columns‍are‍variable‍attributes.
•You can add or delete variables and modify attributes of

variables, including the following attributes:
•‍Variable‍name

•‍Data‍type
•‍Number‍of‍digits‍or‍characters
•‍Number‍of‍decimal‍places
•‍Descriptive‍variable‍and‍value‍labels
•‍User-defined missing values
•‍Column‍width
•‍Measurement‍level
•All‍of‍these‍attributes‍are‍saved‍when‍you‍save‍the‍data‍file.
Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable view

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable Name

•
•
•
•

Each variable name must be unique; duplication is not
allowed.
Variable names can be up to 64 bytes long, and the first
character must be a letter or one of the characters @, #, or
$. Subsequent characters can be any combination of letters
and numbers
Variable names cannot contain spaces. Can keep space
using underscores
Reserved keywords cannot be used as variable names.
Reserved keywords are
ALL, AND, BY, EQ, GE, GT, LE, LT, NE, NOT, OR, TO, and
WITH.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable Type

•

Variable Type specifies the data type for each variable. By default, all
new variables are assumed to be numeric. You can use Variable Type
to change the data type.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable Labels

•

Can assign descriptive variable labels up to 256 characters
(128 characters in double-byte languages). Variable labels
can contain spaces and reserved characters that are not
allowed in variable names.

Missing Values

•

Missing Values defines specified data values as usermissing. For example, you might want to distinguish
between data that are missing because a respondent
refused to answer and data that are missing because the
question didn't apply to that respondent. Data values that
are specified as user-missing are flagged for special
treatment and are excluded from most calculations.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Value Labels

•
•

You can assign descriptive value labels for each value of a
variable. This process is particularly useful if your data file
uses numeric codes to represent non-numeric categories
(for example, codes of 1 and 2 for male and female).
Value labels are saved with the data file. You do not need
to redefine value labels each time you open a data file.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable Measurement Level

•

Nominal.

A variable can be treated as nominal when its values represent categories
with no intrinsic ranking (for example, the department of the company in
which an employee works). Examples of nominal variables include
region, zip code, and religious affiliation.

•

Ordinal.

A variable can be treated as ordinal when its values represent categories
with some intrinsic ranking (for example, levels of service satisfaction
from highly dissatisfied to highly satisfied). Examples of ordinal variables
include attitude scores representing degree of satisfaction or confidence
and preference rating scores.

•

Scale.

A variable can be treated as scale when its values represent ordered
categories with a meaningful metric, so that distance comparisons
between values are appropriate. Examples of scale variables include
age in years and income in thousands of dollars.
Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Variable Measurement Level

Nominal

Ordinal

Interval

Are there different categories ?

Yes

Yes

Yes

Can I rank the Categories ?

No

Yes

Yes

Can I specify the difference between
categories numerically ?

No

No

Yes

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Importing Data Files -Spreadsheets

•

Should be compatible with data structure

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Importing Data Files -Spreadsheets
•

From the menus choose

– File
ÂťOpen

ÂťImport Data
ÂťSelect All spreadsheets as the file type you want to view
ÂťOpen *.xls file

Access
DBsae

Excel

Other

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Data entry
with value labels

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Compute Variables

•
•

Simple to complex (adding ,subtract,multiply..)
Type Conversions

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Recode Variables

•
•

Recode into Same Variables
To Recode Values of a Variable

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Recode into Same Variables
•The Recode into Same Variables dialog box allows you to reassign the
values of existing variables or collapse ranges of existing values into new
values. For example, you could collapse salaries into salary range
categories.
•You can recode numeric and string variables. If you select multiple
variables, they must all be the same type. You cannot recode numeric and
string variables together.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Recode into Different Variables
•The Recode into Different Variables dialog box allows you to

reassign the values of existing variables or collapse ranges
of existing values into new values for a new variable. For
example, you could collapse salaries into a new variable
containing salary-range categories.
•‍You‍can‍recode‍numeric‍and‍string‍variables.
•‍You‍can‍recode‍numeric‍variables‍into‍string‍variables‍and‍
vice versa.
•‍If‍you‍select‍multiple‍variables,‍they‍must‍all‍be‍the‍same‍
type. You cannot recode numeric and string variables
together.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Analysis
Univariate

Bivariate

Frequency Distribution Crosstabulation

Multivariate
Conditional tables

Scattergrams

Partial rank order
correlation

Regression

Multiple and partial
correlation

Rank order
Correlation

Multiple and partial
Regression

Comparison of mean

Path analysis

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Univariate : Frequency

•
•

The first thing to do when all the data are collected is to count how
many people gave particular answers to each question.
We look at how the sample is spread or distributed in the various
categories of each variable.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Univariate : Frequency...

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Measuring Central Tendency

•
•

One of the most important way of summarizing
a distribution of values for a variable is to
establish its Central Tendency
Central Tendency : The typical value in a
distribution .
o The arithmetic mean
o The median
o The mode

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Measuring Central Tendency...

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Measuring Central Tendency...

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Measuring Dispersion

•

The amount of variation shown by that distribution
is called dispersion.
 Range
 Variance

•
•

•



Standard Deviation

Range : Difference between highest and Lower
value in a distribution.
Variance : Average amount of deviation from the
mean.
Standard Deviation : Square root of the variance.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Measuring Dispersion...

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Measuring Dispersion...

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Measuring Dispersion...

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Group Selection (Select Cases)

•

Specified analysis for a category /selected
group

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Bivariate Analysis

•

The aim of bivariate analysis is to see whether
two variables are related.
o Cross Tabulation
o Bivariate Correlation

o t test

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Crosstabulation

•

Crosstabulation are a way of displaying data so that we
fairly readily detect association between two variable.

Steps of Crosstabulation

•
•
•
•

Determine which variable is to be treated as independent.
The independent variable is usually placed across the top of
the variable and a column is drawn for each category of that
variable.
The dependent variable is usually placed on the side of the
table and a row is drawn for each category of that variable
Compare percentages for each subgroups of the
independent variable within one category of the dependent
variable at a time.

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Crosstabulation

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Bivariate Correlation
(Only Pearson Correlation )

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
t Test
•In‍order‍to‍determine‍whether‍a‍set‍or‍sets‍of‍
scores are from the same population, a t-test
used
•There‍are‍three‍main‍types‍of‍t-test:
•One‍–Sample
•Independent‍groups

•Repeated‍measures/Related‍samples

• Assumptions

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
One –Sample

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Independent groups

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Related Groups

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
R software
http://cran.r-project.org/bin/windows/base/

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
R commander

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
R commander - Import data files

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
R commander - Menu commands

Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
Thank you

Have a statistically significant day !

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

The Lived Experiences of Black Women Faculty in the Instructional Technology ...
The Lived Experiences of Black Women Faculty in the Instructional Technology ...The Lived Experiences of Black Women Faculty in the Instructional Technology ...
The Lived Experiences of Black Women Faculty in the Instructional Technology ...
 
Database user’s
Database user’sDatabase user’s
Database user’s
 
Database index by Reema Gajjar
Database index by Reema GajjarDatabase index by Reema Gajjar
Database index by Reema Gajjar
 
An introduction to MongoDB
An introduction to MongoDBAn introduction to MongoDB
An introduction to MongoDB
 
Grounded Theory
Grounded Theory Grounded Theory
Grounded Theory
 
Joins in SQL
Joins in SQLJoins in SQL
Joins in SQL
 
File Organization
File OrganizationFile Organization
File Organization
 
Developing good research questions
Developing good research questionsDeveloping good research questions
Developing good research questions
 
DBMS
DBMSDBMS
DBMS
 
Denormalization
DenormalizationDenormalization
Denormalization
 
Database Keys & Relationship
Database Keys & RelationshipDatabase Keys & Relationship
Database Keys & Relationship
 
Flow chart
Flow chartFlow chart
Flow chart
 
database recovery techniques
database recovery techniques database recovery techniques
database recovery techniques
 
Respondent Driven Sampling
Respondent Driven Sampling Respondent Driven Sampling
Respondent Driven Sampling
 
Access ppt
Access ppt Access ppt
Access ppt
 
Unit01 dbms
Unit01 dbmsUnit01 dbms
Unit01 dbms
 
Stored procedure
Stored procedureStored procedure
Stored procedure
 
Database development progress(database)
Database development progress(database)Database development progress(database)
Database development progress(database)
 
Database management system
Database management systemDatabase management system
Database management system
 
Os presentation
Os presentationOs presentation
Os presentation
 

Andere mochten auch

การใช้งานโปรแกรมPspp
การใช้งานโปรแกรมPsppการใช้งานโปรแกรมPspp
การใช้งานโปรแกรมPsppekkawit sittiwa
 
โปรแกรม Webpage maker
โปรแกรม Webpage makerโปรแกรม Webpage maker
โปรแกรม Webpage maker491320129
 
Research process
Research processResearch process
Research processaditi garg
 
Pelatihan Bahasa R
Pelatihan Bahasa RPelatihan Bahasa R
Pelatihan Bahasa Ranom0164
 
Quantitative Data Analysis
Quantitative Data AnalysisQuantitative Data Analysis
Quantitative Data AnalysisAsma Muhamad
 

Andere mochten auch (10)

การใช้งานโปรแกรมPspp
การใช้งานโปรแกรมPsppการใช้งานโปรแกรมPspp
การใช้งานโปรแกรมPspp
 
โปรแกรม Webpage maker
โปรแกรม Webpage makerโปรแกรม Webpage maker
โปรแกรม Webpage maker
 
Research process
Research processResearch process
Research process
 
Pelatihan Bahasa R
Pelatihan Bahasa RPelatihan Bahasa R
Pelatihan Bahasa R
 
Ap ch 20
Ap ch 20Ap ch 20
Ap ch 20
 
Class ppt intro to r
Class ppt intro to rClass ppt intro to r
Class ppt intro to r
 
Research Methods
Research Methods Research Methods
Research Methods
 
Research question
Research questionResearch question
Research question
 
Quantitative Data Analysis
Quantitative Data AnalysisQuantitative Data Analysis
Quantitative Data Analysis
 
Qualitative data analysis
Qualitative data analysisQualitative data analysis
Qualitative data analysis
 

Ähnlich wie PSPP overview and Introduction to R & R Commander

Data Processing DOH Workshop.pptx
Data Processing DOH Workshop.pptxData Processing DOH Workshop.pptx
Data Processing DOH Workshop.pptxcharlslabarda
 
APSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptxAPSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptxMariaMalikAwan
 
Data analysis using spss
Data analysis using spssData analysis using spss
Data analysis using spssSyed Faisal
 
RSS 2012 Data Entry SPSS
RSS 2012 Data Entry SPSSRSS 2012 Data Entry SPSS
RSS 2012 Data Entry SPSSWesam Abuznadah
 
Introduction to the Research Method and SPSS
Introduction to the Research Method and SPSS Introduction to the Research Method and SPSS
Introduction to the Research Method and SPSS emuptv
 
5.Measurement and scaling technique.pptx
5.Measurement and scaling technique.pptx5.Measurement and scaling technique.pptx
5.Measurement and scaling technique.pptxHimaniPandya13
 
Introduction to Statistics Variable_part 2.pptx
Introduction to Statistics Variable_part 2.pptxIntroduction to Statistics Variable_part 2.pptx
Introduction to Statistics Variable_part 2.pptxMdAshrafulAlamRayhan
 
Introduction to Statistics Variable_part 2.pdf
Introduction to Statistics Variable_part 2.pdfIntroduction to Statistics Variable_part 2.pdf
Introduction to Statistics Variable_part 2.pdfAtoshe Elmi
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aRai University
 
CV_Gangadhar 1
CV_Gangadhar 1CV_Gangadhar 1
CV_Gangadhar 1Gangadhar H
 
UNIT 4.pptx
UNIT 4.pptxUNIT 4.pptx
UNIT 4.pptxSreeLatha98
 
Data Science and Analysis.pptx
Data Science and Analysis.pptxData Science and Analysis.pptx
Data Science and Analysis.pptxPrashantYadav931011
 
SPSS Assignment Answers
SPSS Assignment AnswersSPSS Assignment Answers
SPSS Assignment AnswersLesa Cote
 
20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vs20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vsIan Feller
 
4 Statistical Software.pptx
4 Statistical Software.pptx4 Statistical Software.pptx
4 Statistical Software.pptxkaleabtegegne
 

Ähnlich wie PSPP overview and Introduction to R & R Commander (20)

Data Processing DOH Workshop.pptx
Data Processing DOH Workshop.pptxData Processing DOH Workshop.pptx
Data Processing DOH Workshop.pptx
 
APSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptxAPSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptx
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptx
 
Pelatihan Data Analitik
Pelatihan Data AnalitikPelatihan Data Analitik
Pelatihan Data Analitik
 
Data analysis using spss
Data analysis using spssData analysis using spss
Data analysis using spss
 
Presentation 1.pptx
Presentation 1.pptxPresentation 1.pptx
Presentation 1.pptx
 
RSS 2012 Data Entry SPSS
RSS 2012 Data Entry SPSSRSS 2012 Data Entry SPSS
RSS 2012 Data Entry SPSS
 
Introduction to the Research Method and SPSS
Introduction to the Research Method and SPSS Introduction to the Research Method and SPSS
Introduction to the Research Method and SPSS
 
5.Measurement and scaling technique.pptx
5.Measurement and scaling technique.pptx5.Measurement and scaling technique.pptx
5.Measurement and scaling technique.pptx
 
Introduction to Statistics Variable_part 2.pptx
Introduction to Statistics Variable_part 2.pptxIntroduction to Statistics Variable_part 2.pptx
Introduction to Statistics Variable_part 2.pptx
 
Introduction to Statistics Variable_part 2.pdf
Introduction to Statistics Variable_part 2.pdfIntroduction to Statistics Variable_part 2.pdf
Introduction to Statistics Variable_part 2.pdf
 
RM UNIT 6.pptx
RM UNIT 6.pptxRM UNIT 6.pptx
RM UNIT 6.pptx
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation a
 
CV_Gangadhar 1
CV_Gangadhar 1CV_Gangadhar 1
CV_Gangadhar 1
 
UNIT 4.pptx
UNIT 4.pptxUNIT 4.pptx
UNIT 4.pptx
 
Data Science and Analysis.pptx
Data Science and Analysis.pptxData Science and Analysis.pptx
Data Science and Analysis.pptx
 
SPSS Assignment Answers
SPSS Assignment AnswersSPSS Assignment Answers
SPSS Assignment Answers
 
Data mining
Data miningData mining
Data mining
 
20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vs20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vs
 
4 Statistical Software.pptx
4 Statistical Software.pptx4 Statistical Software.pptx
4 Statistical Software.pptx
 

Mehr von Bernard Deepal W. Jayamanne

MD Paediatrics (Part 1) - Overview of Basic Statistics
MD Paediatrics (Part 1) - Overview of Basic StatisticsMD Paediatrics (Part 1) - Overview of Basic Statistics
MD Paediatrics (Part 1) - Overview of Basic StatisticsBernard Deepal W. Jayamanne
 
Study of Hypopigmented patches children attending to the Dermatology clinic i...
Study of Hypopigmented patches children attending to the Dermatology clinic i...Study of Hypopigmented patches children attending to the Dermatology clinic i...
Study of Hypopigmented patches children attending to the Dermatology clinic i...Bernard Deepal W. Jayamanne
 
Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...
Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...
Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...Bernard Deepal W. Jayamanne
 
MD Administration (Part1) - Lab Session -Data Representation and Processing
MD Administration (Part1) - Lab Session -Data Representation and Processing MD Administration (Part1) - Lab Session -Data Representation and Processing
MD Administration (Part1) - Lab Session -Data Representation and Processing Bernard Deepal W. Jayamanne
 
MD Paediatrics (Part 2) - Overview of Clinical Trials
MD Paediatrics (Part 2) - Overview of Clinical TrialsMD Paediatrics (Part 2) - Overview of Clinical Trials
MD Paediatrics (Part 2) - Overview of Clinical TrialsBernard Deepal W. Jayamanne
 
MD Paediatricts (Part 2) - Epidemiology and Statistics
MD Paediatricts (Part 2) - Epidemiology and StatisticsMD Paediatricts (Part 2) - Epidemiology and Statistics
MD Paediatricts (Part 2) - Epidemiology and StatisticsBernard Deepal W. Jayamanne
 

Mehr von Bernard Deepal W. Jayamanne (7)

Introduction to Health GIS
Introduction  to Health GIS Introduction  to Health GIS
Introduction to Health GIS
 
MD Paediatrics (Part 1) - Overview of Basic Statistics
MD Paediatrics (Part 1) - Overview of Basic StatisticsMD Paediatrics (Part 1) - Overview of Basic Statistics
MD Paediatrics (Part 1) - Overview of Basic Statistics
 
Study of Hypopigmented patches children attending to the Dermatology clinic i...
Study of Hypopigmented patches children attending to the Dermatology clinic i...Study of Hypopigmented patches children attending to the Dermatology clinic i...
Study of Hypopigmented patches children attending to the Dermatology clinic i...
 
Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...
Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...
Molluscum Contagiosum and Nutritional Level of Children Attending to a tertia...
 
MD Administration (Part1) - Lab Session -Data Representation and Processing
MD Administration (Part1) - Lab Session -Data Representation and Processing MD Administration (Part1) - Lab Session -Data Representation and Processing
MD Administration (Part1) - Lab Session -Data Representation and Processing
 
MD Paediatrics (Part 2) - Overview of Clinical Trials
MD Paediatrics (Part 2) - Overview of Clinical TrialsMD Paediatrics (Part 2) - Overview of Clinical Trials
MD Paediatrics (Part 2) - Overview of Clinical Trials
 
MD Paediatricts (Part 2) - Epidemiology and Statistics
MD Paediatricts (Part 2) - Epidemiology and StatisticsMD Paediatricts (Part 2) - Epidemiology and Statistics
MD Paediatricts (Part 2) - Epidemiology and Statistics
 

KĂźrzlich hochgeladen

How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Pooja Bhuva
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 

KĂźrzlich hochgeladen (20)

How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 

PSPP overview and Introduction to R & R Commander

  • 1. Application Software Statistical Software MSc. (Medical Administration) Program 2014 Post Graduate Institute of Medicine, University of Colombo ,Sri Lanka Dr.B.D.W.Jayamanne M.B.B.S.,MSc.(Biostatistics),MSc.(Biomedical Informatics) 17 - 02 -2014
  • 2. Outline -1 • • • • • Statistics - overview Data processing Data types in computing Data representation in computers Data Analysis with computers o • o Statistical Software/Package - overview o • Variable types Choose of test o Stand alone -FOSS / Proprietary Online resources Data entering options o o o Spreadsheet Database Statistical software Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 3. • • Outline - 2 PSPP software How to construct PSPP data file o • • • • o How to import other format files How to recode variables Processing data How to analyse - Parametric /Non Parametric o o • Text variables Numeric Variables o Frequency Bivariate Analysis  Cross tab  Correlation  t test Sub group selection Introduction of R software Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 5. The research process – 8 step model Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 6. Statistics What is Statistics(ࡃ ࡃ ‍ ࡃ ࡃ ? ࡃ ࡃ ࡃ ) ࡃ The science of collection, analysis, and making inference / conclusion of data. • • • Collection Analysis Making Inference (* the word statistic(ࡃ ࡃ ‍ ࡃ ࡃ ) has a different ࡃ ࡃ ࡃ ࡃ ࡃ meaning) Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 7. Variable Variable: A quantity that vary from one unit to another ,the quantity referred as a variable. Eg: Height ,Weight,Blood Pressure, Crop yield -one value is no sufficient Discrete - Fixed number of possibilities (Blood Group) Continuous - Infinite number of possibilities (BP) -even within a finite interval Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 8. Constant Constant: Opposite of a variable .If the quantity is not vary from one unit to another that quantity is referred as a constant. Eg. Density of an element - one value is sufficient Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 9. Data Processing - Steps Raw Data Interviews Questionnaires Observations Interview guides Secondary sources Editing Coding <Codebook> Coding the data Verifying the coded data Š bdwjayamanne@gmail.com/djayamanne@yahoo.com Analysis Develop frame of analysis Analysis
  • 10. Data editing • Scrutinizing the completed research instruments (identify and minimize ) Errors o Incompleteness o Misclassification o Information gaps o • Two ways o o One variable at a time One Questionnaire at a time Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 11. Data Types with computers • Boolen • Text/Character/String o o • Numeric o o • Single Multiple Integer Decimal Date /time Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 12. Levels of measurement in statistics 1. Nominal scale a. b. Only indicates category Eg.Religion -Buddhism ,Christianity,Hindu 2. Ordinal scale a. b. in addition to the category,allows cases to be ordered by degree according to the measurement Eg: very poor,Poor,OK,Good,Excellent 3. Interval scale a. b. c. Has units measuring intervals of equal distance between values measured in linear scale No true zero Eg: temperature in Celsius ,Date ,Latitude 4. Ratio scale a. b. Has true zero Not measured in linear scale Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 13. Data type & Scale of measurement Data type Measurement • • Boolean Text • Nominal • Numeric • Ordinal • Date/Time • Interval & Ratio scale Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 14. Data type & Scale of measurement • • Identification of correct data type for the scale of measurement is very important before data entry If wrongly applied o o o o Can’t‍do‍appropriate‍analysis Wrong conclusions Can recode and correct the issues Or can re-enter Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 15. Coding of Questions • Open ended o • o Structured (close ended) o o • Text eg .Name Number eg.age Single Answer  Yes / No - True/False  Likert scale ( Agree -> Strongly disagree )  Multiple Options/List - One Answer More than one answer  Multiple Options/List Combined Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 16. Coding of Questions…. 1.Age : 2. Have you obtained any Postgraduate qualifications 1.Yes 2.No 3.We do not have to worry because Sri Lanka is not much affected by climate change ? 1.Strongly agree 2.Agree 3.No opinion 4.Disagree 5.Strongly disagree 4. You obtain information 1.TV 2.Radio 3.Newspapers 5.Journals 6.Books Š bdwjayamanne@gmail.com/djayamanne@yahoo.com 4.Internet
  • 17. Good Data File Should... Correct coding of Questions Correct Data type Correct scale of measurement Good Data File Good Analysis Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 18. If not…. • • • • • • Error correction Cleaning of data Recoding Import/export Re-Enter Hand calculations ?? Time and resource wasting???? Distress ?? Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 20. Proprietary Software list (familiar ) -2014 • • • • • • • • SPSS MiniTAB SAS STATA LISREL MedCalc STATISTICA etc Š bdwjayamanne@gmail.com/djayamanne@yahoo.com US $ 5,500 US $ 1,400 US $ 1,440 US $ 620
  • 21. Free Software list (?? unfamiliar ) • • PSPP - Analog for SPSS R and supportive packages o • • • • o R Commander Red R Epi Info (7) Epi Data Win PEPI Openepi -online Free Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 22. Data entering options • • Not necessarily be a statistical software Spreadsheet o o • o Databases o • Openoffice Calc MS Excel Google Spreadsheet o MS Access etc Statistical package o o o o Epi Data Epi Info PSPP / SPSS etc Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 24. Working with PSPP • • • Ver 0.8.x Perfect Statistics Professionally Presented! Probabilities Sometimes Prevent Problems! People Should Prefer PSPP!! Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 25. Introduction to PSPP • How to Download and install ? o o o o • o Similar features with SPSS o o • Free download Easy to install Light weight (Small in size) http://pspp.awardspace.com/ or simple google search download PSPP o Layout Menu Commands Scripts Datafile & Script compatibility with SPSS Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 26. • • • • Advantages of PSPP Free download / No subscription fees Compatible with SPSS data files (similar) Compatible with SPSS scripts Multiplatform compatible - Has Linux versions (Inter platform portability ) • • • Faster than SPSS > 1 billion variables(SPSS 2.15 billion,Excel 16,000) > 1 billion cases (SPSS 2.15 billion,Excel 1million) Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 27. Windows in PSPP 1.Data Editor(default) a. Data view b. Variable view 2. Output Window 3. Syntax editor Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 28. Data Editor • • Provides a convenient, spreadsheet-like method for creating and editing data files. This window opens automatically when you start a session. Switch windows Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 29. Toolbar - Data View Save File Jump to case Jump to variable OpenFile (Data/Syntax (Script),etc) Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 30. Data view •Data View. This view displays the actual data values or defined value labels. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 31. • Data view Rows are cases. Each row represents a case or an observation. For example, each individual respondent to a questionnaire is a case. • Columns are variables. Each column represents a variable or characteristic that is being measured. For example, each item on a questionnaire is a variable. • Cells contain values. Each cell contains a single value of a variable for a case. The cell is where the case and the variable intersect. Cells contain only data values. **Unlike spreadsheet programs, cells in the Data Editor cannot Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 32. View Data labels Menu Toolbar Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 33. Variable view Variable View. This view displays variable definition information, including defined variable and value labels, data type (for example, string, date, or numeric), measurement level (nominal, ordinal, or scale), and user-defined missing values. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 34. Variable view •Variable View contains descriptions of the attributes of each variable in the data file. In Variable View: •‍Rows‍are‍variables. •‍Columns‍are‍variable‍attributes. •You can add or delete variables and modify attributes of variables, including the following attributes: •‍Variable‍name •‍Data‍type •‍Number‍of‍digits‍or‍characters •‍Number‍of‍decimal‍places •‍Descriptive‍variable‍and‍value‍labels •‍User-defined missing values •‍Column‍width •‍Measurement‍level •All‍of‍these‍attributes‍are‍saved‍when‍you‍save‍the‍data‍file. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 36. Variable Name • • • • Each variable name must be unique; duplication is not allowed. Variable names can be up to 64 bytes long, and the first character must be a letter or one of the characters @, #, or $. Subsequent characters can be any combination of letters and numbers Variable names cannot contain spaces. Can keep space using underscores Reserved keywords cannot be used as variable names. Reserved keywords are ALL, AND, BY, EQ, GE, GT, LE, LT, NE, NOT, OR, TO, and WITH. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 37. Variable Type • Variable Type specifies the data type for each variable. By default, all new variables are assumed to be numeric. You can use Variable Type to change the data type. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 38. Variable Labels • Can assign descriptive variable labels up to 256 characters (128 characters in double-byte languages). Variable labels can contain spaces and reserved characters that are not allowed in variable names. Missing Values • Missing Values defines specified data values as usermissing. For example, you might want to distinguish between data that are missing because a respondent refused to answer and data that are missing because the question didn't apply to that respondent. Data values that are specified as user-missing are flagged for special treatment and are excluded from most calculations. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 39. Value Labels • • You can assign descriptive value labels for each value of a variable. This process is particularly useful if your data file uses numeric codes to represent non-numeric categories (for example, codes of 1 and 2 for male and female). Value labels are saved with the data file. You do not need to redefine value labels each time you open a data file. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 40. Variable Measurement Level • Nominal. A variable can be treated as nominal when its values represent categories with no intrinsic ranking (for example, the department of the company in which an employee works). Examples of nominal variables include region, zip code, and religious affiliation. • Ordinal. A variable can be treated as ordinal when its values represent categories with some intrinsic ranking (for example, levels of service satisfaction from highly dissatisfied to highly satisfied). Examples of ordinal variables include attitude scores representing degree of satisfaction or confidence and preference rating scores. • Scale. A variable can be treated as scale when its values represent ordered categories with a meaningful metric, so that distance comparisons between values are appropriate. Examples of scale variables include age in years and income in thousands of dollars. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 41. Variable Measurement Level Nominal Ordinal Interval Are there different categories ? Yes Yes Yes Can I rank the Categories ? No Yes Yes Can I specify the difference between categories numerically ? No No Yes Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 42. Importing Data Files -Spreadsheets • Should be compatible with data structure Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 43. Importing Data Files -Spreadsheets • From the menus choose – File ÂťOpen ÂťImport Data ÂťSelect All spreadsheets as the file type you want to view ÂťOpen *.xls file Access DBsae Excel Other Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 44. Data entry with value labels Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 45. Compute Variables • • Simple to complex (adding ,subtract,multiply..) Type Conversions Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 46. Recode Variables • • Recode into Same Variables To Recode Values of a Variable Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 47. Recode into Same Variables •The Recode into Same Variables dialog box allows you to reassign the values of existing variables or collapse ranges of existing values into new values. For example, you could collapse salaries into salary range categories. •You can recode numeric and string variables. If you select multiple variables, they must all be the same type. You cannot recode numeric and string variables together. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 48. Recode into Different Variables •The Recode into Different Variables dialog box allows you to reassign the values of existing variables or collapse ranges of existing values into new values for a new variable. For example, you could collapse salaries into a new variable containing salary-range categories. •‍You‍can‍recode‍numeric‍and‍string‍variables. •‍You‍can‍recode‍numeric‍variables‍into‍string‍variables‍and‍ vice versa. •‍If‍you‍select‍multiple‍variables,‍they‍must‍all‍be‍the‍same‍ type. You cannot recode numeric and string variables together. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 49. Analysis Univariate Bivariate Frequency Distribution Crosstabulation Multivariate Conditional tables Scattergrams Partial rank order correlation Regression Multiple and partial correlation Rank order Correlation Multiple and partial Regression Comparison of mean Path analysis Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 50. Univariate : Frequency • • The first thing to do when all the data are collected is to count how many people gave particular answers to each question. We look at how the sample is spread or distributed in the various categories of each variable. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 51. Univariate : Frequency... Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 52. Measuring Central Tendency • • One of the most important way of summarizing a distribution of values for a variable is to establish its Central Tendency Central Tendency : The typical value in a distribution . o The arithmetic mean o The median o The mode Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 53. Measuring Central Tendency... Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 54. Measuring Central Tendency... Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 55. Measuring Dispersion • The amount of variation shown by that distribution is called dispersion.  Range  Variance • • •  Standard Deviation Range : Difference between highest and Lower value in a distribution. Variance : Average amount of deviation from the mean. Standard Deviation : Square root of the variance. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 59. Group Selection (Select Cases) • Specified analysis for a category /selected group Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 60. Bivariate Analysis • The aim of bivariate analysis is to see whether two variables are related. o Cross Tabulation o Bivariate Correlation o t test Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 61. Crosstabulation • Crosstabulation are a way of displaying data so that we fairly readily detect association between two variable. Steps of Crosstabulation • • • • Determine which variable is to be treated as independent. The independent variable is usually placed across the top of the variable and a column is drawn for each category of that variable. The dependent variable is usually placed on the side of the table and a row is drawn for each category of that variable Compare percentages for each subgroups of the independent variable within one category of the dependent variable at a time. Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 63. Bivariate Correlation (Only Pearson Correlation ) Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 64. t Test •In‍order‍to‍determine‍whether‍a‍set‍or‍sets‍of‍ scores are from the same population, a t-test used •There‍are‍three‍main‍types‍of‍t-test: •One‍–Sample •Independent‍groups •Repeated‍measures/Related‍samples • Assumptions Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 70. R commander - Import data files Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 71. R commander - Menu commands Š bdwjayamanne@gmail.com/djayamanne@yahoo.com
  • 72. Thank you Have a statistically significant day !