Weitere ähnliche Inhalte Mehr von SAKAUE, Tatsuya (20) Kürzlich hochgeladen (20) HiRoshima.R #2 入門者講習資料1. Hi oshima.
2012-02-11 HiRoshima.R #2 @
Friday, February 10, 12 1
3. 0.
• (SAKAUE, Tatsuya)
• : ...
• Nagoya.R / HiRoshima.R
• ID: sakaue
• ...
Friday, February 10, 12 4
5. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 7
6. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 8
7. •
• Ross Ihaka & Robert Gentleman (R & R)
• Ross Ihaka and Robert Gentleman. R: A language for data analysis and graphics.
Journal of Computational and Graphical Statistics, 5(3):299-314, 1996.
• http://biostat.mc.vanderbilt.edu/twiki/pub/Main/Je reyHorner/JCGSR.pdf
•
Friday, February 10, 12 9
9. • IBM SPSS Statistics 10 ...
•
•
Friday, February 10, 12 11
10. •
•
•
•
•
Friday, February 10, 12 12
12. Excel SPSS ...
•
•
• Excel
•
Friday, February 10, 12 14
13. Excel
SPSS
Friday, February 10, 12 15
16. • Windows: : http://cran.md.tsukuba.ac.jp/bin/windows/base/
• Download R 2.14.0 for Windows
• Mac OS X: http://cran.md.tsukuba.ac.jp/bin/macosx/
• R-2.14.0.pkg (latest version)
Friday, February 10, 12 18
24. •
•
• Windows
•
Friday, February 10, 12 26
25. • Windows
• [ Windows XP] →
• [ Windows 7 ] →
• Mac OS X
• /Applications /Library
Frameworks R. framework CleanApp
Friday, February 10, 12 27
26. • Windows
• START → Program → R → R 2.14.1
• Mac OS X
• /Applications R
R
Friday, February 10, 12 28
27. • Windows Mac OS X
1. q()
2.
3. R
Friday, February 10, 12 29
28. ...
• help(sth)
• seekR (http://seekr.jp/)
• R SEEK (http://www.rseek.org/)
• RjpWiki (http://www.okada.jp.org/RWiki/)
• R-Tips (http://cse.naro.a rc.go.jp/takezawa/r-tips/r.html)
•R (http://aoki2.si.gunma-u.ac.jp/R/)
Friday, February 10, 12 30
29. •
•
• help(sth) !!
Friday, February 10, 12 31
30. Excel
SPSS
Friday, February 10, 12 32
31. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 33
32. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 34
34. • [Enter]
• > 3+5 [Enter]
• > 10-3 [Enter]
• > 2*3 [Enter]
• > 100/20 [Enter]
• > (12 + 34 -56) * 78 / 90 [Enter]
Friday, February 10, 12 36
35. •
• > 100^1/2
• > 100^(1/2)
Friday, February 10, 12 37
39. “I don't know !” by fmgbain http://www.flickr.com/photos/fmgbain/4382010455/
Friday, February 10, 12 41
41. sqrt()
• > sqrt(2)
• > sqrt(144)
• > sqrt(104976)
Friday, February 10, 12 43
42. ( )
•
•q() help(sth)
•
Friday, February 10, 12 44
48. “I don't know !” by fmgbain http://www.flickr.com/photos/fmgbain/4382010455/
Friday, February 10, 12 50
50. “hako”
• > hako <- c(1,2,3,4,5)
• > hako
• c() concatenate/combine
•
Friday, February 10, 12 52
51. c() “<-”
hako <- c(1,2,3,4,5) “<-”
←
Friday, February 10, 12 53
52. + ...
• [Enter]
• [STOP]
• [Esc]
Friday, February 10, 12 54
53. •
• Tab
Friday, February 10, 12 55
54. hako
1 5 5
Friday, February 10, 12 56
56. • > sqrt(hako)
• > log(hako)
Friday, February 10, 12 58
60. • summary()
• table()
• sum()
• length()
Friday, February 10, 12 62
61. • mean()
• max(), min()
• median()
• sd()
Friday, February 10, 12 63
62. •R
•
• q(), help(), sqrt(), log(), c()
•
Friday, February 10, 12 64
64. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 66
65. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 67
66. Sub topics
1.
2.
3.
Friday, February 10, 12 68
69. ...
•
•
• TOEIC
Friday, February 10, 12 71
70. ...
A 180 75
B 170 65
C 165 60
D 175 70
E 190 80
Friday, February 10, 12 72
73. 1 2 3
4 5 6
7 8 9
Friday, February 10, 12 75
75. 1 2 3
4 5 6
7 8 9
Friday, February 10, 12 77
76. 1 2 3
4 5 6
7 8 9
Friday, February 10, 12 78
77. 1 2 3
4 5 6
7 8 9
Friday, February 10, 12 79
79. 1 2 3
4 5 6
7 8 9
Friday, February 10, 12 81
80. 1 2 3
4 5 6
7 8 9
Friday, February 10, 12 82
81. 1 2 3
4 5 6
7 8 9
Friday, February 10, 12 83
84. matrix()
•
• matrix( , , )
•
Friday, February 10, 12 86
85. A 180 75
B 170 65
C 165 60
D 175 70
E 190 80
Friday, February 10, 12 87
86. karada <- matrix(c(180, 170, 165,
175, 190, 75, 65, 60, 70, 80), 5, 2)
1. c()
2. matrix()
• 5 2
3. karada
Friday, February 10, 12 88
90. 1.
2.
Friday, February 10, 12 92
92. 2 → demo[2,]
2 → demo[,2]
http://gyazo.com/6726084afd9e1cc4b03df85fe6bc0f29.png
Friday, February 10, 12 94
93. 2 4
→ demo[c(2,4),]
2 4
→ demo[, c(2,4)]
Friday, February 10, 12 95
95. •
•
•
Friday, February 10, 12 97
96. Sub topics
1.
2.
3.
Friday, February 10, 12 98
97. •
•
•
Friday, February 10, 12 99
98. B 175 65
B 177 70
B 174 75
A 179 70
O 170 60
Friday, February 10, 12 100
99. • Excel WS
•
•
Friday, February 10, 12 101
100. ...
• data.frame() (p.240)
• (p.243)
• (p.243)
Friday, February 10, 12 102
102. “173.365 - Come on, feel the noise” by Je the Trojan
http://www. ickr.com/photos/trojanguy/3046207115/
Friday, February 10, 12 104
104. Sub topics
1.
2.
3.
Friday, February 10, 12 106
106. ...
• Excel
•
→
Friday, February 10, 12 108
107. • ...
•
•
Friday, February 10, 12 109
108. • Windows →
• “MyDocuments”
• Mac OS X →
• Linux up to you...
Friday, February 10, 12 110
109. •
• getwd()
• > getwd()
• setwd()
• > setwd("/Users/sakaue/Desktop/")
Friday, February 10, 12 111
110. • read.csv()
• CSV
• CSV: Comma Separated Value
Friday, February 10, 12 112
111. 1. demo.csv
• XLS/XLSX
• CSV UTF-8
2. > test <- read.csv(“demo.csv”)
3. > test [Enter]
Friday, February 10, 12 113
112. CSV
...
Friday, February 10, 12 114
114. 1. demo.xls
2. > test2 <- read.delim("clipboard")
Mac : read.delim(pipe(“pbpaste”))
3. > test2 [Enter]
Friday, February 10, 12 116
115. > table(test2[,1])
• 1
> mean(test2[,2])
• 2
> hist(test2[,2])
• 2
Friday, February 10, 12 117
116. •
• CSV read.csv()
• or
Friday, February 10, 12 118
117. CSV
Excel
“y2.d175 | Lasershow! Relax!” by B Rosen
http://www.flickr.com/photos/rosengrant/4751386872/
Friday, February 10, 12 119
118. Excel
SPSS
Friday, February 10, 12 120
119. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 121
120. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 122
123. 1.
2.
Friday, February 10, 12 125
124. 2
Excel SPSS
... orz
Friday, February 10, 12 126
126. > age <- c(18, 23, 14, 19,
21, 29, 22, 21, 23, 19, 20, 20,
26, 18, 14, 6, 8, 16, 23, 20)
> hist(age)
Friday, February 10, 12 128
128. > score <- c(60, 50, 72, 43, 50,
55, 43, 50, 85, 40)
> words <- c(340, 190, 465, 170,
130, 225,140, 310, 580, 120)
> plot(score,words)
Friday, February 10, 12 130
130. > high <- c(350, 285, 315, 340,
210, 185, 120, 740, 425, 155)
> coll <- c(365, 570, 645, 540, 645,
665, 880, 550, 410, 585)
> boxplot(high, coll, names=c("High", "Coll"))
Friday, February 10, 12 132
131. 1.
2.
3.
twitter, FB
Friday, February 10, 12 133
132. Excel
“y2.d175 | Lasershow! Relax!” by B Rosen
http://www.flickr.com/photos/rosengrant/4751386872/
Friday, February 10, 12 134
133. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 135
134. Agenda
1. R
2.
3.
4.
5. R
Friday, February 10, 12 136
138. • H0
• H1
• H0 (t, χ2, F )
•
Friday, February 10, 12 140
139. t
•
•R t.test()
Friday, February 10, 12 141
140. t
•
> like <- c(6,10,6,10,7,8,7,9,10,4)
> dislike <- c(3,5,6,4,4,8,4,5,4,7)
> t.test(like,dislike,var.equal=TRUE)
Friday, February 10, 12 142
141. t
> t.test(like,dislike,var.equal=TRUE)
Two Sample t-test
data: like and dislike
t = 3.3041, df = 18, p-value = 0.003946
alternative hypothesis: true difference in
means is not equal to 0 #
95 percent confidence interval: #
0.9831754 4.4168246
sample estimates:
mean of x mean of y
7.7 5.0
Friday, February 10, 12 143
142. t ...
“
t Welch t
Welch t ”
http://aoki2.si.gunma-u.ac.jp/lecture/Average/bunsan1.html
Friday, February 10, 12 144
144. ! !
― Welch ―
• http://oku.edu.mie-u.ac.jp/~okumura/blog/node/2262
• http://aoki2.si.gunma-u.ac.jp/lecture/BF/index.html
• Donald W. Zimmerman, ``Some properties of preliminary tests of equality of variances
in the two-sample location problem'', The Journal of General Psychology, Vol.123, pp.
217-231 (1996)
• The unequal variance t-test is an underused alternative to Student's t-test and the
Mann-Whitney U test -- Ruxton 17 (4): 688 -- Behavioral Ecology
• : Mann-Whitney U http://qdai.way-
nifty.com/qjes/2005/02/mannwhitneyu.html
Friday, February 10, 12 146
145. ...
> t.test(like,dislike,var.equal=FALSE)
Welch Two Sample t-test
data: like and dislike
t = 3.3041, df = 16.795, p-value = 0.004249
alternative hypothesis: true difference in
means is not equal to 0
95 percent confidence interval:
0.9743014 4.4256986
sample estimates:
mean of x mean of y
7.7 5.0
Friday, February 10, 12 147
146. t t ...
• t t ...
• Student t
t
• s^2
s t ...
• http://ja.wikipedia.org/wiki/%E3%82%A6%E3%82%A3%E3%83%AA
%E3%82%A2%E3%83%A0%E3%83%BB%E3%82%B4%E3%82%BB%E3%83%83%E3%83%88
• http://ja.wikipedia.org/wiki/%E8%87%AA%E7%94%B1%E5%BA%A6
• http://mat.isc.chubu.ac.jp/fpr/fpr1997/0119.html
• http://www.pol.geophys.tohoku.ac.jp/~hanawa/ori/ori/054.html
Friday, February 10, 12 148
149. •
•
• A B
•
Friday, February 10, 12 151
150. : “ ”
“however”
109 347 8 493
[ ] , ....
[ ] ..., , ....
[ ] ..., .
Friday, February 10, 12 152
151. > freq <- c(109,347,8)
> chisq.test(freq,correct=FALSE)
Chi-squared test for given probabilities
data: freq
X-squared = 391.7371, df = 2, p-value < 2.2e-16
# 2
# http://homepage2.nifty.com/nandemoarchive/toukei_kiso/t_F_chi.htm
Friday, February 10, 12 153
154. •3
• t
• ANOVA ANalysis Of VAriance
•F F
Friday, February 10, 12 156
155. A B C
10 9 6
8 7 4
9 4 3
6 5 9
9 2 2
5 8 6
7 4 2
9 2 4
8 8 3
10 4 9
Friday, February 10, 12 157
156. > test <- read.csv("demo.csv", head=T)
> anova(lm(Class ~ Score, data=test))
Analysis of Variance Table
Response: Class
Df Sum Sq Mean Sq F value Pr(>F)
Score 1 5.2389 5.2389 9.9376 0.00384
**
Residuals 28 14.7611 0.5272
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Friday, February 10, 12 158
157. •
• t
•
• !
• ...
Friday, February 10, 12 159
162. 2,940 1,785 3,780
Friday, February 10, 12 164
164. 22 19 R
R
R R
http://www.occn.zaq.ne.jp/cuhxr802/epi_20120126.pdf
Friday, February 10, 12 166
165. Excel
SPSS
Friday, February 10, 12 167
166. twitter: @sakaue
e-mail: tsakaue<AT>hiroshima-u.ac.jp
Friday, February 10, 12 168