7. Their basic pitch was
“Genomics is a fraud”
“
”
http://www.technologyreview.com/news/535771/a-contrarian-in-biotech/
8.
9. “The explosive growth of next-generation
sequencing data submitted into the SRA
exceeds the growth rate of storage
capacity
”
http://www.ncbi.nlm.nih.gov/pubmed/22009675
29. @leekgroup
Data:
xik
- value for feature i, sample k
yk
- group indicator for sample k
TSP is (i,j) pair that maximizes:
|Pr(xik
< xjk
| yk
=1) – Pr(xik
< xjk
| yk
=0)|⌃ ⌃
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1989150/
31. @leekgroup
• Not the same as TSP
• But |â/s.e.(â)| = |û/s.e.(û)|, algebraically
• “Variance regularized” TSP
• zijk
invariant to monotone transformations
• Fix parameters → find features
E[yk
|zijk
] = u0ij
+ u1ij
zijk
Patil et al. (in prep)
32. @leekgroup
1. Calculate t-statistic for all pairs
2. Choose top pair (or covariate)
3. Continue for a fixed number of pairs
E[yk
|zijk
] = u0ij
+ u1ij
zijk
Patil et al. (in prep)
72. acknowledgements
Leek group
Prasad Patil
Leo Collado Torres
Abhi Nellore
Claire Ruberman
Jack Fu
Kai Kammers
Collaborators
Michael Rosenblum
Benjamin Haibe-Kains
P.O. Bachant-Winner
Roger Peng