SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Downloaden Sie, um offline zu lesen
Engaging a Scientific Community in Contributing
to a Biological Database
Paul Gardner
June 21, 2013
Paul Gardner Engaging Scientists
What is RNA?
RNA is a fundamental biological molecule, essential for untold
biological processes
My aim is to build an analog to the Periodic Table for
classifying RNA families and motifs, enabling researchers to
predict function.
New technologies are accelerating the rate of RNA discovery.
base basepair
R
A
U
A
G
A
U Y
A
C
A
U
U
5´
Y
G
A
A
R
5´
C
U
U C
G
G
5´
R
U
R R R
Y
5´
R
R
G
C
G
U
R A
R
A
G
C
Y
5´
R
Y
G
G
A
G
Y
R RR
R
C RR
G
A
R
R
5´
C
G
A
A
G
Y
Y
R
Y
Y RR
G
G
G
R
U
G
G
A
G
5´
C
C
R
A
Y
C
C
C
R
U C
C
G
A
A
C
U
Y
G
G
5´
A N Y A G N R A U N C G T loop U t ur n k t r n1 k t r n2 tw ist
R
C
Y
R
G
G
A
AC
U
G
A
RC
R
U
Y
AG
U
A
C
G
GG
A R R A5´
Y
Y
Y
A
GU
A
G Y R
A
G
G
A
A
R
R
R
5´ R
Y
G
R
Y
A
A
Y
C
RY
A Y
Y
A
G
R
GA
A
Y
C
5´
R
C
A
GG A
G
Y
5´
A
C
A
C U
G
R
Y
R
Y G Y R R R R
R
Y
C
A
R
U
Y
5´
R
A
G
C
R
C
G
R
A
G
Y AY
G
Y
Y
R
G
U
U
Y
5´
A
A
A
A
A
G
C
Y
R
Y
Y R
R
Y
G
G
Y
U
U
U
U
UU
Y U Y5´
R
R
A
R
R Y
Y
U
U
UU
U U Y5´
sar r ic1 sar r ic2 U A A G A N C sr C loop dom V t er m 1 t er m 2
R Y Y Y Y
G
C
G
A
G
C
A
G
A
C
G
C
A
R
A
A
C
R
C
C
C
R
R
Y
R
R
Y
G
G
G
Y
G
U
U
Y
U
G
C
G
U
C
U
G
C
U
C
G
C
R R R R5´
Y
U
Y
UC
U
C
A
A
C
AG
UG
Y
U
U
G
R
R
R
A
A
Y
5´ Y
Y
Y
Y
Y
A
U
GA
Y G
R
Y
Y
Y
YA
A
A Y
Y
Y
YY
R
R
G
R
R
Y
C U GAU
Y
Y
Y
R
R
R
5´
G
G
G
U
C
U
C
U
C
U
G
Y
U
A
G
A
C
C
A
G
AU
CU
G
A
G
C
C
UG
GG
A
G
C
U
C
U
C
U
G
G
C
U
A
R
C
U
A
G
G
G
A
A
C
C
CA
C5´ UG
U
A
A
A
C
A
U
C
CU
Y G
A
C
U
G
G
A
A
G
C
UG
U
R
R
R
Y R Y
R
R
RR
G
C
U
U
U
C
A
G
U
C
G
G
A
U
G
U
U
U
G
C
5´ U CU
U
U
G
G
U
U
A
U
C
U
A
G
C
U
G
UA
U
G
AG
U
G
Y
Y R
C
RU
C
A
UA
A
A
G
C
U
A
G
A
U
A
C
C
G
A
AR
U5´ C
Y
Y
R
UC
C
C
U
G
A
G
A
C
C
C
U
A
A
C
Y
U
G
U
G
AG
Y
U
Y
YY
A
G
Y
UU
C
A
C
A
R
G
U
R
G
G
Y
U
C
U
Y
G
G
G
R
CY
R
G
G
5´
G
C
U
A
A
A
A
G
G
A
A
C
G
A
U
C
G
U
U
G
U
G
A
U
A
U
GC
G
U
U
RRU
U
YC
G
U
U
AC
A
U
A
U
C
A
C
A
G
U
G
A
U
U
U
U
C
C
U
U
U
A
U
A
R
CG
C5´ C Y GY
G
Y
Y
C
A
U
C
U
U
A
C
Y
G
RG
C
A
G
U
G
U
U
G
GA
U
G
Y
YY R
R
G
Y
C
UC
U
A
A
Y
A
C
U
G
YC
U
G
G
U
A
A
Y
G
A
U
G
R
C
RY
C G G5´ Y Y Y Y R R GY
A
C
A
U
R
C
U
U
C
U
U
U
A
U
A
U
C
C
C
A
U
AY
R
A
Y
R
R
R
CU
A
U
G
G
A
A
U
G
U
A
A
A
G
A
A
G
U
A
U
G
U
AY
Y Y G G Y5´ Y R R YY
C
R
U
C
A
A
A
R
U
G
G
Y
U
G
U
G
A
R U
G
U
Y
R
U
CA
U
A
U
C
A
C
A
G
C
C
A
C
U
U
U
G
A
U
G
AG
Y U Y R R5´ Y A A RA
A
G
G
G
A
A
Y
R
G
U
U
G
C
U
G
U
G
A
U
R
U
A
Y
Y
Y
A Y
Y
Y
Y
U
YU
A
U
A
U
C
A
C
A
G
U
G
G
C
U
G
U
U
C
U
U
U
UU
G G U Y5´ Y
C
R
G
G
U
G
A
G
G
U
A
G
U
A
G
G
U
U
G
U
A
U
A
G
U
U
RR
R
R
Y
Y
Y
Y YG
G
A
GY
A
A
C
U
R
U
A
C
A
A
Y
C
U
R
C
U
A
C
U
U
Y
C
C
U
G
R
5´
G
G
C
U
G
G
U
C
C
G
A
R
RG
U
A
G
U
G
G
G
U
U
A
Y
R
U
Y
A
AY
Y
Y
Y
U
U
R
Y
Y Y YU
C
Y
C
C
CYC
Y
C
A
C
U RC
UR
YA
C
U
U
G
A
C
U
R
G
C
CU
U U5´ Y
Y
Y
C
U
G
Y
R
R
U
G
U
C
G
UA
R
Y
Y
Y
Y
Y
U
G
A
R
C
CRAY
Y
Y
Y
Y
Y
G
G
G
R
G
Y
Y
Y
Y
Y
R
G
G
YA
G C
C
C
YY
G
G
GA
A
R
C
A
A
R
Y
R
R
R
R
Y
R
C
C
C A CCU
R
R
R
Y
R
YRG
G
U
U
C
A
R
R
R
R
Y
A
C
G
G
C
A
Y
Y
R
Y
G
G
R
Y
YY
Y5´
Y
Y
R
C
G
R
C
C
A
UA
C
R
R
R
G
R
A
R
C
A
CC
Y
G
R
U
C
C
CA
U C
C
G
A A
CY
C
R
GA
A
G
U UA
A
GC
Y
Y
Y Y
GG
C Y
R R
G
U
A C U
R
G R YG
RG
R
AYC
CUG
GG
AA
RY
RGGU
G
Y
Y
G
Y
R
RY
5´
G
RU
A
GYYY
AR
Y
G
G
Y AR
R R C
RY
Y
R
G
Y
U
Y A
A
Y
Y
R
R
RR
Y
R
RG
G UU
C
R
AR
U
C
C
Y
YY
YR
5´
R
R
AAR
Y
U
C
R
Y
R
R
R
R
GYYAC
R
R
YG
A
G
U
R
Y
Y R
YRCUC
Y
CYYYY
G G G A A GGU
C U G A G
A
R
G
C
CAY
Y
R
C
C
CU
G
GGGYR
Y
Y
Y
Y
Y
Y
GR
R
R
R
G
R
R
R
R Y G R G Y Y
A
C
C
AG
A A A Y
R
R Y Y
Y
Y
R
RGY
U
U
GGAA
RRCUYRY
GGCY
RG Y R R Y U
A
G
U
C
A
A
U
R
Y
GRR
Y
R
R
Y
Y
Y
R
AAC
Y
C
R
A
UUCAG
A
C
U
A
UCU
Y
Y
5´
T R I T I R E SE C I S m ir -T A R m ir -30 m ir -9 lin-4 m ir -5 m ir -8 m ir -1 m ir -2 m ir -6 let -7 Y R N A 6S 5S t R N A R N aseP
AURRGRYA
G
G
YA
U
U
G
AA
CUGU
AU
U
G
U
G
CR
C
C
UU
GCAUARAGCUAAAGCACUAAAAAGGAGUAA5´
A
G
U
C
A
U
G
A
U
YG
C
U
A
U
U
C
Y
Y Y
A
A
A
U
A
G
UG
A
U
U
G
U
G
A
U
AG
C
G
A
U
G
C
G
G
Y
G
U
G
U
UG C
G
C
A
C
R
Y
C
G
Y
A
Y
C
G
CG
C U5´
AGAGGAARCR
G
G
G
G
C
CAY
G
C
A
GAAGC
G
U
UC
ACG
U
C
G
C
G
G
C
C
C
CU
GUC
A
G
A
U
U
C
RGU
R
A
A
U
C
U
GC
GAAUUCUGCU5´
G A U AC
A
U
A
G
G
A
A
C
C
U
C
C
U
C A
A
A
G
G
A
U
U
C
U
A
U
GG
A C AG
U
C
G
A
U
G
C
A
G
G
G
A
G
G
G A CR
R
C
U
C
C
C
U
G
C
A
U
C
G
G
CG
A U U U U5´ A
C
G
R
RG
U
R RA
R
UG
C
G
A U A A Y A YA
A
U
A
A
U
GAAA
U
U
C
C
U
CU
U U G A C
G
G
C
C
A
A
U
A
GC
GA
U
A
U
U
G
G
C
CA
U
U
U
U
U
U
U
5´ R
Y
C
U
U
U
A
G
C
G
GG
Y
U
R
RR
U
Y A R U CURG
Y
Y
G
G
Y
G
U
U
U
C
G
C
C
G
R
C
Y YU
R
C
Y
Y
U
G
A
Y
R
Y
5´
RYYRYYCC
G
U
G
G
UG
A
U
U
U
G
RYC
GGCCGG
C
U
U
G
C
AG
C
C
A
C
GU
UAAAYAAUCGCUAAARAGGCCGRGGRRR5´
G
UCGRR
U
Y Y C A
C
UG
A U G AG U C Y
U R
ARGAC
G
A
AA
C
5´ Y Y R
A
U
Y
U
AAA
RA
A
A
C
A G CU
U UC
A AG
U G CCU U U Y U GC
A G
U
U
YYY
CARGAGCGC
A
A
G
A
U
RG
R U A5´
R
Y
G
GY
Y G
Y
U
U
G
C
C
A
U
A
C
G
C
C
C
YY
Y YY
C
G
G
C A
GG
U
A
U
G
G
A
A
R
C
A
C
C
C
YC
G Y A CG
A
C
U
G
GY
Y
C G
G
A
C
A
CY
GY
C
G
U
C
CC
G
C
C
A
G
A
U
C
5´ CA
C
A
U
C
A
G
A
U U U
C
C
U
G
G
U
G
UA
A CG
A
A
U
U
U
U
C
A
A
G
U
G
C
U U C
U
U
G
C
A
U
A
A
G
C
A
A
G
U
U
URA
U
C
C
C
G
C
Y
C
CY
YC
G
R
G
Y
C
G
G
G
A
UU
U5´ A U GG
A
G
A
C
A
UGGCR
U
AA
AG
C C AG
A R
A
G U R A
G
A
AC
R U A A C
Y
U
A
G
A
C
U
R
U
ACUUGAA
C
U
G
A
U
UYRC
A
U
C
U
CA
U U U U5´
G
C
R
C
Y
G
C
AA
AA
U
C
R
G
R
Y
G
C
C G G G A
U UG
G
YA
YCCCG
R
A
Y
R
R
R
R
Y
R
A R C G C
Y
GCGYU
U
U
U
U
U
5´
Y U R C G U G A C G A A G CG
C
G
C
G
CA
A
A
G
UGG
A C
AA
U
A
A
AG
C
C
UR
A G C
RU
Y
R
A
G
UAG
U
C
G
Y
CAG
A
C
G
C
C
G
G
U
U A A
G
C
C
G
G
C
G
U
UU
U U U5´ YR
Y
A
C
G
UR
Y
C
Y
G
U
U
R
UR
G
Y
C
C
G
G
U
U
G
C
U
U
UG
GU
C
G
G
U
G
A
C
C
G
G
R
R R R
R
A
G
C
C
C
R
C
UU G
G
U
G
G
G
Y
U
UU
U U5´
G
G
Y
C
R
G
C
Y
C
R
CC
C C CC
R
G
R
G
C
Y
G
R
C
C
G A C G G C C C C C G C
U CC
C
C
CCY
GGCGGGGGYCGUC
C
C
Y
Y
5´
U U G G C G A U R UU
U
U
U
G
GU
U G
G
A
A
U
G
UAGUGY
YY
UU
A
R C A C U AA
A CG
C U G
CC
A C AA
A
U
A
A
CCUG
U
CAGU
U
A
U
U
U
C
A
Y
C
A
A
A
AA
U A A A5´
RYYRYUG
C
C
C
UCY
G
G
G
CG
UUUCCUCCCUAGACUU
G
G
C
Y
Y
YY
R
R
G
G
C
CU
UUUUUUUYYY5´
SA M V sym R C P E B 3 F inP sr oB m sr SA M a H H 3 V m nt n3 livK D sr A C A E SA R isr K sr oD isr B 6C r spL suhB
UY
G
C
A
UCCGCYAA
Y
CGGUYA
G C C GU G UC
G C GG A
A G
G
U
U
Y Y
Y
A
A
C
CA
G C UR
Y
Y U Y Y G RA
ACRRAG
RRA
GGUG
A
G
C
G
5´
UG
A
A
A
GAC
G
C
G
C
A
U
U
U
GU
U A U C A U CA
UC
C CU
G
U Y
C
A
G
AG
A
U
GY
A
A
U
U
U
GG
CC
AC
AG
Y
RY
G
U
G
G
C
C
U
U
U
U
C
5´
* U
U
C
U
A
C
U
G
A
C
U
C
UU
U
U
A
AA
A
U
A
AU
U
A
U
U
C
A
U
U
G
G
AG
G U UU
A
A
UA
U
G
A
A
U
A
UA
A A G G A U G A G CA
U A
U
A
G
A
AG
C
GUUUG
C
UCYUU
GU
U
A
G
AU
C
R
G
U
U
A
G
U
A
G
G
AA
5´
G A U U UG
G
U
R
R
C
U
G
C
G
C
U
C
UU
C UA
A
G
C
C
A
G
U
U
A
C
C
CG
G
U
U
C
A
A
A
R
A
U
U
G C C
A
G
C
U
U
Y
G
A
A
C
CU
UC
G
A
A
A
A
A
C
C
A
C
C
U
Y CR
R
G
G
U
G
G
U
U
U
U
U
U
C
GU
5´
R R R R R R R R
C
U
C
R
U
AU
A
A
YYYCRRR
AA
U
A
UG GY
Y Y G R R A
GU
U U C UAC
C R R G Y R
C CG
U
AAA
YRYYYG
A
CU
A
Y
G
A
G
RR
R5´
C
G
G
C
A
U
C
C
C
C
A
U
U
A C C
U
A
U
G
G AC
A
CG
G
U
G
C
C
G
C A R G C U C U G G R A
G UU
C
GUYCCRGAGYYUG
Y
Y
G
G
A
A
R
G
G
U
U
U
U
C
C
G
U
G
U
C
C
A
G
5´
R
R
Y
G
G
A
R
G
CRR
U
GA
R
Y
R
Y
Y
Y
YU
Y
A
U
YU
G G GCA
C
Y
U
G
R
R
R
Y
R
YG
G
A
G
C
YAG
U R GU
G
C
A
ACCG
R
C
C
R
Y
R
R
R
5´
G
U
U
G
U
A
A
C
U
AU
G
U
U
G
C
A
R
YA
R A C G AG
A
A
C
C
G
AG
U
A
U
A
G
U
U
C
A
U
GG
G
R
U Y A
CA
UG
AA
UU G U UU
A
A
CU
RU
CC
U
C
U
GG
A
U U
C
CC
G
U
C
C
AU
G
R
C
A
GU
C
G
G
U
U
C
5´
CUUA
C
U
G
A
GA
G
C
A
C
AA
A
GU
UUC
C
C
G
U
GC
CA
A
C
A
G
G
G
A
G
U
G
U
UAU
A
AC
G
G
UU
UAUU
A
G
U
C
U
G
G
AG
ACG
G
C
A
G
A
C
U
AU
CCUCUUC
C
C
G
G
U
C
CC
CUA
U
G
C
C
G
G
GU
UUUUUUUAUGUC5´
UURGRYUYRCCUG
A
A
U
G
U
G
A
CU
A
U
C
A
C
U
U
CA
AACRRYGRGYAACCUCAGUAUCAUCRYRGAGYUA
A
A
C
C
C
U
C
G
C
C
G
C
CUG
A
C
G
G
Y
G
A
G
G
G
U
U
UU
CUUUUGGR5´
U G U A A A A A A C A U Y A U U UA
G
C
GUGAYU
U
U
C
U
A
U
C
A
ACAG
C U A A C
A
A
U
U
G
U
UA
U
U
A
C
UG
C
CUA
A
Y
G
Y
U
C
A
UA
A G G G U A AUU
U
U
A
A
A
A
A
AGG
G CG
A
U
A
A
AA
A
A
C
G
A
U
U
G G GGGA
U
G
A
G
A
Y
A
U
G
AAC
G
C
UC
A A G C A5´
C C C A G A G G U A U U G A UU
G
G
U
G
A
U
R
R
C
A
Y
Y
U C U
R
U
G
Y
U
Y
A
U
UY
A
U
UR
C
A
C
C
A
A C C U G C G C RG
A
UGCGCAGGU
U
U
U
U
U
U
U
5´
AR
R
R
Y
Y
YYYAAURYCAACYUUUAGCGCACG
G
C
U
C
U
YY
A
A
G
A
G
C
CA
UUYCCCUA
G
R
C
C
A
A
A
C
A
G
GAAU
Y
G
U
U
U
G
G
Y
C
UU
UUUUU5´
G
G
G
C
A
R
G
A
U
A
U
G
U
G
A
A
GU
R
GC
Y
A
C
C
GC
AA
GC
YGR
U
A
CY
CUU
CAC
Y
Y Y C C
U
U
A U UC
G C
U
Y
GC
U
CAAC
GGR
A
U
C
Y
U
G
C
U
CU
G C G A G G C Y5´
GUGCRRYCYRAUUYYR
G
Y
Y
G
Y
G
C
C
Y
R
Y
R
A
R
AAC
AUCAYAA
R
A
U
A
CG
G
C
R
C
R
R
CC
ACRAUUUCCCUG
G
U
G
U
U
GG
C
G
C
A
GU
AUU
C
G
C
G
C
A
C
CC
CGGUCUACC5´
Y
U
U
Y
R
Y
U
R
R
U
U
U
Y
A
U
C
A
R
A
YC
U GU
U
U
G
A
U
R
R
A
A
G
Y
U
A
R
Y
G
A
R
R Y Y C A Y UA
A
C
R
G
C
U
Y
U
Y
GC
Y G
G
C
Y Y G
A
C
C
C
G
A
G
R
Y
Y
G
U
UU
U U U U5´
RACGUUCAY
C
C
Y
YY
R
G
G
RC
GCAYRA
Y
C
A
R
R
Y
C
A
Y
GG
AAC
G
G
G
G
R
Y
Y
U
G
R
R
5´
sucA Sr aD sxy R N A I P ur ine SA M -C hl cdiG M P 2 A nt i-Q G adY r nk ldr P r fA O m r A -B R yeB t r aJ 2 Sr aH 23Sm et h D S-p ep
U U C G G C C Y CG
C
R
R
C
G
YU
U YU
Y
C
G
Y
Y
G
CC
C U C U G C A YG
C
C
G
U
C
G
C
C
G
A
CGCAY
U
C
C
Y
A
U
U
CG
A
A Y Y G U
G
C
G
A
U
C
C
U
G
U
C
G
C CY
U
C
C
U
GC
G
G
C
G
C
G
G
C
5´ CG
Y
R
G
C
G
C
U
U
G
U
UA
U U
U
R
Y
Y
G C U
G
U
G
U
A
G U GUC
G
U
C
Y
YR
A R Y Y R G R R Y Y Y
A
A
A
C
C
C
C
G
C
C
Y
UU
Y
G
G
C
G
G
G
G
U
U
U
UG
C U U U U U5´
** C
U
U
A
C
C
G
G
A
G
GY
R
U
A
UGGAC
C
C
UG
A UC
C C AC
Y C C U
C
U
C
C
C
C G
A
UG
G
A
G
AA
U
Y
Y
YU
U
U
C
C
G
G
U
A
A
GC
C Y G Y C U Y Y
R
C
U
G
Y
Y
U
U
A
C
C
G
G UG
Y
G
U
A
A
G
G
C
A
G
UG
A C G U Y U5´
G
G
R
A
G
R
Y
R
Y
CU
G
GU G R
Y
C
G
G
C
U
UC
A AA
CC
GR
Y G
RR
G
Y
R
Y
Y
Y
Y
G
G
Y
RGG
U
U
C
G
AY
U
C
C
Y
RY
Y
C
U
Y
C
C
5´ U
G
A
C
C
C
U
U
U
A R
C
C
R
A
G
G
G
U
C
AC
C U A G C C A A C U G A C GU
U
G
U
U
AG
U
G
A
A
Y
YY
A
U
G
U
U
C
A
C A
RA
U
A
R
GC
C
A
A
U
C
G
C
U
U
U
G
C
G
R
U
U
G
GC
U U U U U U U U U5´ C U U A A UR
A
A
CAA
G
A
A
A
A
C
YAA
R C G
U
A
C
Y
U
U
C
C
Y C
C
U
G
AG
UU
C
A
G
G
C
U
G
G
A
A
UG
C
G
C A
CAG
C U RA
U U G U U G A U AA
G G G CU
ACUC
AUACCGACAA
GC
CAGU
G
A
A
G
C
G
AUG
A
AU
G
U
C
GG
U
U
CC
A C5´
R
U
Y
Y
RC
U
G
A
Y
GA
G
U
C
C
C
A
A AU
A
G
G
A
CGA
A
A C G C
GCGU
CY
G
R
A
U
5´ CU
C
C
A
U
GU
A
U
C
U
UU
G
G
G
A
C
C
U
G
U
C
A
GC
UG
U
G
G
C
A
G U
CU
C
C
C U
UC
C
U
A
G
CC
A
U
G
G
AA
G A G C A U A U U C UU
G
U
U
U
AU
U
G
G
C
A
A
A
GC
U
G
U
CA
C
C
A
U
UU
RA
U
U
G
G
UA
U
C
A
G
A U U
C
U
GAC
U
U
G
C
A
C
A
AG
U
A
A
C
AU
U C5´ C Y G G U U GG
U
G
G
C
G
C
A
C
U
U
C
C
Y
Y
A
C
G
G
G
C
G
G
U
G
U R
U
Y
A
CG
Y R Y U R Y R R Y A G A R R R A Y A C C
A
G
C
C
C
G
C
Y
RR
R
A
G
C
G
G
G
C
UU
U U U U5´
G
U
C
A
U
A
C
U
A
C
G
G
UG
C
A
A
Y
GY
R
RA
A
A
G
U A
A
AC
G
A
U
G
A
C
C C Y
A
RG
A
A
C
U
C
Y
RG
G U A
A A
A
U
R
CR
UAUC
A
A
A
A
U
G
Y
A
A
A
A
U
U
G
U
Y U G A C C U G G GR
UY
Y
UCCGGGUYRG
Y
U
Y
U
U
U
U
5´
U R U G C U A A C U R R R A A YG
U
U
G
Y
A U
R
Y
A
A
CCC
U
U
G
R
Y
G
C
U
U
A
U Y
CC
U
U
U
R
Y
C
A
A
GC
A U A U U A Y AR
C
G
R
U
C
G
Y
YA
A A G G A G A A A U G5´
U C R A A A G A A C AU
G
A
A
A
U
G
G
A
G
G
AGAAAUU
AC
A
GC
A A U U UA
UC
AR
C U
G
A
A
A
UU
A
U
AG
G
U
GU
AG
ACA
C A
UGUC
A
GC
R G UG
G
A
A
A
CAGUU
UC U A
UC
A A A A UU
A A AG
U
A
U
UUAG
A
G
AUUUU
C
C
U
C A
AA
U
U
U
C
AA
A U5´
ACAG
G
G
U
A
R
G
G
R
Y
Y
Y
Y
Y
UU
RU
R
R
R
R
R
Y
C
C
U
U
A
C
C
G
GR
UUUCU
C
A
A
R
U
Y
G
G
R
G
YA
AA
Y
C
C
G
R
U
U
G
RA
RUAUARAGGARG5´
CGYGUUA
U
A
U
G
CC
UU
U
A
U
U
G
UC
ACARUUYUUUUUYYG
Y
U
G
R
Y
C
A
U
U
G
GYAY
YA
U
U
R
A
U
U
Y
C
C
A
G
CR
AUAAAYG
A
C
A
A
G
C
C
C
G
A
A
C
RY
U
G
U
U
C
G
G
G
C
U
U
U
UU
UUURRUYA5´
Y Y Y AU
G
G
Y
G
G
Y
G
R
G
G
G
R
RCC
UU
Y
G GG Y
Y
G
C
C
G
GUU
C
C
YY
R
CCG
GU Y U RC
C
A
A
C
C
C
Y
Y
R
C
Y
R
C
C
AC
C Y5´
AUGGAYRU
G
C
G
C
A
GGA
A
G
C
G
CR
AAGACARACAGGGACACRYAGGRA
C
CCG
GA
UGGYGGRRYAGGAUGUCAGGRAACAGUCUGCA
A
A
G
C
C
C
C
G
C
YY
YG
G
C
G
G
G
G
U
U
U
U
5´
P s-R ho r nk ps M gsens t R N A S Q r r isr C H H 1 SN R 24 T r p ldr gr eA pr eQ 12 H A R 1F T er m L eu M icC C 4 R sm Y R ib osom e
Paul Gardner Engaging Scientists
What is Rfam?
A database of ncRNA alignments and structures
Used for annotating RNAs in genome sequences, bioinformatic
algorithm development and molecular evolutionary analyses
Gardner et al. (2008) Rfam: updates to the RNA families database
Nucleic Acids Research.
Paul Gardner Engaging Scientists
How can we keep textual descriptions of RNAs up to date?
AC RF00005
ID tRNA
CC Transfer RNA (tRNA) molecules are approximately 80 nucleotides in
CC length. Their secondary structure includes four short
CC double-helical elements and three loops (D, anti-codon, and T
CC loops). Further hydrogen bonds mediate the characteristic
CC L-shaped molecular structure. tRNAs have two regions of
CC fundamental functional importance: the anti-codon, which is
CC responsible for specific mRNA codon recognition, and the 3’ end,
CC to which the tRNAs corresponding amino acid is attached (by
CC aminoacyl-tRNA synthetases). tRNAs cope with the degeneracy of
CC the genetic code in two manners: having more than one tRNA (with
CC a specific anti-codon) for a particular amino acid; and ’wobble’
CC base-pairing, i.e. permitting non-standard base-pairing at the
CC 3rd anti-codon position.
RN [1]
RM 8256282
RT The tertiary structure of tRNA and the development of the genetic
RT code.
RA Hou YM;
RL Trends Biochem Sci 1993;18:362-364.
RN [2]
RM 9023104
RT tRNAscan-SE: a program for improved detection of transfer RNA genes
RT in genomic sequence.
RA Lowe TM, Eddy SR;
RL Nucleic Acids Res 1997;25:955-964.
Paul Gardner Engaging Scientists
This Wikipedia thing looks pretty good!
Paul Gardner Engaging Scientists
WikiProject RNA
The WikiProjects are social corners of Wikipedia for interested
parties to discuss themed articles
Involved in reviewing, ranking and rating articles
Now rolled into the larger WikiProject Molecular and Cellular
Biology
Paul Gardner Engaging Scientists
How has the Wikipedia experiment gone?
x x x x
x
x
x x
x
x
x x x x x x x
x x x x x x x x
x x
x
x x x x x x x x x x
x
x x
x
x x x
0
2000
4000
6000
8000
10000
Number of Rfam pages edited
Year
Numberofedits
2007 2008 2009 2010 2011
9089
x x xxxxxxxxxxx xxxxxxxxxxxx xxxxx xx x
106
Total edits
Vandalism
Gardner et al. (2011) Rfam: Wikipedia, clans and the “decimal”
release Nucleic Acids Research.
Paul Gardner Engaging Scientists
Who are these Wikipedians donating their time?
Rfambot
Ppgardne
Citationbot1
WillowW
SmackBot
DOI_bot
Addbot
Alexbateman
Jebus989
JenniferRfm
Zashaw
Rjwilmsi
Qwyrxian
Yobot
RE73
Narayanese
RichFarmbrough
Addshore
Wgscott
MiRroar
RjwilmsiBot
Arcadian
DO11.10
Gortonk
Banus
Drmed36
FrescoBot
Boghog
Top 20 Rfam wikiproject editors
Numberofedits
0
200
400
600
800
1000
Bots
Proof Readers
Scientists
Paul Gardner Engaging Scientists
What incentives can we give to Academics?
Academics love publishing articles
Introducing the “families track” at RNA Biology
Publication requirements are an alignment & a Wikipedia
article
100s of new families have been added thanks to this track
Paul Gardner Engaging Scientists
Who else is now using this model?
Finn, Gardner, Bateman (2012) Making your database available
through Wikipedia: the pros and cons Nucleic Acids Research.
Paul Gardner Engaging Scientists
Wikipedia need you!
What is the highest impact contribution academics can make?
Rule 1: Register an Account
Rule 2: Learn the Five Pillars
ENCYC, NPOV, FREE, RESPECT, NORULES
Rule 3: Be Bold, but Not Reckless
Rule 4: Know Your Audience
Rule 5: Do Not Infringe Copyright
...
Paul Gardner Engaging Scientists
Who might be reading about your field?
Paul Gardner Engaging Scientists
Thanks!
The Rfam Consortium
Wikipedians & the long
tail!
PPG is supported by a Rutherford Discovery Fellowship from Government funding, administered by the Royal
Society of New Zealand.
Paul Gardner Engaging Scientists
Engaging Scientific Communities in Contributing to a Biological Database

Weitere ähnliche Inhalte

Andere mochten auch

Bioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAsBioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAs
Paul Gardner
 
A visit to london
A visit to londonA visit to london
A visit to london
sehrish123
 
Vizbi2013: Visualising RNA
Vizbi2013: Visualising RNAVizbi2013: Visualising RNA
Vizbi2013: Visualising RNA
Paul Gardner
 
Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs
Paul Gardner
 

Andere mochten auch (12)

Later is NOW! Extract Bb Content
Later is NOW! Extract Bb ContentLater is NOW! Extract Bb Content
Later is NOW! Extract Bb Content
 
Citizen Scientists
Citizen ScientistsCitizen Scientists
Citizen Scientists
 
Bioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAsBioinformatic approaches to functionally characterise RNAs
Bioinformatic approaches to functionally characterise RNAs
 
My Favourite Pastime
My Favourite PastimeMy Favourite Pastime
My Favourite Pastime
 
A visit to london
A visit to londonA visit to london
A visit to london
 
Vizbi2013: Visualising RNA
Vizbi2013: Visualising RNAVizbi2013: Visualising RNA
Vizbi2013: Visualising RNA
 
Random RNA interactions control protein expression in prokaryotes
Random RNA interactions control protein expression in prokaryotesRandom RNA interactions control protein expression in prokaryotes
Random RNA interactions control protein expression in prokaryotes
 
Sakai: Set Up Your Course Site
Sakai: Set Up Your Course SiteSakai: Set Up Your Course Site
Sakai: Set Up Your Course Site
 
Sakai: Get oriented
Sakai: Get orientedSakai: Get oriented
Sakai: Get oriented
 
BIOL335: Homology search
BIOL335: Homology searchBIOL335: Homology search
BIOL335: Homology search
 
Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs
 
BIOL335: RNA bioinformatics
BIOL335: RNA bioinformaticsBIOL335: RNA bioinformatics
BIOL335: RNA bioinformatics
 

Ähnlich wie Engaging Scientific Communities in Contributing to a Biological Database

как это работает4
как это работает4как это работает4
как это работает4
Vladislav Troshin
 
Waukegan west 1984
Waukegan west 1984Waukegan west 1984
Waukegan west 1984
Dave Levine
 
Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984
Dave Levine
 
PFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBookPFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBook
punxsyscience
 
Sopa de letras
Sopa de letrasSopa de letras
Sopa de letras
nirvana18
 

Ähnlich wie Engaging Scientific Communities in Contributing to a Biological Database (20)

Colageno I Secuenciacion
Colageno I SecuenciacionColageno I Secuenciacion
Colageno I Secuenciacion
 
Gene Sequences
 Gene Sequences Gene Sequences
Gene Sequences
 
как это работает4
как это работает4как это работает4
как это работает4
 
Waukegan west 1984
Waukegan west 1984Waukegan west 1984
Waukegan west 1984
 
CONJUNTOS
CONJUNTOSCONJUNTOS
CONJUNTOS
 
Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984Niles West v Glenbrook South 1984
Niles West v Glenbrook South 1984
 
3D Printing Basics: Going From Bytes To Atoms
3D Printing Basics: Going From Bytes To Atoms3D Printing Basics: Going From Bytes To Atoms
3D Printing Basics: Going From Bytes To Atoms
 
Projek akhir asas pengangkutan data a168611
Projek akhir asas pengangkutan data a168611Projek akhir asas pengangkutan data a168611
Projek akhir asas pengangkutan data a168611
 
PFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBookPFediganProteinSynthesisFlipBook
PFediganProteinSynthesisFlipBook
 
Puente diciembre del 5 al 8.
Puente diciembre del 5 al 8.Puente diciembre del 5 al 8.
Puente diciembre del 5 al 8.
 
¡Recordemos las letras!
¡Recordemos las letras!¡Recordemos las letras!
¡Recordemos las letras!
 
Practiquemos las letras
Practiquemos las letrasPractiquemos las letras
Practiquemos las letras
 
Animaliak
AnimaliakAnimaliak
Animaliak
 
Sopa de letras
Sopa de letrasSopa de letras
Sopa de letras
 
Landschap en Energie Inpassingsstrategien West-Brabant
Landschap en Energie Inpassingsstrategien West-BrabantLandschap en Energie Inpassingsstrategien West-Brabant
Landschap en Energie Inpassingsstrategien West-Brabant
 
Diapositivas para Proyecto.pptx
Diapositivas para Proyecto.pptxDiapositivas para Proyecto.pptx
Diapositivas para Proyecto.pptx
 
Permainan bahasa
Permainan bahasaPermainan bahasa
Permainan bahasa
 
Come realizzare una fondazione a "Platea Calda"
Come realizzare una fondazione a "Platea Calda"Come realizzare una fondazione a "Platea Calda"
Come realizzare una fondazione a "Platea Calda"
 
PPT Komp. Makam Arung Palakka
PPT Komp. Makam Arung PalakkaPPT Komp. Makam Arung Palakka
PPT Komp. Makam Arung Palakka
 
Gujarat Agro Business Policy 2016-21
Gujarat Agro Business Policy 2016-21Gujarat Agro Business Policy 2016-21
Gujarat Agro Business Policy 2016-21
 

Mehr von Paul Gardner

Mehr von Paul Gardner (20)

ppgardner-lecture07-genome-function.pdf
ppgardner-lecture07-genome-function.pdfppgardner-lecture07-genome-function.pdf
ppgardner-lecture07-genome-function.pdf
 
ppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdfppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdf
 
ppgardner-lecture05-alignment-comparativegenomics.pdf
ppgardner-lecture05-alignment-comparativegenomics.pdfppgardner-lecture05-alignment-comparativegenomics.pdf
ppgardner-lecture05-alignment-comparativegenomics.pdf
 
ppgardner-lecture04-annotation-comparativegenomics.pdf
ppgardner-lecture04-annotation-comparativegenomics.pdfppgardner-lecture04-annotation-comparativegenomics.pdf
ppgardner-lecture04-annotation-comparativegenomics.pdf
 
ppgardner-lecture03-genomesize-complexity.pdf
ppgardner-lecture03-genomesize-complexity.pdfppgardner-lecture03-genomesize-complexity.pdf
ppgardner-lecture03-genomesize-complexity.pdf
 
Does RNA avoidance dictate protein expression level?
Does RNA avoidance dictate protein expression level?Does RNA avoidance dictate protein expression level?
Does RNA avoidance dictate protein expression level?
 
Machine learning methods
Machine learning methodsMachine learning methods
Machine learning methods
 
Clustering
ClusteringClustering
Clustering
 
Monte Carlo methods
Monte Carlo methodsMonte Carlo methods
Monte Carlo methods
 
The jackknife and bootstrap
The jackknife and bootstrapThe jackknife and bootstrap
The jackknife and bootstrap
 
Contingency tables
Contingency tablesContingency tables
Contingency tables
 
Regression (II)
Regression (II)Regression (II)
Regression (II)
 
Regression (I)
Regression (I)Regression (I)
Regression (I)
 
Analysis of covariation and correlation
Analysis of covariation and correlationAnalysis of covariation and correlation
Analysis of covariation and correlation
 
Analysis of two samples
Analysis of two samplesAnalysis of two samples
Analysis of two samples
 
Analysis of single samples
Analysis of single samplesAnalysis of single samples
Analysis of single samples
 
Centrality and spread
Centrality and spreadCentrality and spread
Centrality and spread
 
Fundamentals of statistical analysis
Fundamentals of statistical analysisFundamentals of statistical analysis
Fundamentals of statistical analysis
 
Avoidance of stochastic RNA interactions can be harnessed to control protein ...
Avoidance of stochastic RNA interactions can be harnessed to control protein ...Avoidance of stochastic RNA interactions can be harnessed to control protein ...
Avoidance of stochastic RNA interactions can be harnessed to control protein ...
 
A meta-analysis of computational biology benchmarks reveals predictors of pro...
A meta-analysis of computational biology benchmarks reveals predictors of pro...A meta-analysis of computational biology benchmarks reveals predictors of pro...
A meta-analysis of computational biology benchmarks reveals predictors of pro...
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Engaging Scientific Communities in Contributing to a Biological Database

  • 1. Engaging a Scientific Community in Contributing to a Biological Database Paul Gardner June 21, 2013 Paul Gardner Engaging Scientists
  • 2. What is RNA? RNA is a fundamental biological molecule, essential for untold biological processes My aim is to build an analog to the Periodic Table for classifying RNA families and motifs, enabling researchers to predict function. New technologies are accelerating the rate of RNA discovery. base basepair R A U A G A U Y A C A U U 5´ Y G A A R 5´ C U U C G G 5´ R U R R R Y 5´ R R G C G U R A R A G C Y 5´ R Y G G A G Y R RR R C RR G A R R 5´ C G A A G Y Y R Y Y RR G G G R U G G A G 5´ C C R A Y C C C R U C C G A A C U Y G G 5´ A N Y A G N R A U N C G T loop U t ur n k t r n1 k t r n2 tw ist R C Y R G G A AC U G A RC R U Y AG U A C G GG A R R A5´ Y Y Y A GU A G Y R A G G A A R R R 5´ R Y G R Y A A Y C RY A Y Y A G R GA A Y C 5´ R C A GG A G Y 5´ A C A C U G R Y R Y G Y R R R R R Y C A R U Y 5´ R A G C R C G R A G Y AY G Y Y R G U U Y 5´ A A A A A G C Y R Y Y R R Y G G Y U U U U UU Y U Y5´ R R A R R Y Y U U UU U U Y5´ sar r ic1 sar r ic2 U A A G A N C sr C loop dom V t er m 1 t er m 2 R Y Y Y Y G C G A G C A G A C G C A R A A C R C C C R R Y R R Y G G G Y G U U Y U G C G U C U G C U C G C R R R R5´ Y U Y UC U C A A C AG UG Y U U G R R R A A Y 5´ Y Y Y Y Y A U GA Y G R Y Y Y YA A A Y Y Y YY R R G R R Y C U GAU Y Y Y R R R 5´ G G G U C U C U C U G Y U A G A C C A G AU CU G A G C C UG GG A G C U C U C U G G C U A R C U A G G G A A C C CA C5´ UG U A A A C A U C CU Y G A C U G G A A G C UG U R R R Y R Y R R RR G C U U U C A G U C G G A U G U U U G C 5´ U CU U U G G U U A U C U A G C U G UA U G AG U G Y Y R C RU C A UA A A G C U A G A U A C C G A AR U5´ C Y Y R UC C C U G A G A C C C U A A C Y U G U G AG Y U Y YY A G Y UU C A C A R G U R G G Y U C U Y G G G R CY R G G 5´ G C U A A A A G G A A C G A U C G U U G U G A U A U GC G U U RRU U YC G U U AC A U A U C A C A G U G A U U U U C C U U U A U A R CG C5´ C Y GY G Y Y C A U C U U A C Y G RG C A G U G U U G GA U G Y YY R R G Y C UC U A A Y A C U G YC U G G U A A Y G A U G R C RY C G G5´ Y Y Y Y R R GY A C A U R C U U C U U U A U A U C C C A U AY R A Y R R R CU A U G G A A U G U A A A G A A G U A U G U AY Y Y G G Y5´ Y R R YY C R U C A A A R U G G Y U G U G A R U G U Y R U CA U A U C A C A G C C A C U U U G A U G AG Y U Y R R5´ Y A A RA A G G G A A Y R G U U G C U G U G A U R U A Y Y Y A Y Y Y Y U YU A U A U C A C A G U G G C U G U U C U U U UU G G U Y5´ Y C R G G U G A G G U A G U A G G U U G U A U A G U U RR R R Y Y Y Y YG G A GY A A C U R U A C A A Y C U R C U A C U U Y C C U G R 5´ G G C U G G U C C G A R RG U A G U G G G U U A Y R U Y A AY Y Y Y U U R Y Y Y YU C Y C C CYC Y C A C U RC UR YA C U U G A C U R G C CU U U5´ Y Y Y C U G Y R R U G U C G UA R Y Y Y Y Y U G A R C CRAY Y Y Y Y Y G G G R G Y Y Y Y Y R G G YA G C C C YY G G GA A R C A A R Y R R R R Y R C C C A CCU R R R Y R YRG G U U C A R R R R Y A C G G C A Y Y R Y G G R Y YY Y5´ Y Y R C G R C C A UA C R R R G R A R C A CC Y G R U C C CA U C C G A A CY C R GA A G U UA A GC Y Y Y Y GG C Y R R G U A C U R G R YG RG R AYC CUG GG AA RY RGGU G Y Y G Y R RY 5´ G RU A GYYY AR Y G G Y AR R R C RY Y R G Y U Y A A Y Y R R RR Y R RG G UU C R AR U C C Y YY YR 5´ R R AAR Y U C R Y R R R R GYYAC R R YG A G U R Y Y R YRCUC Y CYYYY G G G A A GGU C U G A G A R G C CAY Y R C C CU G GGGYR Y Y Y Y Y Y GR R R R G R R R R Y G R G Y Y A C C AG A A A Y R R Y Y Y Y R RGY U U GGAA RRCUYRY GGCY RG Y R R Y U A G U C A A U R Y GRR Y R R Y Y Y R AAC Y C R A UUCAG A C U A UCU Y Y 5´ T R I T I R E SE C I S m ir -T A R m ir -30 m ir -9 lin-4 m ir -5 m ir -8 m ir -1 m ir -2 m ir -6 let -7 Y R N A 6S 5S t R N A R N aseP AURRGRYA G G YA U U G AA CUGU AU U G U G CR C C UU GCAUARAGCUAAAGCACUAAAAAGGAGUAA5´ A G U C A U G A U YG C U A U U C Y Y Y A A A U A G UG A U U G U G A U AG C G A U G C G G Y G U G U UG C G C A C R Y C G Y A Y C G CG C U5´ AGAGGAARCR G G G G C CAY G C A GAAGC G U UC ACG U C G C G G C C C CU GUC A G A U U C RGU R A A U C U GC GAAUUCUGCU5´ G A U AC A U A G G A A C C U C C U C A A A G G A U U C U A U GG A C AG U C G A U G C A G G G A G G G A CR R C U C C C U G C A U C G G CG A U U U U5´ A C G R RG U R RA R UG C G A U A A Y A YA A U A A U GAAA U U C C U CU U U G A C G G C C A A U A GC GA U A U U G G C CA U U U U U U U 5´ R Y C U U U A G C G GG Y U R RR U Y A R U CURG Y Y G G Y G U U U C G C C G R C Y YU R C Y Y U G A Y R Y 5´ RYYRYYCC G U G G UG A U U U G RYC GGCCGG C U U G C AG C C A C GU UAAAYAAUCGCUAAARAGGCCGRGGRRR5´ G UCGRR U Y Y C A C UG A U G AG U C Y U R ARGAC G A AA C 5´ Y Y R A U Y U AAA RA A A C A G CU U UC A AG U G CCU U U Y U GC A G U U YYY CARGAGCGC A A G A U RG R U A5´ R Y G GY Y G Y U U G C C A U A C G C C C YY Y YY C G G C A GG U A U G G A A R C A C C C YC G Y A CG A C U G GY Y C G G A C A CY GY C G U C CC G C C A G A U C 5´ CA C A U C A G A U U U C C U G G U G UA A CG A A U U U U C A A G U G C U U C U U G C A U A A G C A A G U U URA U C C C G C Y C CY YC G R G Y C G G G A UU U5´ A U GG A G A C A UGGCR U AA AG C C AG A R A G U R A G A AC R U A A C Y U A G A C U R U ACUUGAA C U G A U UYRC A U C U CA U U U U5´ G C R C Y G C AA AA U C R G R Y G C C G G G A U UG G YA YCCCG R A Y R R R R Y R A R C G C Y GCGYU U U U U U 5´ Y U R C G U G A C G A A G CG C G C G CA A A G UGG A C AA U A A AG C C UR A G C RU Y R A G UAG U C G Y CAG A C G C C G G U U A A G C C G G C G U UU U U U5´ YR Y A C G UR Y C Y G U U R UR G Y C C G G U U G C U U UG GU C G G U G A C C G G R R R R R A G C C C R C UU G G U G G G Y U UU U U5´ G G Y C R G C Y C R CC C C CC R G R G C Y G R C C G A C G G C C C C C G C U CC C C CCY GGCGGGGGYCGUC C C Y Y 5´ U U G G C G A U R UU U U U G GU U G G A A U G UAGUGY YY UU A R C A C U AA A CG C U G CC A C AA A U A A CCUG U CAGU U A U U U C A Y C A A A AA U A A A5´ RYYRYUG C C C UCY G G G CG UUUCCUCCCUAGACUU G G C Y Y YY R R G G C CU UUUUUUUYYY5´ SA M V sym R C P E B 3 F inP sr oB m sr SA M a H H 3 V m nt n3 livK D sr A C A E SA R isr K sr oD isr B 6C r spL suhB UY G C A UCCGCYAA Y CGGUYA G C C GU G UC G C GG A A G G U U Y Y Y A A C CA G C UR Y Y U Y Y G RA ACRRAG RRA GGUG A G C G 5´ UG A A A GAC G C G C A U U U GU U A U C A U CA UC C CU G U Y C A G AG A U GY A A U U U GG CC AC AG Y RY G U G G C C U U U U C 5´ * U U C U A C U G A C U C UU U U A AA A U A AU U A U U C A U U G G AG G U UU A A UA U G A A U A UA A A G G A U G A G CA U A U A G A AG C GUUUG C UCYUU GU U A G AU C R G U U A G U A G G AA 5´ G A U U UG G U R R C U G C G C U C UU C UA A G C C A G U U A C C CG G U U C A A A R A U U G C C A G C U U Y G A A C CU UC G A A A A A C C A C C U Y CR R G G U G G U U U U U U C GU 5´ R R R R R R R R C U C R U AU A A YYYCRRR AA U A UG GY Y Y G R R A GU U U C UAC C R R G Y R C CG U AAA YRYYYG A CU A Y G A G RR R5´ C G G C A U C C C C A U U A C C U A U G G AC A CG G U G C C G C A R G C U C U G G R A G UU C GUYCCRGAGYYUG Y Y G G A A R G G U U U U C C G U G U C C A G 5´ R R Y G G A R G CRR U GA R Y R Y Y Y YU Y A U YU G G GCA C Y U G R R R Y R YG G A G C YAG U R GU G C A ACCG R C C R Y R R R 5´ G U U G U A A C U AU G U U G C A R YA R A C G AG A A C C G AG U A U A G U U C A U GG G R U Y A CA UG AA UU G U UU A A CU RU CC U C U GG A U U C CC G U C C AU G R C A GU C G G U U C 5´ CUUA C U G A GA G C A C AA A GU UUC C C G U GC CA A C A G G G A G U G U UAU A AC G G UU UAUU A G U C U G G AG ACG G C A G A C U AU CCUCUUC C C G G U C CC CUA U G C C G G GU UUUUUUUAUGUC5´ UURGRYUYRCCUG A A U G U G A CU A U C A C U U CA AACRRYGRGYAACCUCAGUAUCAUCRYRGAGYUA A A C C C U C G C C G C CUG A C G G Y G A G G G U U UU CUUUUGGR5´ U G U A A A A A A C A U Y A U U UA G C GUGAYU U U C U A U C A ACAG C U A A C A A U U G U UA U U A C UG C CUA A Y G Y U C A UA A G G G U A AUU U U A A A A A AGG G CG A U A A AA A A C G A U U G G GGGA U G A G A Y A U G AAC G C UC A A G C A5´ C C C A G A G G U A U U G A UU G G U G A U R R C A Y Y U C U R U G Y U Y A U UY A U UR C A C C A A C C U G C G C RG A UGCGCAGGU U U U U U U U 5´ AR R R Y Y YYYAAURYCAACYUUUAGCGCACG G C U C U YY A A G A G C CA UUYCCCUA G R C C A A A C A G GAAU Y G U U U G G Y C UU UUUUU5´ G G G C A R G A U A U G U G A A GU R GC Y A C C GC AA GC YGR U A CY CUU CAC Y Y Y C C U U A U UC G C U Y GC U CAAC GGR A U C Y U G C U CU G C G A G G C Y5´ GUGCRRYCYRAUUYYR G Y Y G Y G C C Y R Y R A R AAC AUCAYAA R A U A CG G C R C R R CC ACRAUUUCCCUG G U G U U GG C G C A GU AUU C G C G C A C CC CGGUCUACC5´ Y U U Y R Y U R R U U U Y A U C A R A YC U GU U U G A U R R A A G Y U A R Y G A R R Y Y C A Y UA A C R G C U Y U Y GC Y G G C Y Y G A C C C G A G R Y Y G U UU U U U U5´ RACGUUCAY C C Y YY R G G RC GCAYRA Y C A R R Y C A Y GG AAC G G G G R Y Y U G R R 5´ sucA Sr aD sxy R N A I P ur ine SA M -C hl cdiG M P 2 A nt i-Q G adY r nk ldr P r fA O m r A -B R yeB t r aJ 2 Sr aH 23Sm et h D S-p ep U U C G G C C Y CG C R R C G YU U YU Y C G Y Y G CC C U C U G C A YG C C G U C G C C G A CGCAY U C C Y A U U CG A A Y Y G U G C G A U C C U G U C G C CY U C C U GC G G C G C G G C 5´ CG Y R G C G C U U G U UA U U U R Y Y G C U G U G U A G U GUC G U C Y YR A R Y Y R G R R Y Y Y A A A C C C C G C C Y UU Y G G C G G G G U U U UG C U U U U U5´ ** C U U A C C G G A G GY R U A UGGAC C C UG A UC C C AC Y C C U C U C C C C G A UG G A G AA U Y Y YU U U C C G G U A A GC C Y G Y C U Y Y R C U G Y Y U U A C C G G UG Y G U A A G G C A G UG A C G U Y U5´ G G R A G R Y R Y CU G GU G R Y C G G C U UC A AA CC GR Y G RR G Y R Y Y Y Y G G Y RGG U U C G AY U C C Y RY Y C U Y C C 5´ U G A C C C U U U A R C C R A G G G U C AC C U A G C C A A C U G A C GU U G U U AG U G A A Y YY A U G U U C A C A RA U A R GC C A A U C G C U U U G C G R U U G GC U U U U U U U U U5´ C U U A A UR A A CAA G A A A A C YAA R C G U A C Y U U C C Y C C U G AG UU C A G G C U G G A A UG C G C A CAG C U RA U U G U U G A U AA G G G CU ACUC AUACCGACAA GC CAGU G A A G C G AUG A AU G U C GG U U CC A C5´ R U Y Y RC U G A Y GA G U C C C A A AU A G G A CGA A A C G C GCGU CY G R A U 5´ CU C C A U GU A U C U UU G G G A C C U G U C A GC UG U G G C A G U CU C C C U UC C U A G CC A U G G AA G A G C A U A U U C UU G U U U AU U G G C A A A GC U G U CA C C A U UU RA U U G G UA U C A G A U U C U GAC U U G C A C A AG U A A C AU U C5´ C Y G G U U GG U G G C G C A C U U C C Y Y A C G G G C G G U G U R U Y A CG Y R Y U R Y R R Y A G A R R R A Y A C C A G C C C G C Y RR R A G C G G G C UU U U U U5´ G U C A U A C U A C G G UG C A A Y GY R RA A A G U A A AC G A U G A C C C Y A RG A A C U C Y RG G U A A A A U R CR UAUC A A A A U G Y A A A A U U G U Y U G A C C U G G GR UY Y UCCGGGUYRG Y U Y U U U U 5´ U R U G C U A A C U R R R A A YG U U G Y A U R Y A A CCC U U G R Y G C U U A U Y CC U U U R Y C A A GC A U A U U A Y AR C G R U C G Y YA A A G G A G A A A U G5´ U C R A A A G A A C AU G A A A U G G A G G AGAAAUU AC A GC A A U U UA UC AR C U G A A A UU A U AG G U GU AG ACA C A UGUC A GC R G UG G A A A CAGUU UC U A UC A A A A UU A A AG U A U UUAG A G AUUUU C C U C A AA U U U C AA A U5´ ACAG G G U A R G G R Y Y Y Y Y UU RU R R R R R Y C C U U A C C G GR UUUCU C A A R U Y G G R G YA AA Y C C G R U U G RA RUAUARAGGARG5´ CGYGUUA U A U G CC UU U A U U G UC ACARUUYUUUUUYYG Y U G R Y C A U U G GYAY YA U U R A U U Y C C A G CR AUAAAYG A C A A G C C C G A A C RY U G U U C G G G C U U U UU UUURRUYA5´ Y Y Y AU G G Y G G Y G R G G G R RCC UU Y G GG Y Y G C C G GUU C C YY R CCG GU Y U RC C A A C C C Y Y R C Y R C C AC C Y5´ AUGGAYRU G C G C A GGA A G C G CR AAGACARACAGGGACACRYAGGRA C CCG GA UGGYGGRRYAGGAUGUCAGGRAACAGUCUGCA A A G C C C C G C YY YG G C G G G G U U U U 5´ P s-R ho r nk ps M gsens t R N A S Q r r isr C H H 1 SN R 24 T r p ldr gr eA pr eQ 12 H A R 1F T er m L eu M icC C 4 R sm Y R ib osom e Paul Gardner Engaging Scientists
  • 3. What is Rfam? A database of ncRNA alignments and structures Used for annotating RNAs in genome sequences, bioinformatic algorithm development and molecular evolutionary analyses Gardner et al. (2008) Rfam: updates to the RNA families database Nucleic Acids Research. Paul Gardner Engaging Scientists
  • 4. How can we keep textual descriptions of RNAs up to date? AC RF00005 ID tRNA CC Transfer RNA (tRNA) molecules are approximately 80 nucleotides in CC length. Their secondary structure includes four short CC double-helical elements and three loops (D, anti-codon, and T CC loops). Further hydrogen bonds mediate the characteristic CC L-shaped molecular structure. tRNAs have two regions of CC fundamental functional importance: the anti-codon, which is CC responsible for specific mRNA codon recognition, and the 3’ end, CC to which the tRNAs corresponding amino acid is attached (by CC aminoacyl-tRNA synthetases). tRNAs cope with the degeneracy of CC the genetic code in two manners: having more than one tRNA (with CC a specific anti-codon) for a particular amino acid; and ’wobble’ CC base-pairing, i.e. permitting non-standard base-pairing at the CC 3rd anti-codon position. RN [1] RM 8256282 RT The tertiary structure of tRNA and the development of the genetic RT code. RA Hou YM; RL Trends Biochem Sci 1993;18:362-364. RN [2] RM 9023104 RT tRNAscan-SE: a program for improved detection of transfer RNA genes RT in genomic sequence. RA Lowe TM, Eddy SR; RL Nucleic Acids Res 1997;25:955-964. Paul Gardner Engaging Scientists
  • 5. This Wikipedia thing looks pretty good! Paul Gardner Engaging Scientists
  • 6. WikiProject RNA The WikiProjects are social corners of Wikipedia for interested parties to discuss themed articles Involved in reviewing, ranking and rating articles Now rolled into the larger WikiProject Molecular and Cellular Biology Paul Gardner Engaging Scientists
  • 7. How has the Wikipedia experiment gone? x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x 0 2000 4000 6000 8000 10000 Number of Rfam pages edited Year Numberofedits 2007 2008 2009 2010 2011 9089 x x xxxxxxxxxxx xxxxxxxxxxxx xxxxx xx x 106 Total edits Vandalism Gardner et al. (2011) Rfam: Wikipedia, clans and the “decimal” release Nucleic Acids Research. Paul Gardner Engaging Scientists
  • 8. Who are these Wikipedians donating their time? Rfambot Ppgardne Citationbot1 WillowW SmackBot DOI_bot Addbot Alexbateman Jebus989 JenniferRfm Zashaw Rjwilmsi Qwyrxian Yobot RE73 Narayanese RichFarmbrough Addshore Wgscott MiRroar RjwilmsiBot Arcadian DO11.10 Gortonk Banus Drmed36 FrescoBot Boghog Top 20 Rfam wikiproject editors Numberofedits 0 200 400 600 800 1000 Bots Proof Readers Scientists Paul Gardner Engaging Scientists
  • 9. What incentives can we give to Academics? Academics love publishing articles Introducing the “families track” at RNA Biology Publication requirements are an alignment & a Wikipedia article 100s of new families have been added thanks to this track Paul Gardner Engaging Scientists
  • 10. Who else is now using this model? Finn, Gardner, Bateman (2012) Making your database available through Wikipedia: the pros and cons Nucleic Acids Research. Paul Gardner Engaging Scientists
  • 11. Wikipedia need you! What is the highest impact contribution academics can make? Rule 1: Register an Account Rule 2: Learn the Five Pillars ENCYC, NPOV, FREE, RESPECT, NORULES Rule 3: Be Bold, but Not Reckless Rule 4: Know Your Audience Rule 5: Do Not Infringe Copyright ... Paul Gardner Engaging Scientists
  • 12. Who might be reading about your field? Paul Gardner Engaging Scientists
  • 13. Thanks! The Rfam Consortium Wikipedians & the long tail! PPG is supported by a Rutherford Discovery Fellowship from Government funding, administered by the Royal Society of New Zealand. Paul Gardner Engaging Scientists