Powerful Google developer tools for immediate impact! (2023-24 C)
Engaging Scientific Communities in Contributing to a Biological Database
1. Engaging a Scientific Community in Contributing
to a Biological Database
Paul Gardner
June 21, 2013
Paul Gardner Engaging Scientists
2. What is RNA?
RNA is a fundamental biological molecule, essential for untold
biological processes
My aim is to build an analog to the Periodic Table for
classifying RNA families and motifs, enabling researchers to
predict function.
New technologies are accelerating the rate of RNA discovery.
base basepair
R
A
U
A
G
A
U Y
A
C
A
U
U
5´
Y
G
A
A
R
5´
C
U
U C
G
G
5´
R
U
R R R
Y
5´
R
R
G
C
G
U
R A
R
A
G
C
Y
5´
R
Y
G
G
A
G
Y
R RR
R
C RR
G
A
R
R
5´
C
G
A
A
G
Y
Y
R
Y
Y RR
G
G
G
R
U
G
G
A
G
5´
C
C
R
A
Y
C
C
C
R
U C
C
G
A
A
C
U
Y
G
G
5´
A N Y A G N R A U N C G T loop U t ur n k t r n1 k t r n2 tw ist
R
C
Y
R
G
G
A
AC
U
G
A
RC
R
U
Y
AG
U
A
C
G
GG
A R R A5´
Y
Y
Y
A
GU
A
G Y R
A
G
G
A
A
R
R
R
5´ R
Y
G
R
Y
A
A
Y
C
RY
A Y
Y
A
G
R
GA
A
Y
C
5´
R
C
A
GG A
G
Y
5´
A
C
A
C U
G
R
Y
R
Y G Y R R R R
R
Y
C
A
R
U
Y
5´
R
A
G
C
R
C
G
R
A
G
Y AY
G
Y
Y
R
G
U
U
Y
5´
A
A
A
A
A
G
C
Y
R
Y
Y R
R
Y
G
G
Y
U
U
U
U
UU
Y U Y5´
R
R
A
R
R Y
Y
U
U
UU
U U Y5´
sar r ic1 sar r ic2 U A A G A N C sr C loop dom V t er m 1 t er m 2
R Y Y Y Y
G
C
G
A
G
C
A
G
A
C
G
C
A
R
A
A
C
R
C
C
C
R
R
Y
R
R
Y
G
G
G
Y
G
U
U
Y
U
G
C
G
U
C
U
G
C
U
C
G
C
R R R R5´
Y
U
Y
UC
U
C
A
A
C
AG
UG
Y
U
U
G
R
R
R
A
A
Y
5´ Y
Y
Y
Y
Y
A
U
GA
Y G
R
Y
Y
Y
YA
A
A Y
Y
Y
YY
R
R
G
R
R
Y
C U GAU
Y
Y
Y
R
R
R
5´
G
G
G
U
C
U
C
U
C
U
G
Y
U
A
G
A
C
C
A
G
AU
CU
G
A
G
C
C
UG
GG
A
G
C
U
C
U
C
U
G
G
C
U
A
R
C
U
A
G
G
G
A
A
C
C
CA
C5´ UG
U
A
A
A
C
A
U
C
CU
Y G
A
C
U
G
G
A
A
G
C
UG
U
R
R
R
Y R Y
R
R
RR
G
C
U
U
U
C
A
G
U
C
G
G
A
U
G
U
U
U
G
C
5´ U CU
U
U
G
G
U
U
A
U
C
U
A
G
C
U
G
UA
U
G
AG
U
G
Y
Y R
C
RU
C
A
UA
A
A
G
C
U
A
G
A
U
A
C
C
G
A
AR
U5´ C
Y
Y
R
UC
C
C
U
G
A
G
A
C
C
C
U
A
A
C
Y
U
G
U
G
AG
Y
U
Y
YY
A
G
Y
UU
C
A
C
A
R
G
U
R
G
G
Y
U
C
U
Y
G
G
G
R
CY
R
G
G
5´
G
C
U
A
A
A
A
G
G
A
A
C
G
A
U
C
G
U
U
G
U
G
A
U
A
U
GC
G
U
U
RRU
U
YC
G
U
U
AC
A
U
A
U
C
A
C
A
G
U
G
A
U
U
U
U
C
C
U
U
U
A
U
A
R
CG
C5´ C Y GY
G
Y
Y
C
A
U
C
U
U
A
C
Y
G
RG
C
A
G
U
G
U
U
G
GA
U
G
Y
YY R
R
G
Y
C
UC
U
A
A
Y
A
C
U
G
YC
U
G
G
U
A
A
Y
G
A
U
G
R
C
RY
C G G5´ Y Y Y Y R R GY
A
C
A
U
R
C
U
U
C
U
U
U
A
U
A
U
C
C
C
A
U
AY
R
A
Y
R
R
R
CU
A
U
G
G
A
A
U
G
U
A
A
A
G
A
A
G
U
A
U
G
U
AY
Y Y G G Y5´ Y R R YY
C
R
U
C
A
A
A
R
U
G
G
Y
U
G
U
G
A
R U
G
U
Y
R
U
CA
U
A
U
C
A
C
A
G
C
C
A
C
U
U
U
G
A
U
G
AG
Y U Y R R5´ Y A A RA
A
G
G
G
A
A
Y
R
G
U
U
G
C
U
G
U
G
A
U
R
U
A
Y
Y
Y
A Y
Y
Y
Y
U
YU
A
U
A
U
C
A
C
A
G
U
G
G
C
U
G
U
U
C
U
U
U
UU
G G U Y5´ Y
C
R
G
G
U
G
A
G
G
U
A
G
U
A
G
G
U
U
G
U
A
U
A
G
U
U
RR
R
R
Y
Y
Y
Y YG
G
A
GY
A
A
C
U
R
U
A
C
A
A
Y
C
U
R
C
U
A
C
U
U
Y
C
C
U
G
R
5´
G
G
C
U
G
G
U
C
C
G
A
R
RG
U
A
G
U
G
G
G
U
U
A
Y
R
U
Y
A
AY
Y
Y
Y
U
U
R
Y
Y Y YU
C
Y
C
C
CYC
Y
C
A
C
U RC
UR
YA
C
U
U
G
A
C
U
R
G
C
CU
U U5´ Y
Y
Y
C
U
G
Y
R
R
U
G
U
C
G
UA
R
Y
Y
Y
Y
Y
U
G
A
R
C
CRAY
Y
Y
Y
Y
Y
G
G
G
R
G
Y
Y
Y
Y
Y
R
G
G
YA
G C
C
C
YY
G
G
GA
A
R
C
A
A
R
Y
R
R
R
R
Y
R
C
C
C A CCU
R
R
R
Y
R
YRG
G
U
U
C
A
R
R
R
R
Y
A
C
G
G
C
A
Y
Y
R
Y
G
G
R
Y
YY
Y5´
Y
Y
R
C
G
R
C
C
A
UA
C
R
R
R
G
R
A
R
C
A
CC
Y
G
R
U
C
C
CA
U C
C
G
A A
CY
C
R
GA
A
G
U UA
A
GC
Y
Y
Y Y
GG
C Y
R R
G
U
A C U
R
G R YG
RG
R
AYC
CUG
GG
AA
RY
RGGU
G
Y
Y
G
Y
R
RY
5´
G
RU
A
GYYY
AR
Y
G
G
Y AR
R R C
RY
Y
R
G
Y
U
Y A
A
Y
Y
R
R
RR
Y
R
RG
G UU
C
R
AR
U
C
C
Y
YY
YR
5´
R
R
AAR
Y
U
C
R
Y
R
R
R
R
GYYAC
R
R
YG
A
G
U
R
Y
Y R
YRCUC
Y
CYYYY
G G G A A GGU
C U G A G
A
R
G
C
CAY
Y
R
C
C
CU
G
GGGYR
Y
Y
Y
Y
Y
Y
GR
R
R
R
G
R
R
R
R Y G R G Y Y
A
C
C
AG
A A A Y
R
R Y Y
Y
Y
R
RGY
U
U
GGAA
RRCUYRY
GGCY
RG Y R R Y U
A
G
U
C
A
A
U
R
Y
GRR
Y
R
R
Y
Y
Y
R
AAC
Y
C
R
A
UUCAG
A
C
U
A
UCU
Y
Y
5´
T R I T I R E SE C I S m ir -T A R m ir -30 m ir -9 lin-4 m ir -5 m ir -8 m ir -1 m ir -2 m ir -6 let -7 Y R N A 6S 5S t R N A R N aseP
AURRGRYA
G
G
YA
U
U
G
AA
CUGU
AU
U
G
U
G
CR
C
C
UU
GCAUARAGCUAAAGCACUAAAAAGGAGUAA5´
A
G
U
C
A
U
G
A
U
YG
C
U
A
U
U
C
Y
Y Y
A
A
A
U
A
G
UG
A
U
U
G
U
G
A
U
AG
C
G
A
U
G
C
G
G
Y
G
U
G
U
UG C
G
C
A
C
R
Y
C
G
Y
A
Y
C
G
CG
C U5´
AGAGGAARCR
G
G
G
G
C
CAY
G
C
A
GAAGC
G
U
UC
ACG
U
C
G
C
G
G
C
C
C
CU
GUC
A
G
A
U
U
C
RGU
R
A
A
U
C
U
GC
GAAUUCUGCU5´
G A U AC
A
U
A
G
G
A
A
C
C
U
C
C
U
C A
A
A
G
G
A
U
U
C
U
A
U
GG
A C AG
U
C
G
A
U
G
C
A
G
G
G
A
G
G
G A CR
R
C
U
C
C
C
U
G
C
A
U
C
G
G
CG
A U U U U5´ A
C
G
R
RG
U
R RA
R
UG
C
G
A U A A Y A YA
A
U
A
A
U
GAAA
U
U
C
C
U
CU
U U G A C
G
G
C
C
A
A
U
A
GC
GA
U
A
U
U
G
G
C
CA
U
U
U
U
U
U
U
5´ R
Y
C
U
U
U
A
G
C
G
GG
Y
U
R
RR
U
Y A R U CURG
Y
Y
G
G
Y
G
U
U
U
C
G
C
C
G
R
C
Y YU
R
C
Y
Y
U
G
A
Y
R
Y
5´
RYYRYYCC
G
U
G
G
UG
A
U
U
U
G
RYC
GGCCGG
C
U
U
G
C
AG
C
C
A
C
GU
UAAAYAAUCGCUAAARAGGCCGRGGRRR5´
G
UCGRR
U
Y Y C A
C
UG
A U G AG U C Y
U R
ARGAC
G
A
AA
C
5´ Y Y R
A
U
Y
U
AAA
RA
A
A
C
A G CU
U UC
A AG
U G CCU U U Y U GC
A G
U
U
YYY
CARGAGCGC
A
A
G
A
U
RG
R U A5´
R
Y
G
GY
Y G
Y
U
U
G
C
C
A
U
A
C
G
C
C
C
YY
Y YY
C
G
G
C A
GG
U
A
U
G
G
A
A
R
C
A
C
C
C
YC
G Y A CG
A
C
U
G
GY
Y
C G
G
A
C
A
CY
GY
C
G
U
C
CC
G
C
C
A
G
A
U
C
5´ CA
C
A
U
C
A
G
A
U U U
C
C
U
G
G
U
G
UA
A CG
A
A
U
U
U
U
C
A
A
G
U
G
C
U U C
U
U
G
C
A
U
A
A
G
C
A
A
G
U
U
URA
U
C
C
C
G
C
Y
C
CY
YC
G
R
G
Y
C
G
G
G
A
UU
U5´ A U GG
A
G
A
C
A
UGGCR
U
AA
AG
C C AG
A R
A
G U R A
G
A
AC
R U A A C
Y
U
A
G
A
C
U
R
U
ACUUGAA
C
U
G
A
U
UYRC
A
U
C
U
CA
U U U U5´
G
C
R
C
Y
G
C
AA
AA
U
C
R
G
R
Y
G
C
C G G G A
U UG
G
YA
YCCCG
R
A
Y
R
R
R
R
Y
R
A R C G C
Y
GCGYU
U
U
U
U
U
5´
Y U R C G U G A C G A A G CG
C
G
C
G
CA
A
A
G
UGG
A C
AA
U
A
A
AG
C
C
UR
A G C
RU
Y
R
A
G
UAG
U
C
G
Y
CAG
A
C
G
C
C
G
G
U
U A A
G
C
C
G
G
C
G
U
UU
U U U5´ YR
Y
A
C
G
UR
Y
C
Y
G
U
U
R
UR
G
Y
C
C
G
G
U
U
G
C
U
U
UG
GU
C
G
G
U
G
A
C
C
G
G
R
R R R
R
A
G
C
C
C
R
C
UU G
G
U
G
G
G
Y
U
UU
U U5´
G
G
Y
C
R
G
C
Y
C
R
CC
C C CC
R
G
R
G
C
Y
G
R
C
C
G A C G G C C C C C G C
U CC
C
C
CCY
GGCGGGGGYCGUC
C
C
Y
Y
5´
U U G G C G A U R UU
U
U
U
G
GU
U G
G
A
A
U
G
UAGUGY
YY
UU
A
R C A C U AA
A CG
C U G
CC
A C AA
A
U
A
A
CCUG
U
CAGU
U
A
U
U
U
C
A
Y
C
A
A
A
AA
U A A A5´
RYYRYUG
C
C
C
UCY
G
G
G
CG
UUUCCUCCCUAGACUU
G
G
C
Y
Y
YY
R
R
G
G
C
CU
UUUUUUUYYY5´
SA M V sym R C P E B 3 F inP sr oB m sr SA M a H H 3 V m nt n3 livK D sr A C A E SA R isr K sr oD isr B 6C r spL suhB
UY
G
C
A
UCCGCYAA
Y
CGGUYA
G C C GU G UC
G C GG A
A G
G
U
U
Y Y
Y
A
A
C
CA
G C UR
Y
Y U Y Y G RA
ACRRAG
RRA
GGUG
A
G
C
G
5´
UG
A
A
A
GAC
G
C
G
C
A
U
U
U
GU
U A U C A U CA
UC
C CU
G
U Y
C
A
G
AG
A
U
GY
A
A
U
U
U
GG
CC
AC
AG
Y
RY
G
U
G
G
C
C
U
U
U
U
C
5´
* U
U
C
U
A
C
U
G
A
C
U
C
UU
U
U
A
AA
A
U
A
AU
U
A
U
U
C
A
U
U
G
G
AG
G U UU
A
A
UA
U
G
A
A
U
A
UA
A A G G A U G A G CA
U A
U
A
G
A
AG
C
GUUUG
C
UCYUU
GU
U
A
G
AU
C
R
G
U
U
A
G
U
A
G
G
AA
5´
G A U U UG
G
U
R
R
C
U
G
C
G
C
U
C
UU
C UA
A
G
C
C
A
G
U
U
A
C
C
CG
G
U
U
C
A
A
A
R
A
U
U
G C C
A
G
C
U
U
Y
G
A
A
C
CU
UC
G
A
A
A
A
A
C
C
A
C
C
U
Y CR
R
G
G
U
G
G
U
U
U
U
U
U
C
GU
5´
R R R R R R R R
C
U
C
R
U
AU
A
A
YYYCRRR
AA
U
A
UG GY
Y Y G R R A
GU
U U C UAC
C R R G Y R
C CG
U
AAA
YRYYYG
A
CU
A
Y
G
A
G
RR
R5´
C
G
G
C
A
U
C
C
C
C
A
U
U
A C C
U
A
U
G
G AC
A
CG
G
U
G
C
C
G
C A R G C U C U G G R A
G UU
C
GUYCCRGAGYYUG
Y
Y
G
G
A
A
R
G
G
U
U
U
U
C
C
G
U
G
U
C
C
A
G
5´
R
R
Y
G
G
A
R
G
CRR
U
GA
R
Y
R
Y
Y
Y
YU
Y
A
U
YU
G G GCA
C
Y
U
G
R
R
R
Y
R
YG
G
A
G
C
YAG
U R GU
G
C
A
ACCG
R
C
C
R
Y
R
R
R
5´
G
U
U
G
U
A
A
C
U
AU
G
U
U
G
C
A
R
YA
R A C G AG
A
A
C
C
G
AG
U
A
U
A
G
U
U
C
A
U
GG
G
R
U Y A
CA
UG
AA
UU G U UU
A
A
CU
RU
CC
U
C
U
GG
A
U U
C
CC
G
U
C
C
AU
G
R
C
A
GU
C
G
G
U
U
C
5´
CUUA
C
U
G
A
GA
G
C
A
C
AA
A
GU
UUC
C
C
G
U
GC
CA
A
C
A
G
G
G
A
G
U
G
U
UAU
A
AC
G
G
UU
UAUU
A
G
U
C
U
G
G
AG
ACG
G
C
A
G
A
C
U
AU
CCUCUUC
C
C
G
G
U
C
CC
CUA
U
G
C
C
G
G
GU
UUUUUUUAUGUC5´
UURGRYUYRCCUG
A
A
U
G
U
G
A
CU
A
U
C
A
C
U
U
CA
AACRRYGRGYAACCUCAGUAUCAUCRYRGAGYUA
A
A
C
C
C
U
C
G
C
C
G
C
CUG
A
C
G
G
Y
G
A
G
G
G
U
U
UU
CUUUUGGR5´
U G U A A A A A A C A U Y A U U UA
G
C
GUGAYU
U
U
C
U
A
U
C
A
ACAG
C U A A C
A
A
U
U
G
U
UA
U
U
A
C
UG
C
CUA
A
Y
G
Y
U
C
A
UA
A G G G U A AUU
U
U
A
A
A
A
A
AGG
G CG
A
U
A
A
AA
A
A
C
G
A
U
U
G G GGGA
U
G
A
G
A
Y
A
U
G
AAC
G
C
UC
A A G C A5´
C C C A G A G G U A U U G A UU
G
G
U
G
A
U
R
R
C
A
Y
Y
U C U
R
U
G
Y
U
Y
A
U
UY
A
U
UR
C
A
C
C
A
A C C U G C G C RG
A
UGCGCAGGU
U
U
U
U
U
U
U
5´
AR
R
R
Y
Y
YYYAAURYCAACYUUUAGCGCACG
G
C
U
C
U
YY
A
A
G
A
G
C
CA
UUYCCCUA
G
R
C
C
A
A
A
C
A
G
GAAU
Y
G
U
U
U
G
G
Y
C
UU
UUUUU5´
G
G
G
C
A
R
G
A
U
A
U
G
U
G
A
A
GU
R
GC
Y
A
C
C
GC
AA
GC
YGR
U
A
CY
CUU
CAC
Y
Y Y C C
U
U
A U UC
G C
U
Y
GC
U
CAAC
GGR
A
U
C
Y
U
G
C
U
CU
G C G A G G C Y5´
GUGCRRYCYRAUUYYR
G
Y
Y
G
Y
G
C
C
Y
R
Y
R
A
R
AAC
AUCAYAA
R
A
U
A
CG
G
C
R
C
R
R
CC
ACRAUUUCCCUG
G
U
G
U
U
GG
C
G
C
A
GU
AUU
C
G
C
G
C
A
C
CC
CGGUCUACC5´
Y
U
U
Y
R
Y
U
R
R
U
U
U
Y
A
U
C
A
R
A
YC
U GU
U
U
G
A
U
R
R
A
A
G
Y
U
A
R
Y
G
A
R
R Y Y C A Y UA
A
C
R
G
C
U
Y
U
Y
GC
Y G
G
C
Y Y G
A
C
C
C
G
A
G
R
Y
Y
G
U
UU
U U U U5´
RACGUUCAY
C
C
Y
YY
R
G
G
RC
GCAYRA
Y
C
A
R
R
Y
C
A
Y
GG
AAC
G
G
G
G
R
Y
Y
U
G
R
R
5´
sucA Sr aD sxy R N A I P ur ine SA M -C hl cdiG M P 2 A nt i-Q G adY r nk ldr P r fA O m r A -B R yeB t r aJ 2 Sr aH 23Sm et h D S-p ep
U U C G G C C Y CG
C
R
R
C
G
YU
U YU
Y
C
G
Y
Y
G
CC
C U C U G C A YG
C
C
G
U
C
G
C
C
G
A
CGCAY
U
C
C
Y
A
U
U
CG
A
A Y Y G U
G
C
G
A
U
C
C
U
G
U
C
G
C CY
U
C
C
U
GC
G
G
C
G
C
G
G
C
5´ CG
Y
R
G
C
G
C
U
U
G
U
UA
U U
U
R
Y
Y
G C U
G
U
G
U
A
G U GUC
G
U
C
Y
YR
A R Y Y R G R R Y Y Y
A
A
A
C
C
C
C
G
C
C
Y
UU
Y
G
G
C
G
G
G
G
U
U
U
UG
C U U U U U5´
** C
U
U
A
C
C
G
G
A
G
GY
R
U
A
UGGAC
C
C
UG
A UC
C C AC
Y C C U
C
U
C
C
C
C G
A
UG
G
A
G
AA
U
Y
Y
YU
U
U
C
C
G
G
U
A
A
GC
C Y G Y C U Y Y
R
C
U
G
Y
Y
U
U
A
C
C
G
G UG
Y
G
U
A
A
G
G
C
A
G
UG
A C G U Y U5´
G
G
R
A
G
R
Y
R
Y
CU
G
GU G R
Y
C
G
G
C
U
UC
A AA
CC
GR
Y G
RR
G
Y
R
Y
Y
Y
Y
G
G
Y
RGG
U
U
C
G
AY
U
C
C
Y
RY
Y
C
U
Y
C
C
5´ U
G
A
C
C
C
U
U
U
A R
C
C
R
A
G
G
G
U
C
AC
C U A G C C A A C U G A C GU
U
G
U
U
AG
U
G
A
A
Y
YY
A
U
G
U
U
C
A
C A
RA
U
A
R
GC
C
A
A
U
C
G
C
U
U
U
G
C
G
R
U
U
G
GC
U U U U U U U U U5´ C U U A A UR
A
A
CAA
G
A
A
A
A
C
YAA
R C G
U
A
C
Y
U
U
C
C
Y C
C
U
G
AG
UU
C
A
G
G
C
U
G
G
A
A
UG
C
G
C A
CAG
C U RA
U U G U U G A U AA
G G G CU
ACUC
AUACCGACAA
GC
CAGU
G
A
A
G
C
G
AUG
A
AU
G
U
C
GG
U
U
CC
A C5´
R
U
Y
Y
RC
U
G
A
Y
GA
G
U
C
C
C
A
A AU
A
G
G
A
CGA
A
A C G C
GCGU
CY
G
R
A
U
5´ CU
C
C
A
U
GU
A
U
C
U
UU
G
G
G
A
C
C
U
G
U
C
A
GC
UG
U
G
G
C
A
G U
CU
C
C
C U
UC
C
U
A
G
CC
A
U
G
G
AA
G A G C A U A U U C UU
G
U
U
U
AU
U
G
G
C
A
A
A
GC
U
G
U
CA
C
C
A
U
UU
RA
U
U
G
G
UA
U
C
A
G
A U U
C
U
GAC
U
U
G
C
A
C
A
AG
U
A
A
C
AU
U C5´ C Y G G U U GG
U
G
G
C
G
C
A
C
U
U
C
C
Y
Y
A
C
G
G
G
C
G
G
U
G
U R
U
Y
A
CG
Y R Y U R Y R R Y A G A R R R A Y A C C
A
G
C
C
C
G
C
Y
RR
R
A
G
C
G
G
G
C
UU
U U U U5´
G
U
C
A
U
A
C
U
A
C
G
G
UG
C
A
A
Y
GY
R
RA
A
A
G
U A
A
AC
G
A
U
G
A
C
C C Y
A
RG
A
A
C
U
C
Y
RG
G U A
A A
A
U
R
CR
UAUC
A
A
A
A
U
G
Y
A
A
A
A
U
U
G
U
Y U G A C C U G G GR
UY
Y
UCCGGGUYRG
Y
U
Y
U
U
U
U
5´
U R U G C U A A C U R R R A A YG
U
U
G
Y
A U
R
Y
A
A
CCC
U
U
G
R
Y
G
C
U
U
A
U Y
CC
U
U
U
R
Y
C
A
A
GC
A U A U U A Y AR
C
G
R
U
C
G
Y
YA
A A G G A G A A A U G5´
U C R A A A G A A C AU
G
A
A
A
U
G
G
A
G
G
AGAAAUU
AC
A
GC
A A U U UA
UC
AR
C U
G
A
A
A
UU
A
U
AG
G
U
GU
AG
ACA
C A
UGUC
A
GC
R G UG
G
A
A
A
CAGUU
UC U A
UC
A A A A UU
A A AG
U
A
U
UUAG
A
G
AUUUU
C
C
U
C A
AA
U
U
U
C
AA
A U5´
ACAG
G
G
U
A
R
G
G
R
Y
Y
Y
Y
Y
UU
RU
R
R
R
R
R
Y
C
C
U
U
A
C
C
G
GR
UUUCU
C
A
A
R
U
Y
G
G
R
G
YA
AA
Y
C
C
G
R
U
U
G
RA
RUAUARAGGARG5´
CGYGUUA
U
A
U
G
CC
UU
U
A
U
U
G
UC
ACARUUYUUUUUYYG
Y
U
G
R
Y
C
A
U
U
G
GYAY
YA
U
U
R
A
U
U
Y
C
C
A
G
CR
AUAAAYG
A
C
A
A
G
C
C
C
G
A
A
C
RY
U
G
U
U
C
G
G
G
C
U
U
U
UU
UUURRUYA5´
Y Y Y AU
G
G
Y
G
G
Y
G
R
G
G
G
R
RCC
UU
Y
G GG Y
Y
G
C
C
G
GUU
C
C
YY
R
CCG
GU Y U RC
C
A
A
C
C
C
Y
Y
R
C
Y
R
C
C
AC
C Y5´
AUGGAYRU
G
C
G
C
A
GGA
A
G
C
G
CR
AAGACARACAGGGACACRYAGGRA
C
CCG
GA
UGGYGGRRYAGGAUGUCAGGRAACAGUCUGCA
A
A
G
C
C
C
C
G
C
YY
YG
G
C
G
G
G
G
U
U
U
U
5´
P s-R ho r nk ps M gsens t R N A S Q r r isr C H H 1 SN R 24 T r p ldr gr eA pr eQ 12 H A R 1F T er m L eu M icC C 4 R sm Y R ib osom e
Paul Gardner Engaging Scientists
3. What is Rfam?
A database of ncRNA alignments and structures
Used for annotating RNAs in genome sequences, bioinformatic
algorithm development and molecular evolutionary analyses
Gardner et al. (2008) Rfam: updates to the RNA families database
Nucleic Acids Research.
Paul Gardner Engaging Scientists
4. How can we keep textual descriptions of RNAs up to date?
AC RF00005
ID tRNA
CC Transfer RNA (tRNA) molecules are approximately 80 nucleotides in
CC length. Their secondary structure includes four short
CC double-helical elements and three loops (D, anti-codon, and T
CC loops). Further hydrogen bonds mediate the characteristic
CC L-shaped molecular structure. tRNAs have two regions of
CC fundamental functional importance: the anti-codon, which is
CC responsible for specific mRNA codon recognition, and the 3’ end,
CC to which the tRNAs corresponding amino acid is attached (by
CC aminoacyl-tRNA synthetases). tRNAs cope with the degeneracy of
CC the genetic code in two manners: having more than one tRNA (with
CC a specific anti-codon) for a particular amino acid; and ’wobble’
CC base-pairing, i.e. permitting non-standard base-pairing at the
CC 3rd anti-codon position.
RN [1]
RM 8256282
RT The tertiary structure of tRNA and the development of the genetic
RT code.
RA Hou YM;
RL Trends Biochem Sci 1993;18:362-364.
RN [2]
RM 9023104
RT tRNAscan-SE: a program for improved detection of transfer RNA genes
RT in genomic sequence.
RA Lowe TM, Eddy SR;
RL Nucleic Acids Res 1997;25:955-964.
Paul Gardner Engaging Scientists
6. WikiProject RNA
The WikiProjects are social corners of Wikipedia for interested
parties to discuss themed articles
Involved in reviewing, ranking and rating articles
Now rolled into the larger WikiProject Molecular and Cellular
Biology
Paul Gardner Engaging Scientists
7. How has the Wikipedia experiment gone?
x x x x
x
x
x x
x
x
x x x x x x x
x x x x x x x x
x x
x
x x x x x x x x x x
x
x x
x
x x x
0
2000
4000
6000
8000
10000
Number of Rfam pages edited
Year
Numberofedits
2007 2008 2009 2010 2011
9089
x x xxxxxxxxxxx xxxxxxxxxxxx xxxxx xx x
106
Total edits
Vandalism
Gardner et al. (2011) Rfam: Wikipedia, clans and the “decimal”
release Nucleic Acids Research.
Paul Gardner Engaging Scientists
8. Who are these Wikipedians donating their time?
Rfambot
Ppgardne
Citationbot1
WillowW
SmackBot
DOI_bot
Addbot
Alexbateman
Jebus989
JenniferRfm
Zashaw
Rjwilmsi
Qwyrxian
Yobot
RE73
Narayanese
RichFarmbrough
Addshore
Wgscott
MiRroar
RjwilmsiBot
Arcadian
DO11.10
Gortonk
Banus
Drmed36
FrescoBot
Boghog
Top 20 Rfam wikiproject editors
Numberofedits
0
200
400
600
800
1000
Bots
Proof Readers
Scientists
Paul Gardner Engaging Scientists
9. What incentives can we give to Academics?
Academics love publishing articles
Introducing the “families track” at RNA Biology
Publication requirements are an alignment & a Wikipedia
article
100s of new families have been added thanks to this track
Paul Gardner Engaging Scientists
10. Who else is now using this model?
Finn, Gardner, Bateman (2012) Making your database available
through Wikipedia: the pros and cons Nucleic Acids Research.
Paul Gardner Engaging Scientists
11. Wikipedia need you!
What is the highest impact contribution academics can make?
Rule 1: Register an Account
Rule 2: Learn the Five Pillars
ENCYC, NPOV, FREE, RESPECT, NORULES
Rule 3: Be Bold, but Not Reckless
Rule 4: Know Your Audience
Rule 5: Do Not Infringe Copyright
...
Paul Gardner Engaging Scientists
12. Who might be reading about your field?
Paul Gardner Engaging Scientists
13. Thanks!
The Rfam Consortium
Wikipedians & the long
tail!
PPG is supported by a Rutherford Discovery Fellowship from Government funding, administered by the Royal
Society of New Zealand.
Paul Gardner Engaging Scientists