The document describes stable isotope data from algal samples collected from Wash Cresc Lake. It includes a table with sample identifiers, weights, carbon and nitrogen composition percentages, delta C-13 and delta N-15 values, and calibration corrections for delta values. The table contains stable isotope measurements for 23 algal samples. Additional context is provided by notes that it is stable isotope data from algal samples collected on December 16th and references are given for the standard deviations of delta values.
Unblocking The Main Thread Solving ANRs and Frozen Frames
Stable Isotope Data from Wash Cres Lake Algal Samples
1. From
Calisphere,
Couretsy
of
UC
Riverside,
California
Museum
of
Photography
From
Calisphere,
Courtesy
of
Thousand
Oaks
Library
Data
Management
for
Researchers
Tips,
Tools,
&
Why
You
Should
Care
Carly
Strasser,
PhD
California
Digital
Library
@carlystrasser
carly.strasser@ucop.edu
Cal
Poly
Oct
2013
2. Roadmap
4. Toolbox
3. Best
practices
2. Why
you
should
care
1. Background
3. What
role
can
libraries
play
in
data
education?
NSF
funded
DataNet
Project
Office
of
Cyberinfrastructure
Why
don’t
people
share
data?
Do
attitudes
about
sharing
differ
among
disciplines?
What
barriers
to
sharing
can
we
eliminate?
Is
data
management
being
taught?
How
can
we
promote
storing
data
in
repositories?
4. What
role
can
libraries
play
in
data
education?
Why
don’t
people
share
data?
Do
attitudes
about
sharing
differ
among
disciplines?
What
barriers
to
sharing
can
we
eliminate?
Is
data
management
being
taught?
How
can
we
promote
storing
data
in
repositories?
5.
6. From
Calisphere
via
Santa
Clara
University,
ark:/13030/kt696nc7j2
A
Brief
History
of
Data
Collection
Or…
how
scientists
came
to
be
so
bad
at
data
management
7. Back in the day…
Curie
Newton
Da
Vinci
classicalschool.blogspot.com
Darwin
8. From
Flickr
by
US
Army
Environmental
Command
From
Flickr
by
deltaMike
From
Flickr
by
DW0825
From
Flickr
by
Flickmor
Courtesey
of
WHOI
Digital
data
C.
Strasser
15. Because
they
care:
All
data
must
be
in
a
public
archive.
You
can’t
hoard
it.
If
it’s
not
available
you
can’t
cite
it.
Include
a
data
section
with
how
to
find
datasets.
16. Data
Management:
Who
Knew
Could
be
a
Hot
Topic?
r!
te
La
From
Flickr
by
Velo
Steve
Carly
Strasser,
PhD
California
Digital
Library
@carlystrasser
Cal
Poly
Oct
2013
23. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
24. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
25. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
26. 2.
Data
collection
&
organization
Create
unique
identifiers
From
Flickr
by
zebbie
• Decide
on
naming
scheme
early
• Create
a
key
• Different
for
each
sample
From
Flickr
by
sjbresnahan
27. 2.
Data
collection
&
organization
Standardize
• Consistent
within
columns
– only
numbers,
dates,
or
text
• Consistent
names,
codes,
formats
Modified
from
K.
Vanderbilt
From
Pink
Floyd,
The
Wall
themurkyfringe.com
28. 2.
Data
collection
&
organization
Standardize
• Reduce
possibility
of
manual
error
by
constraining
entry
choices
Excel
lists
Google
Docs
Data
Forms
validataion
Modified
from
K.
Vanderbilt
29. 2.
Data
collection
&
organization
Create
parameter
table
Create
a
site
table
From
doi:10.3334/ORNLDAAC/777
From
doi:10.3334/ORNLDAAC/777
From
R
Cook,
ESA
Best
Practices
Workshop
2010
30. 2.
Data
collection
&
organization
What
about
databases?
A
relational
database
is
A
set
of
tables
Relationships
among
the
tables
A
language
to
specify
&
query
the
tables
A
RDB
provides
Scalability:
millions+
records
Features
for
sub-‐setting,
querying,
sorting
Reduced
redundancy
&
entry
errors
From
Mark
Schildhauer
31. 2.
Data
collection
&
organization
You
should
invest
time
in
learning
databases
if
your
data
sets
are
large
or
complex
Consider
investing
time
in
learning
databases
if
your
data
are
small
and
humble
you
ever
intend
to
share
your
data
you
are
<
30
years
old
From
Mark
Schildhauer
32. 2.
Data
collection
&
organization
Use
descriptive
file
names
*
• Unique
• Reflect
contents
Bad:
Mydata.xls
2001_data.csv
best
version.txt
Better:
Eaffinis_nanaimo_2010_counts.xls
Study
organism
Site
name
Year
What
was
measured
*Not
for
everyone
From
R
Cook,
ESA
Best
Practices
Workshop
2010
33. 2.
Data
collection
&
organization
Organize
files
logically
Biodiversity
Lake
Experiments
Biodiv_H20_heatExp_2005to2008.csv
Biodiv_H20_predatorExp_2001to2003.csv
…
Field
work
Biodiv_H20_PlanktonCount_2001toActive.csv
Biodiv_H20_ChlAprofiles_2003.csv
…
Grassland
From
S.
Hampton
34. 2.
Data
collection
&
organization
Preserve
information
• Keep
raw
data
raw
• Use
scripts
to
process
data
&
save
them
with
data
Raw
data
as
.csv
R
script
for
processing
&
analysis
35. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
36. 3.
Quality
control
and
quality
assurance
Before
data
collection
From
Flickr
by
StacieBee
• Define
&
enforce
standards
• Assign
responsibility
for
data
quality
37. 3.
Quality
control
and
quality
assurance
After
data
entry
• Check
for
missing,
impossible,
anomalous
values
• Perform
statistical
summaries
• Look
for
outliers
60
50
40
30
20
10
0
0
10
20
30
40
38. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
40. 4.
Metadata
basics
From
Flickr
by
//ichael
Patric|{
Metadata
=
Data
reporting
WHO
created
the
data?
WHAT
is
the
content
of
the
data
set?
WHEN
was
it
created?
WHERE
was
it
collected?
HOW
was
it
developed?
WHY
was
it
developed?
41. 4.
Metadata
basics
•
Scientific
context
•
•
Environmental
conditions
during
collection
•
Where
collected
&
spatial
resolution
When
collected
&
temporal
resolution
•
The
name(s)
of
the
data
file(s)
in
the
data
set
What
instruments
(including
model
&
serial
number)
were
used
Standards
or
calibrations
used
Name
of
the
data
set
•
What
data
were
collected
•
Digital
context
Scientific
reason
why
the
data
were
collected
•
•
•
•
Date
the
data
set
was
last
modified
•
Example
data
file
records
for
each
data
type
file
•
Pertinent
companion
files
•
•
Information
about
parameters
List
of
related
or
ancillary
data
sets
How
each
was
measured
or
produced
•
Software
(including
version
number)
used
to
prepare/read
the
data
set
Units
of
measure
•
•
Format
used
in
the
data
set
•
•
•
Data
processing
that
was
performed
•
Precision
&
accuracy
if
known
Personnel
&
stakeholders
•
•
Quality
assurance
&
control
measures
•
Funders
Definitions
of
codes
used
•
Who
to
contact
with
questions
•
Information
about
data
•
Who
collected
•
•
Known
problems
that
limit
data
use
(e.g.
uncertainty,
sampling
problems)
How
to
cite
the
data
set
42. 4.
Metadata
basics
What
is
metadata?
Select
the
appropriate
standard
• Provides
structure
to
describe
data
Common
terms
|
definitions
|
language
|
structure
• Lots
of
different
standards
EML
,
FGDC,
ISO19115,
DarwinCore,…
• Tools
for
creating
metadata
files
Morpho
(EML),
Metavist
(FGDC),
NOAA
MERMaid
(CSGDM)
43. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
44. 5.
Workflows
Workflow:
how
you
get
from
the
raw
data
to
the
final
products
of
your
research
Simple
workflows:
flow
charts
Temperature
data
Salinity
data
“Clean”
T
&
S
data
Data
import
into
R
Data
in
R
format
Quality
control
&
data
cleaning
Analysis:
mean,
SD
Graph
production
Summary
statistics
45. 5.
Workflows
Workflow:
how
you
get
from
the
raw
data
to
the
final
products
of
your
research
Simple
workflows:
commented
scripts
• R,
SAS,
MATLAB
• Well-‐documented
code
is…
Easier
to
review
Easier
to
share
Easier
to
repeat
analysis
%
#
$
&
47. 5.
Workflows
Workflows
enable
Reproducibility
Transparency
Executability
From
Flickr
by
merlinprincesse
48. 5.
Workflows
Minimally:
document
your
analysis
commented
code;
simple
flow-‐chart
www.littlebytesoflife.com
Emerging
workflow
applications
will…
− Link
software
for
executable
end-‐to-‐end
analysis
− Provide
detailed
info
about
data
&
analysis
− Facilitate
re-‐use
&
refinement
of
complex,
mn:
o ulti-‐step
o
analyses
ng
S aring
omi
sh
C
− Enable
efficient
swapping
of
alternative
models
s!
kflow ent &
r
wo
algorithms
uirem
req
− Help
automate
tedious
tasks
49. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
50. 6.
Data
stewardship
&
reuse
From
Flickr
by
greensambaman
The
20-‐Year
Rule
The
metadata
accompanying
a
data
set
should
be
written
for
a
user
20
years
into
the
future
RULE
(National
Research
Council
1991)
51. 6.
Data
stewardship
&
reuse
Use
stable
formats
csv,
txt,
tiff
Create
back-‐up
copies
original,
near,
far
Periodically
test
ability
to
restore
information
Modified from R. Cook
52. 6.
Data
stewardship
&
reuse
Store
your
data
in
a
repository
Institutional
archive
Ask
a
librarian
Discipline/specialty
archive
Repos
of
repos:
databib.org
re3data.org
From
Flickr
by
torkildr
53. 6.
Data
stewardship
&
reuse
Practice
Data
Citation
Example:
Sidlauskas,
B.
2007.
Data
from:
Testing
for
unequal
rates
of
morphological
diversification
in
the
absence
of
a
detailed
phylogeny:
a
case
study
from
characiform
fishes.
Dryad
Digital
Repository.
doi:10.5061/dryad.20
Persistent
Unique
Identifier
Allows
readers
to
find
data
products
Get
credit
for
data
and
publications
Promotes
reproducibility
Better
measure
of
research
impact
54. From
Flickr
by
Big
Swede
Guy
Best
Practices
ent
data managem
1. Planning
2. Data
collection
&
organization
3. Quality
control
&
assurance
4. Metadata
5. Workflows
6. Data
stewardship
&
reuse
55. What
is
a
data
management
plan?
A
document
that
describes
what
you
will
do
with
your
data
throughout
the
research
project
From Flickr by Barbies Land
56. From
Flickr
by
401(K)
2013
DMP
for
funders:
A
short
plan
submitted
alongside
grant
applications
An
outline
of
– what
will
be
collected
– methods
– Standards
But they all have
– Metadata
different requirements
– sharing/access
and express them in
– long-‐term
storage
different ways
Includes
how
and
why
57. NSF
DMP
Requirements
From
Grant
Proposal
Guidelines:
DMP
supplement
may
include:
1. the
types
of
data,
samples,
physical
collections,
software,
curriculum
materials,
and
other
materials
to
be
produced
in
the
course
of
the
project
2.
the
standards
to
be
used
for
data
and
metadata
format
and
content
(where
existing
standards
are
absent
or
deemed
inadequate,
this
should
be
documented
along
with
any
proposed
solutions
or
remedies)
3.
policies
for
access
and
sharing
including
provisions
for
appropriate
protection
of
privacy,
confidentiality,
security,
intellectual
property,
or
other
rights
or
requirements
4.
policies
and
provisions
for
re-‐use,
re-‐distribution,
and
the
production
of
derivatives
5.
plans
for
archiving
data,
samples,
and
other
research
products,
and
for
preservation
of
access
to
them
58. From
Flickr
by
OZinOH
The
Data
Management
Planning
Tool
DMPTool
Carly
Strasser
|
@carlystrasser
California
Digital
Library
5
August
2013
ESA
2013
SS
2
67. From
Flickr
by
mikerosebery
There
is
a
social
contract
of
science:
we
have
an
obligation
to
ensure
dissemination,
validation,
&
advancement.
To
not
do
so
is
science
malpractice.
–
Brian
Hole,
Ubiquity
Press
at
UCL
68. My
website
Email
me
Tweet
me
My
slides
carlystrasser.net
carlystrasser@gmail.com
@carlystrasser
slideshare.net/carlystrasser