Beyond Management: Data Curation as Scholarship in Archaeology

Beyond Management:
Data Curation as Scholarship
in Archaeology
Sarah Whitcher Kansa,
The Alexandria Archive Institute
with Anne Austin, Ixchel Faniel,
Beth Yakel, Ran Boytner,
Eric Kansa, Jennifer Jacobs
& Phoebe France


Open Context: 10 years of iterative development

Linked: Links with other systems & data (tDAR, EOL, ORCID, etc)

Open: Code, data (mainly CC-By) on GitHub, machine-readable formats, APIs

Long-term: NSF, NEH data management. California Digital Library archiving.

Global: Mirroring, collaboration with the German Archaeological Institute (DAI)

Recognition: Awards from Digital Curation (2014),

Archaeological Institute of America (2016), and the White House (2013)


Open Context: 10 years of iterative development

Linked: Links with other systems & data (tDAR, EOL, ORCID, etc)

Open: Code, data (mainly CC-By) on GitHub, machine-readable formats, APIs

Long-term: NSF, NEH data management. California Digital Library archiving.

Global: Mirroring, collaboration with the German Archaeological Institute (DAI)

Recognition: Awards from Digital Curation (2014),

Archaeological Institute of America (2016), and the White House (2013)

Why a Publishing Metaphor?
1. Editorial (curatorial) co-production
2. Promote vision of data as more than a
“residue” of research
Why a Publishing Metaphor?
1. Editorial (curatorial) co-production
2. Promote vision of data as more than a
“residue” of research

Data
Reuse
Data
Creation
Usable / Useful Data
Tension between what data creators do
and what data reusers need.


Need to better align data creation and reuse!

Tension between what data creators do
and what data reusers need.


Need to better align data creation and reuse!

Important to get it right
because you can only
excavate a site once!
Data issues are central
to the practice of
research in the 21st
century.
Important to get it right
because you can only
excavate a site once!
Data issues are central
to the practice of
research in the 21st
century.

Create
Process
AnalyzePreserve
Share
Reuse
SLO-data (Secret Life of Data)
●
Builds on previous qualitative
research (DIPIR, NEH, EOL)
●
Explores relationships between
data creation practices and reuse
●
Better align data creation with
reuse needs
●
Encourage more thoughtful data
creation and dissemination,
training in digital literacy
●
Guidance, exemplars, and
“recipes” for creating high-
quality, usable data
●
Builds on previous qualitative
research (DIPIR, NEH, EOL)
●
Explores relationships between
data creation practices and reuse
●
Better align data creation with
reuse needs
●
Encourage more thoughtful data
creation and dissemination,
training in digital literacy
●
Guidance, exemplars, and
“recipes” for creating high-
quality, usable data

SLO-data (Secret Life of Data)
Researcher
Interviews
Field
Observations
Database
Analysis
Reuser
Interviews

https://alexandriaarchive.org/secret-life-of-data/
https://alexandriaarchive.org/secret-life-of-data/

Year 1 observations
explored how data are:
●
Structured
●
Stored
●
Accessed
●
Modified
●
Backed-up/Secured
●
Made Consistent
●
Identified
…or not!
Year 1 observations
explored how data are:
●
Structured
●
Stored
●
Accessed
●
Modified
●
Backed-up/Secured
●
Made Consistent
●
Identified
…or not!


Central control vs. local control
Central control vs. local control


Field conditions
Field conditions


Identifier consistency
Identifier consistency


Recording incompatibilities
between paper and database

Recording incompatibilities
between paper and database


How to maintain knowledge
continuity when team has
contingent involvement.

How to maintain knowledge
continuity when team has
contingent involvement.

Recommendations:
●
Broader data literacy & more
formal processes
●
Establish expectations and
protocols for specialists to share
data
●
Better identifier management
●
Data validation
●
Promote shared controlled
vocabularies at the time of data
creation (especially referenced by
URIs for Linked Data).
Recommendations:
●
Broader data literacy & more
formal processes
●
Establish expectations and
protocols for specialists to share
data
●
Better identifier management
●
Data validation
●
Promote shared controlled
vocabularies at the time of data
creation (especially referenced by
URIs for Linked Data).

Year 2 & Beyond:
●
Provide recommendations for workflow and
database changes; Observe in field
●
Establish expectations from a project's inception
(including training, specialists, database)
●
Code/analyze Year 2 field & data reuser
interviews
●
Create exemplars and guidelines (universal
handbook for data management)
Year 2 & Beyond:
●
Provide recommendations for workflow and
database changes; Observe in field
●
Establish expectations from a project's inception
(including training, specialists, database)
●
Code/analyze Year 2 field & data reuser
interviews
●
Create exemplars and guidelines (universal
handbook for data management)

Beyond Management: Data Curation as Scholarship in Archaeology

Recommended

Recommended

More Related Content

What's hot

What's hot (12)

Similar to Beyond Management: Data Curation as Scholarship in Archaeology

Similar to Beyond Management: Data Curation as Scholarship in Archaeology (20)

Recently uploaded

Recently uploaded (20)

Beyond Management: Data Curation as Scholarship in Archaeology