Dat is a non-profit that provides tools for versioning and sharing data. It allows researchers to import, version, publish, and replicate data through a simple workflow and command line interface. Dat uses decentralized peer-to-peer technology so data can be cloned, pulled, and pushed between researchers' computers. This facilitates improved collaboration and reproducibility in research.
1. Dat: version and share
your data
Karissa McKelvey
Software Developer and Project Manager
and Science Evangelist and Designer (I wear a lot of hats)
U.S. Open Data
@karissamck
37. Versioning
$ dat add files cities us_cities_viz.png
This will override us_cities_viz.png at
c2342. OK?
$ dat cities add rows updated_data.csv
This will update 3,434,245 rows. OK?
$ dat push
44. Datscript: example commands
background
- executes command, but doesnât wait for it to ïŹnish
âš
map
- pipes ïŹrst argument into rest of arguments
run
- a serial command (executes and ïŹnishes command)