The document proposes a vision to build a health data infrastructure that shares every public health file from California counties, nonprofits, and state and national organizations. It describes a process to convert datasets to common formats, clean the data, and distribute it to websites and analysts' desktops for easy analysis. This would be done through an industrial data management system called Ambry.io at a cost of $1000 per month per county. The system involves data wranglers converting datasets for $100 each and systems administrators selecting datasets for analysts. This would make health data easier to find, understand, and use to allow more analysis and coordination between organizations.
San Diego Data Library HCDS Idea-thon
1. Ask a data question at lunch,
answer it in the afternoon.
The Vision
Build a Health Data
Infrastructure that shares
every public file everywhere.
The Idea
Eric Busboom & Michael Samuel, San Diego Regional Data Library
I want to start not with an idea but with a vision: that anyone in the health fields who has basic skills with Excel and the internet can ask a data question at lunch and answer it in the afternoon. If you've tried to answer simple questions, such as "What is the vaccination rate by community, and what are the correlates of low rates?", you know that if the data or a report isn't already on your desk, the question can be difficult or impossible to answer without a lot of effort or professional help.
1 Health Data Ideathon Presentation.key - June 10, 2014
2. • Every public health file.
• From every CA County and
Nonprofit, State & National
• On every county's website
and every analyst's desktop.
• Cleaned and prepared, right
format
A lot of data
From a lot of sources
In a familiar place
Ready to analyze
To the question of what data? All of it. We really want to incorporate every public health data file in the state. And we'll make it usable by ensuring the data is converted to the file formats people most need and already use, such as CSV, STATA, SAS, or SQL databases. That data, in the right formats, can be sent directly to analysts' websites or even desktops, the places where they already know to get files.
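As a rough illustration of the conversion step, here is a minimal sketch that loads a CSV file into a SQL database using only the Python standard library. It is not Ambry's own code or API; the function name `load_csv_into_sqlite`, the table name, and the sample rows are all hypothetical and made up for the example.

```python
import csv
import io
import sqlite3

# Made-up sample rows for illustration only; a real run would read a
# published county file instead.
SAMPLE_CSV = """community,year,vaccination_rate
Community A,2013,0.91
Community B,2013,0.84
"""


def load_csv_into_sqlite(csv_text, conn, table):
    """Load CSV text into a SQLite table, using the header row as column names."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    rows = [row for row in reader if row]  # skip blank lines
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', rows)
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")
n_loaded = load_csv_into_sqlite(SAMPLE_CSV, conn, "vaccination")

# Once the data is in SQL, the "lunchtime question" becomes a one-line query.
low = conn.execute(
    "SELECT community FROM vaccination "
    "WHERE CAST(vaccination_rate AS REAL) < 0.9"
).fetchall()
print(n_loaded, low)
```

The same cleaned table could then be exported to whichever other formats a given analyst prefers.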
3. Specialization
Standard Packages
Mass Distribution
How?: Industrial Process For Data
Details?: Ambry, Business Model
Ambry.io, Data Management
Convert Dataset, $100 per
$1000 / Mo / County, 25 Counties
This is an audacious goal, but it is based on sound principles, and it already works. We've developed an industrial process for data, with an open source data management system, Ambry.io. Using Ambry, we can build a business around deploying public data, at a break-even cost of about $1000 per month per California county.
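The figures on this slide can be worked through in a few lines. This is only back-of-the-envelope arithmetic: the three constants come from the slide, while the assumption that the entire budget could go to dataset conversion is ours and gives an upper bound, not a forecast.

```python
MONTHLY_FEE_PER_COUNTY = 1000  # dollars per month per county, from the slide
COUNTIES = 25                  # number of counties, from the slide
COST_PER_DATASET = 100         # dollars per converted dataset, from the slide

monthly_budget = MONTHLY_FEE_PER_COUNTY * COUNTIES
annual_budget = monthly_budget * 12

# Upper bound only: assumes every dollar is spent on conversion, which
# ignores hosting, administration, and other operating costs.
max_datasets_per_year = annual_budget // COST_PER_DATASET

print(f"Monthly budget: ${monthly_budget:,}")
print(f"Annual budget: ${annual_budget:,}")
print(f"Conversions per year (upper bound): {max_datasets_per_year:,}")
```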
4. Database
Inputs Library Repos Use
Data Wranglers
Sys Admins
Data
Analysts
Our system involves the distributed conversion of datasets, at a cost of about $100 per dataset, by data wranglers who have common programming skills. An organization's systems administrators can select datasets from a library of all sets, so analysts only have to see the files they use most, and those data are already cleaned and ready to use.
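The division of labor described above can be sketched as a small data structure: wranglers add converted datasets to a shared library, and a systems administrator selects the subset that an organization's analysts will see. This is an illustrative model only, not Ambry's actual design; the `Dataset` and `Library` classes and all dataset names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Dataset:
    name: str       # hypothetical identifier
    source: str     # publishing organization
    formats: tuple  # formats the wrangler produced


@dataclass
class Library:
    datasets: dict = field(default_factory=dict)

    def add(self, dataset):
        """Called by a data wrangler after converting and cleaning a file."""
        self.datasets[dataset.name] = dataset

    def select(self, names):
        """Called by a sys admin to choose what local analysts will see."""
        missing = [n for n in names if n not in self.datasets]
        if missing:
            raise KeyError(f"not in library: {missing}")
        return [self.datasets[n] for n in names]


library = Library()
library.add(Dataset("immunization-2013", "Example County", ("csv", "sql")))
library.add(Dataset("births-2012", "Example State Agency", ("csv",)))
library.add(Dataset("air-quality-2013", "Example Nonprofit", ("csv", "sas")))

# The analysts' view contains only the files their organization uses most.
analyst_view = library.select(["immunization-2013", "births-2012"])
print([d.name for d in analyst_view])
```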
5. • Data becomes comprehensible,
comparable.
• More analysis, less talking.
Coordinate without
communication.
• Syndicate kidsdata.org, 50% the
cost.
• Already exists, and works.
sandiegodata.org/hdi & ambry.io
This health data infrastructure makes data easier to find, understand, and use, allowing counties to share data with less effort and communication. It makes it easy to send data not only to analysts but also to databases, so data-driven indicator sites can be built much less expensively. Best of all, the system is already working and is ready to be tested in a pilot project.