Learning to make use of Jupyter to document your Data Science process - real time - and in whatever programming language you want! Using this methodology will allow you to provide insights that help your organization make better decisions to solve their business problems.
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Practical Data Science the WPC Healthcare Strategy for Delivering Meaningful Data Science Projects
1. 2015 Healthcare Data Science
Practical Data Science: The WPC Healthcare Strategy for
Delivering Meaningful Data Science Projects
Damian Mingle
@OPENDATASCI
4. What’s the Problem?
3
A Common Scenario: Johnny Data Scientist
• Does not like working with others
• Too much black magic – not enough explanation
• His process is always different
• Uses multiple languages
• Hates producing presentations
• Constant unclear project status
• Doesn’t capture business needs
• Models aren’t production quality
5. Why Jupyter?
4
Interactive Computing Environment
• Notebook Web Application: Writing and running code
interactively
• Kernels: Over 40 programming languages
• Notebook Documents: Self-contained documents which
include: Live code, Interactive widgets, Plots, Narrative text,
Equations, Images, and Video
6. Why a Data Science Methodology?
5
Data Science Projects Involve Risk
• Strategically: Provides confidence to the business that Data
Science projects can be delivered profitably
• Tactically: Management can understand status assessments
• Operationally: Empowers the Data Science team to do the
right thing, the right way, the first time.
7. Business Understanding
Uncover important factors at the Start
• Determine business objectives
• Assess situation
• Determine data science goals
• Produce project plan
Understand the
Data Science
project objectives
and requirements
from a business
perspective. Then
convert this
knowledge into a
Data Science
problem definition
and preliminary
plan designed to
achieve the
objective.
6
9. Data Understanding
Become familiar with the data
• Collect initial data
• Describe data
• Explore data
• Verify data quality
Identify data
quality problems,
discover first
insights into the
data, and/or
detect interesting
subsets to form
hypotheses
regarding hidden
information.
8
11. Data Preparation
Construct the Final Dataset
• Select data
• Clean data
• Construct data
• Integrate data
• Format data
Data Science task
in this phase have
to do with
selection of table,
record, and
attributes. In
addition,
transformation
and cleaning of
data.
10
13. Modeling
Various Modeling Techniques Are Selected
• Select modeling technique
• Generate test design
• Build model
• Assess model
In this phase,
calibrating
parameters is
important. Some
techniques may
require the Data
Scientist to go
back to the data
preparation phase.
12
15. Evaluation
Review Your Steps with Certainty
• Evaluate results
• Review process
• Determine next steps
At the end of this
phase, a decision
on the use of the
data science
results should be
reached.
14
17. Deployment
Make Use of The Model
• Plan deployment
• Plan monitoring and maintenance
• Produce final report
• Review project
This phase can be
as simple as
generating a
report or as
complex as
implementing a
repeatable data
science process
across the
enterprise.
16
19. Data Scientist 2.0
18
Lead Analytically Your Organization
• Use Jupyter to document your process – real time – using
whatever language you want!
• Establish a Data Science Methodology that is comprehensive
• Provide insights that help the organization make better
decisions to solve their business problems
We primarily focus on clinical, financial, and operational data. We work with both Payers and Providers in Healthcare.
A sample of some of the clients that we work with…
In a business context don’t be a “Johnny Data Scientist”.
Some benefits:
It gets things out into the open
It serves as a great placeholder
All your analysis can be in a single-place; even if you work with other languages.
Some benefits:
Having a method allows you to be creative
Having a method allows a business only person follow along
Having a method allows you to create learnings that you can reuse for future data science projects