SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
Challenges in business analytics
MiklÂŽos Koren
CEU
@korenmiklos
#ceubusinessanalytics
Budapest BI Meetup
Analytics is hard
Analytics is hard
Insights from ”Enterprise Data Analysis and Visualization: An
Interview Study” (Kandel et al, 2012)
1
”I spend more than half of my time integrating, cleansing and
transforming data without doing any actual analysis.”
2
”In practice it tends not to be just data prep, you are learning
about the data at the same time, you are learning about what as-
sumptions you can make.”
3
”We meet every week, we ïŹnd some interesting insight, and we say
that’s great. There’s no repeatable knowledge, so we end up
repeating the process 1 year later.”
4
”It is really hard to know where the data is. We have all the data,
but there is no huge schema where we can say this data is here
and this variable is there. ... It’s more like folklore. Knowledge is
transmitted as you join.”
”Diversity is pretty important. A generalist is more valuable than a
specialist. A specialist isn’t ïŹ‚uid enough. We look for pretty broad
skills and data passion.”
6
”Run a Hadoop job, then run a Hadoop job on results, then awk
it... Hadoop job chained to Hadoop job chained to a Python script
to actually process data.”
7
Why is analytics hard?
Statistics is hard.
People with diïŹ€erent skills interact.
Multiplicity of tools. (Natural for a fast evolving technology.)
Implementing a business analytics project is subject to the
economics of technology adoption.
8
Why is analytics hard?
Statistics is hard.
People with diïŹ€erent skills interact.
Multiplicity of tools. (Natural for a fast evolving technology.)
Implementing a business analytics project is subject to the
economics of technology adoption.
”Hard” pays oïŹ€ (Strata survey 2013).
Hackers, scripters and users
Application UserScripterHacker
Analytics
BiologyDatam
art
FinanceFinanceHealthcare
Healthcare
Healthcare
Healthcare
Insurance
M
arketing
M
arketing
NewsAggregator
RetailRetailSocialNetworking
SocialNetworking
SocialNetworking
Visualization
Visualization
W
ebW
eb
Analytics
Analytics
Analytics
FinanceHealthcare
M
ediaRetail
Finance
Insurance
RetailRetailSportsW
eb
Security
DiscoveryLocating Data x x x x x x x x x x x x x x x x x
Field Definitions x x x x x x x x x x x x x x x x
Wrangle Data Integration x x x x x x x x x x x x x x x x x x x x x x x
Parsing Semi-Structued x x x x x x x x x x x x x x x x x x x x x x x
Advanced Aggregation and Filtering x x x x x x x x x x x x x x x x
Profile Data Quality x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x
Verifying Assumptions x x x x x x x x x x x x x x x x x x x x x x x x
Model Feature Selection x x x x x x x x x x x x x x x x x x x x
Scale x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x
Advanced Analytics x x x x x x x x x x x x x x
Report Communicating Assumptions x x x x x x x x x x x x x x x x x
Static Reports x x x x x x x x x x x x x x x x x
Workflow Data Migration x x x x x x x x x x x x x x x x
Operationalizing Workflows x x x x x x x x x x x x x x x x
Database SQL x x x x x x x x x x x x x x x x x x x x x x x x x x x
Hadoop/Hive/Pig x x x x x x x
MongoDB x x
CustomDB x x x x x x x
Scripting Java x x x x x x x x x x x x x
Perl x x
Python x x x x x x x x x x x x x x x
Clojure x
Visual Basic x x x
Modeling R x x x x x x x x x x x x x x x x
Matlab x x x x x x x
SAS x x x x x
Excel x x x x x x x x x x x x x x x x x x x x x
ProcessTools
Fig. 1. Respondents, Challenges and Tools. The matrix displays interviewees (grouped by archetype and sector) and their corresponding chal-
lenges and tools. Hackers faced the most diverse set of challenges, corresponding to the diversity of their workïŹ‚ows and toolset. Application users
and scripters typically relied on the IT team to perform certain tasks and therefore did not perceive them as challenges.
Hackers reported using a variety of tools for visualization, includ-
ing statistical packages, Tableau, Excel, PowerPoint, D3, and Raphšael.
Six hackers viewed tools that produce interactive visualizations as re-
porting tools and not exploratory analytics tools. Since they could not
pivoting. I ask the IT team to pull it.
Application users typically worked on smaller data sets than the
other groups and generally did not export data from the spreadsheet 9
Data scientists using more tools are paid morenumber of tools. Median base salary is constant at $100k for those
using up to 10 tools, but increases with new tools after that.9
Given the two patterns we have just examined—the relationships be‐
10
This calls for an interdisciplinary approach
This calls for an interdisciplinary approach
Our new program puts equal weight on
1. statistics,
2. economics,
3. management,
4. and technology.
11
Program highlights
U.S. accredited M.Sc. degree (”feln˝ottk®epz®es” in progress)
1-year program, full time or part time
10 core courses and many electives
Final analytics project
Industry partners: IBM, Clementine Consulting, ...

Weitere Àhnliche Inhalte

Was ist angesagt?

Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its TrendsDr.K.Sreenivas Rao
 
Big data and data science overview
Big data and data science overviewBig data and data science overview
Big data and data science overviewColleen Farrelly
 
Data analytics
Data analyticsData analytics
Data analyticsBhanu Pratap
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...Edureka!
 
Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)heba_ahmad
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data scienceShilpaKrishna6
 
Big data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili SaghafiBig data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili SaghafiProfessor Lili Saghafi
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceMark West
 
Different Career Paths in Data Science
Different Career Paths in Data ScienceDifferent Career Paths in Data Science
Different Career Paths in Data ScienceRoger Huang
 
When Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic HappensWhen Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic HappensChase McMichael
 
Data Science Lifecycle
Data Science LifecycleData Science Lifecycle
Data Science LifecycleSwapnilDahake2
 
Exploring the Data science Process
Exploring the Data science ProcessExploring the Data science Process
Exploring the Data science ProcessVishal Patel
 
Data science
Data scienceData science
Data scienceSreejith c
 

Was ist angesagt? (20)

Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
 
Classification of data
Classification of dataClassification of data
Classification of data
 
Data analytics
Data analyticsData analytics
Data analytics
 
Big data and data science overview
Big data and data science overviewBig data and data science overview
Big data and data science overview
 
Data analytics
Data analyticsData analytics
Data analytics
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
 
Reports vs analysis
Reports vs analysisReports vs analysis
Reports vs analysis
 
Data analytics
Data analyticsData analytics
Data analytics
 
Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)
 
Data science 101
Data science 101Data science 101
Data science 101
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data science
 
Big data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili SaghafiBig data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili Saghafi
 
Data Analytics Life Cycle
Data Analytics Life CycleData Analytics Life Cycle
Data Analytics Life Cycle
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data Science
 
Data science
Data scienceData science
Data science
 
Different Career Paths in Data Science
Different Career Paths in Data ScienceDifferent Career Paths in Data Science
Different Career Paths in Data Science
 
When Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic HappensWhen Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic Happens
 
Data Science Lifecycle
Data Science LifecycleData Science Lifecycle
Data Science Lifecycle
 
Exploring the Data science Process
Exploring the Data science ProcessExploring the Data science Process
Exploring the Data science Process
 
Data science
Data scienceData science
Data science
 

Ähnlich wie Challenges in business analytics

Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxNagarajanG35
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
Agile Testing Days 2017 Intoducing AgileBI Sustainably - Excercises
Agile Testing Days 2017 Intoducing AgileBI Sustainably - ExcercisesAgile Testing Days 2017 Intoducing AgileBI Sustainably - Excercises
Agile Testing Days 2017 Intoducing AgileBI Sustainably - ExcercisesRaphael Branger
 
Bigdataanalytics
BigdataanalyticsBigdataanalytics
BigdataanalyticsHaroon Karim
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data ScienceSanghamitra Deb
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPeter Wang
 
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your DataCloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your DataCloudera, Inc.
 
Finding answers through visualization (GraphDay Barcelona Feb 2016)
Finding answers through visualization (GraphDay Barcelona Feb 2016)Finding answers through visualization (GraphDay Barcelona Feb 2016)
Finding answers through visualization (GraphDay Barcelona Feb 2016)Linkurious
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?Inside Analysis
 
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...Dana Gardner
 
Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricCambridge Semantics
 
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...Ali Alkan
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights Joe Lamantia
 
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-shareBigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-sharestelligence
 
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
Big Data Analytics - Best of the Worst : Anti-patterns & AntidotesBig Data Analytics - Best of the Worst : Anti-patterns & Antidotes
Big Data Analytics - Best of the Worst : Anti-patterns & AntidotesKrishna Sankar
 
Big data analytics 1
Big data analytics 1Big data analytics 1
Big data analytics 1gauravsc36
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxAbderrahmanABID2
 
Introduction to Data Analysis Course Notes.pdf
Introduction to Data Analysis Course Notes.pdfIntroduction to Data Analysis Course Notes.pdf
Introduction to Data Analysis Course Notes.pdfGraceOkeke3
 

Ähnlich wie Challenges in business analytics (20)

Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Agile Testing Days 2017 Intoducing AgileBI Sustainably - Excercises
Agile Testing Days 2017 Intoducing AgileBI Sustainably - ExcercisesAgile Testing Days 2017 Intoducing AgileBI Sustainably - Excercises
Agile Testing Days 2017 Intoducing AgileBI Sustainably - Excercises
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Bigdataanalytics
BigdataanalyticsBigdataanalytics
Bigdataanalytics
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data Science
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data Analysis
 
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your DataCloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
 
Finding answers through visualization (GraphDay Barcelona Feb 2016)
Finding answers through visualization (GraphDay Barcelona Feb 2016)Finding answers through visualization (GraphDay Barcelona Feb 2016)
Finding answers through visualization (GraphDay Barcelona Feb 2016)
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
 
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
Democratizing Advanced Analytics Propels Instant Analysis Results to the Ubiq...
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data Fabric
 
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
Intelligently Automating Machine Learning, Artificial Intelligence, and Data ...
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights
 
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-shareBigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
 
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
Big Data Analytics - Best of the Worst : Anti-patterns & AntidotesBig Data Analytics - Best of the Worst : Anti-patterns & Antidotes
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
 
Big data analytics 1
Big data analytics 1Big data analytics 1
Big data analytics 1
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
 
Introduction to Data Analysis Course Notes.pdf
Introduction to Data Analysis Course Notes.pdfIntroduction to Data Analysis Course Notes.pdf
Introduction to Data Analysis Course Notes.pdf
 

KĂŒrzlich hochgeladen

Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 

KĂŒrzlich hochgeladen (20)

Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 

Challenges in business analytics

  • 1. Challenges in business analytics MiklÂŽos Koren CEU @korenmiklos #ceubusinessanalytics Budapest BI Meetup
  • 3. Analytics is hard Insights from ”Enterprise Data Analysis and Visualization: An Interview Study” (Kandel et al, 2012) 1
  • 4. ”I spend more than half of my time integrating, cleansing and transforming data without doing any actual analysis.” 2
  • 5. ”In practice it tends not to be just data prep, you are learning about the data at the same time, you are learning about what as- sumptions you can make.” 3
  • 6. ”We meet every week, we ïŹnd some interesting insight, and we say that’s great. There’s no repeatable knowledge, so we end up repeating the process 1 year later.” 4
  • 7. ”It is really hard to know where the data is. We have all the data, but there is no huge schema where we can say this data is here and this variable is there. ... It’s more like folklore. Knowledge is transmitted as you join.”
  • 8. ”Diversity is pretty important. A generalist is more valuable than a specialist. A specialist isn’t ïŹ‚uid enough. We look for pretty broad skills and data passion.” 6
  • 9. ”Run a Hadoop job, then run a Hadoop job on results, then awk it... Hadoop job chained to Hadoop job chained to a Python script to actually process data.” 7
  • 10. Why is analytics hard? Statistics is hard. People with diïŹ€erent skills interact. Multiplicity of tools. (Natural for a fast evolving technology.) Implementing a business analytics project is subject to the economics of technology adoption. 8
  • 11. Why is analytics hard? Statistics is hard. People with diïŹ€erent skills interact. Multiplicity of tools. (Natural for a fast evolving technology.) Implementing a business analytics project is subject to the economics of technology adoption. ”Hard” pays oïŹ€ (Strata survey 2013).
  • 12. Hackers, scripters and users Application UserScripterHacker Analytics BiologyDatam art FinanceFinanceHealthcare Healthcare Healthcare Healthcare Insurance M arketing M arketing NewsAggregator RetailRetailSocialNetworking SocialNetworking SocialNetworking Visualization Visualization W ebW eb Analytics Analytics Analytics FinanceHealthcare M ediaRetail Finance Insurance RetailRetailSportsW eb Security DiscoveryLocating Data x x x x x x x x x x x x x x x x x Field Definitions x x x x x x x x x x x x x x x x Wrangle Data Integration x x x x x x x x x x x x x x x x x x x x x x x Parsing Semi-Structued x x x x x x x x x x x x x x x x x x x x x x x Advanced Aggregation and Filtering x x x x x x x x x x x x x x x x Profile Data Quality x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x Verifying Assumptions x x x x x x x x x x x x x x x x x x x x x x x x Model Feature Selection x x x x x x x x x x x x x x x x x x x x Scale x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x Advanced Analytics x x x x x x x x x x x x x x Report Communicating Assumptions x x x x x x x x x x x x x x x x x Static Reports x x x x x x x x x x x x x x x x x Workflow Data Migration x x x x x x x x x x x x x x x x Operationalizing Workflows x x x x x x x x x x x x x x x x Database SQL x x x x x x x x x x x x x x x x x x x x x x x x x x x Hadoop/Hive/Pig x x x x x x x MongoDB x x CustomDB x x x x x x x Scripting Java x x x x x x x x x x x x x Perl x x Python x x x x x x x x x x x x x x x Clojure x Visual Basic x x x Modeling R x x x x x x x x x x x x x x x x Matlab x x x x x x x SAS x x x x x Excel x x x x x x x x x x x x x x x x x x x x x ProcessTools Fig. 1. Respondents, Challenges and Tools. The matrix displays interviewees (grouped by archetype and sector) and their corresponding chal- lenges and tools. Hackers faced the most diverse set of challenges, corresponding to the diversity of their workïŹ‚ows and toolset. Application users and scripters typically relied on the IT team to perform certain tasks and therefore did not perceive them as challenges. Hackers reported using a variety of tools for visualization, includ- ing statistical packages, Tableau, Excel, PowerPoint, D3, and Raphšael. Six hackers viewed tools that produce interactive visualizations as re- porting tools and not exploratory analytics tools. Since they could not pivoting. I ask the IT team to pull it. Application users typically worked on smaller data sets than the other groups and generally did not export data from the spreadsheet 9
  • 13. Data scientists using more tools are paid morenumber of tools. Median base salary is constant at $100k for those using up to 10 tools, but increases with new tools after that.9 Given the two patterns we have just examined—the relationships be‐ 10
  • 14. This calls for an interdisciplinary approach
  • 15. This calls for an interdisciplinary approach Our new program puts equal weight on 1. statistics, 2. economics, 3. management, 4. and technology. 11
  • 16. Program highlights U.S. accredited M.Sc. degree (”feln˝ottkÂŽepzÂŽes” in progress) 1-year program, full time or part time 10 core courses and many electives Final analytics project Industry partners: IBM, Clementine Consulting, ...