SlideShare ist ein Scribd-Unternehmen logo
1 von 12
SNJB’S LATE SAU K. B. JAIN COE, CHANDWAD
DEPARTMENT OF COMPUTER ENGINEERING
ACADEMIC YEAR 2020-21
SUBJECT: DATAANALYTICS
CASE STUDY ON GINA
By
Name: Divya Prafull Wani
Roll No:41
Class: BE Computer
Date:16-08-2020
Case StudyOn GINA
1
GINA
■ Global Innovation Network and Analysis.
■ The GINA case study provides an example of how a team applied the Data Analytics
Lifecycle to analyze innovation data at EMC.
■ GINA is a group of senior technologists located in centers of excellence around the world.
■ The GINA team thought its approach would provide a means to share ideas globally and
increase knowledge sharing among members who may be separated geographically.
■ It planned to create a data repository containing both structured and unstructured data to
accomplish 3 main goals .
1. Store Formal and informal data.
2. Track research from global technologists.
3. Mine the data for patterns and insights to improve the teams operations and strategy.
Case StudyOn GINA 2
Phase 1 : Discovery
■ Team Members and Roles
 Business user, project sponsor, project manager -Vice President from Office of CTO.
 BI analyst – person from IT
 Data Engineer and DBA – people from IT
 Data Scientist – distinguished engineer.
■ The data fell into two categories
 5 years of idea submissions from internal innovation contests.
 Minutes an notes representing innovation and research activity from around the world.
■ The data fell into two categories
 5 years of idea submissions from internal innovation contests.
 Minutes an notes representing innovation and research activity from around the world.
■ Hypothesis grouped into 2 categories
 Descriptive analytics of what is happening to spark further creativity, collaboration, an asset generation
 Predictive analytics to advise executive management of where it should be investing in the future.
Case StudyOn GINA 3
Phase 2 : Data Preparation
■ Set up anAnalytical Sandbox to store and experiment on the data.
■ Discovered that certain data needed conditioning and normalization and that missing
datasets were critical.
■ Team recognized that poor quality data could impact subsequent steps.
■ They discovered many names were misspelled and problems with extra spaces.
■ Important to determine what level of data quality and cleanliness was sufficient for
the project being undertaken.
Case StudyOn GINA 4
Phase 3: Model Planning
■ Included following considerations :
 Identify the right milestones to achieve the goals
 Trace how people move ideas from each ,milestone towards the goal.
 Once this is done, trace ideas that die and others that reach the goal.Compare the
journeys of ideas that make it and those that do not.
 Compare times and outcomes using a few different methods.These could be as simple
as t-tests or perhaps involve different types of classificationAlgorithms.
Case StudyOn GINA 5
Phase 4 : Model Building
■ The GINA team employed several analytical methods.This included work by the data
scientist using Natural Language Processing (NLP) techniques on the textual
descriptions of the innovation Roadmap ideas.
■ Social Network Analysis using R and Rstudio.
■ Developed SocialGraphs andVisualizations.
Case StudyOn GINA 6
Social Graph Data
Submitters and Finalists
and Graph of top
innovation influencers
• Fig shows socai graphs that portray
relationships between idea submitters within
GINA.
• Each colour represents an innovator from a
different country.
• The large dots with red circles around them
represent hubs.
• A hub represents a person with high
connectivity and a high “betweeness” score.
• The team usedTableau software for data
visualization and exploration and used the
Pivotal Greenplum database as the main data
repository and analytics engine.
Case StudyOn GINA 7
Phase 5 : Communicate Results
■ This project was considered successful in identifying boundary spanners and hidden
innovators.
■ The GINA project promoted knowledge sharing related to innovation an researchers
spanning multiple areas within the company and outside of it.
■ The GINA also enables EMC to cultivate additional property leads to research topics
and provided opportunities to forge relationships with universities for joint academic
research in the fields of Data Science and Big Data.
■ Study was successful in identifying hidden innovators.(found high density in Cork,
Ireland)
■ The CTO office launched longitudinal studies.
Case StudyOn GINA 8
Phase 6: Operationalize
■ Deployment was not really discussed
■ Key Findings
 Need more data in future
 Some data were sensitive
 A parallel initiative needs to be created to improve basic BI activities.
 A mechanism is needed to continually reevaluate the model after deployment.
Case StudyOn GINA 9
Analytic Plan from the EMC GINA
Project
Case StudyOn GINA 10
Reference
■ Data-science-and-big-data-analy-nieizv_book
■ https://bhavanakhivsara.wordpress.com/subjects/data-analytics/
Case StudyOn GINA 11
ThankYou!!
Case StudyOn GINA 12

Weitere ähnliche Inhalte

Was ist angesagt?

Map reduce in BIG DATA
Map reduce in BIG DATAMap reduce in BIG DATA
Map reduce in BIG DATAGauravBiswas9
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationDr. Abdul Ahad Abro
 
[AIIM17] Knowledge Management and the Internet of Things - Katrina Pugh
[AIIM17]  Knowledge Management and the Internet of Things - Katrina Pugh[AIIM17]  Knowledge Management and the Internet of Things - Katrina Pugh
[AIIM17] Knowledge Management and the Internet of Things - Katrina PughAIIM International
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceSrishti44
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big dataHari Priya
 
Grid based method & model based clustering method
Grid based method & model based clustering methodGrid based method & model based clustering method
Grid based method & model based clustering methodrajshreemuthiah
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptxSadhanaParameswaran
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data modelmoni sindhu
 
SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?Venu Anuganti
 
Big Data & Data Science
Big Data & Data ScienceBig Data & Data Science
Big Data & Data ScienceBrijeshGoyani
 
Data mining in Telecommunications
Data mining in TelecommunicationsData mining in Telecommunications
Data mining in TelecommunicationsMohsin Nadaf
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data AnalyticsUtkarsh Sharma
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data miningKrish_ver2
 
Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case Ramandeep Kaur Bagri
 
Biology protein structure in cloud computing
Biology protein structure in cloud computingBiology protein structure in cloud computing
Biology protein structure in cloud computinggaurav jain
 

Was ist angesagt? (20)

Map reduce in BIG DATA
Map reduce in BIG DATAMap reduce in BIG DATA
Map reduce in BIG DATA
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
 
What is big data?
What is big data?What is big data?
What is big data?
 
[AIIM17] Knowledge Management and the Internet of Things - Katrina Pugh
[AIIM17]  Knowledge Management and the Internet of Things - Katrina Pugh[AIIM17]  Knowledge Management and the Internet of Things - Katrina Pugh
[AIIM17] Knowledge Management and the Internet of Things - Katrina Pugh
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big data
 
Grid based method & model based clustering method
Grid based method & model based clustering methodGrid based method & model based clustering method
Grid based method & model based clustering method
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Data mining
Data miningData mining
Data mining
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 
SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?
 
Big Data & Data Science
Big Data & Data ScienceBig Data & Data Science
Big Data & Data Science
 
Data mining in Telecommunications
Data mining in TelecommunicationsData mining in Telecommunications
Data mining in Telecommunications
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data mining
 
Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case
 
Biology protein structure in cloud computing
Biology protein structure in cloud computingBiology protein structure in cloud computing
Biology protein structure in cloud computing
 

Ähnlich wie Analyzing Innovation Data at EMC's GINA Team

IT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptxIT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptxJack Zheng
 
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith SilvaNew Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith SilvaInstitute of Contemporary Sciences
 
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture SeriesCustomer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture SeriesLuciano Pesci, PhD
 
A Survey on Big Data Analytics
A Survey on Big Data AnalyticsA Survey on Big Data Analytics
A Survey on Big Data AnalyticsBHARATH KUMAR
 
Fealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policyFealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policyinnovationoecd
 
1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdfAyele40
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applicationsSubrat Swain
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...Joshua Gorinson
 
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...Sahilakhurana
 
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Sarah Jones
 
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdfMuhammadZafarHasan
 
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...apidays
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceUmmeSalmaM1
 
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of TanzaniaHadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of Tanzaniaijsrd.com
 

Ähnlich wie Analyzing Innovation Data at EMC's GINA Team (20)

IT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptxIT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptx
 
Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...
 
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith SilvaNew Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture SeriesCustomer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
 
A Survey on Big Data Analytics
A Survey on Big Data AnalyticsA Survey on Big Data Analytics
A Survey on Big Data Analytics
 
Fealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policyFealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policy
 
1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applications
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
 
Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146
 
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
 
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon HodsonCODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
 
Software Analytics
Software AnalyticsSoftware Analytics
Software Analytics
 
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
 
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
 
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
 
PlanetData Project Overview
PlanetData Project OverviewPlanetData Project Overview
PlanetData Project Overview
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of TanzaniaHadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
 

Mehr von divyawani2

Case Study on Cray T3E Architecture
Case Study on Cray T3E ArchitectureCase Study on Cray T3E Architecture
Case Study on Cray T3E Architecturedivyawani2
 
Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store divyawani2
 
Case study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOTCase study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOTdivyawani2
 
Automatic Water Dispenser using IOT
Automatic Water Dispenser using IOTAutomatic Water Dispenser using IOT
Automatic Water Dispenser using IOTdivyawani2
 
Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)divyawani2
 
Flutter technology Based on Web Development
Flutter technology Based on Web Development Flutter technology Based on Web Development
Flutter technology Based on Web Development divyawani2
 

Mehr von divyawani2 (6)

Case Study on Cray T3E Architecture
Case Study on Cray T3E ArchitectureCase Study on Cray T3E Architecture
Case Study on Cray T3E Architecture
 
Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store
 
Case study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOTCase study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOT
 
Automatic Water Dispenser using IOT
Automatic Water Dispenser using IOTAutomatic Water Dispenser using IOT
Automatic Water Dispenser using IOT
 
Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)
 
Flutter technology Based on Web Development
Flutter technology Based on Web Development Flutter technology Based on Web Development
Flutter technology Based on Web Development
 

Kürzlich hochgeladen

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 

Kürzlich hochgeladen (20)

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 

Analyzing Innovation Data at EMC's GINA Team

  • 1. SNJB’S LATE SAU K. B. JAIN COE, CHANDWAD DEPARTMENT OF COMPUTER ENGINEERING ACADEMIC YEAR 2020-21 SUBJECT: DATAANALYTICS CASE STUDY ON GINA By Name: Divya Prafull Wani Roll No:41 Class: BE Computer Date:16-08-2020 Case StudyOn GINA 1
  • 2. GINA ■ Global Innovation Network and Analysis. ■ The GINA case study provides an example of how a team applied the Data Analytics Lifecycle to analyze innovation data at EMC. ■ GINA is a group of senior technologists located in centers of excellence around the world. ■ The GINA team thought its approach would provide a means to share ideas globally and increase knowledge sharing among members who may be separated geographically. ■ It planned to create a data repository containing both structured and unstructured data to accomplish 3 main goals . 1. Store Formal and informal data. 2. Track research from global technologists. 3. Mine the data for patterns and insights to improve the teams operations and strategy. Case StudyOn GINA 2
  • 3. Phase 1 : Discovery ■ Team Members and Roles  Business user, project sponsor, project manager -Vice President from Office of CTO.  BI analyst – person from IT  Data Engineer and DBA – people from IT  Data Scientist – distinguished engineer. ■ The data fell into two categories  5 years of idea submissions from internal innovation contests.  Minutes an notes representing innovation and research activity from around the world. ■ The data fell into two categories  5 years of idea submissions from internal innovation contests.  Minutes an notes representing innovation and research activity from around the world. ■ Hypothesis grouped into 2 categories  Descriptive analytics of what is happening to spark further creativity, collaboration, an asset generation  Predictive analytics to advise executive management of where it should be investing in the future. Case StudyOn GINA 3
  • 4. Phase 2 : Data Preparation ■ Set up anAnalytical Sandbox to store and experiment on the data. ■ Discovered that certain data needed conditioning and normalization and that missing datasets were critical. ■ Team recognized that poor quality data could impact subsequent steps. ■ They discovered many names were misspelled and problems with extra spaces. ■ Important to determine what level of data quality and cleanliness was sufficient for the project being undertaken. Case StudyOn GINA 4
  • 5. Phase 3: Model Planning ■ Included following considerations :  Identify the right milestones to achieve the goals  Trace how people move ideas from each ,milestone towards the goal.  Once this is done, trace ideas that die and others that reach the goal.Compare the journeys of ideas that make it and those that do not.  Compare times and outcomes using a few different methods.These could be as simple as t-tests or perhaps involve different types of classificationAlgorithms. Case StudyOn GINA 5
  • 6. Phase 4 : Model Building ■ The GINA team employed several analytical methods.This included work by the data scientist using Natural Language Processing (NLP) techniques on the textual descriptions of the innovation Roadmap ideas. ■ Social Network Analysis using R and Rstudio. ■ Developed SocialGraphs andVisualizations. Case StudyOn GINA 6
  • 7. Social Graph Data Submitters and Finalists and Graph of top innovation influencers • Fig shows socai graphs that portray relationships between idea submitters within GINA. • Each colour represents an innovator from a different country. • The large dots with red circles around them represent hubs. • A hub represents a person with high connectivity and a high “betweeness” score. • The team usedTableau software for data visualization and exploration and used the Pivotal Greenplum database as the main data repository and analytics engine. Case StudyOn GINA 7
  • 8. Phase 5 : Communicate Results ■ This project was considered successful in identifying boundary spanners and hidden innovators. ■ The GINA project promoted knowledge sharing related to innovation an researchers spanning multiple areas within the company and outside of it. ■ The GINA also enables EMC to cultivate additional property leads to research topics and provided opportunities to forge relationships with universities for joint academic research in the fields of Data Science and Big Data. ■ Study was successful in identifying hidden innovators.(found high density in Cork, Ireland) ■ The CTO office launched longitudinal studies. Case StudyOn GINA 8
  • 9. Phase 6: Operationalize ■ Deployment was not really discussed ■ Key Findings  Need more data in future  Some data were sensitive  A parallel initiative needs to be created to improve basic BI activities.  A mechanism is needed to continually reevaluate the model after deployment. Case StudyOn GINA 9
  • 10. Analytic Plan from the EMC GINA Project Case StudyOn GINA 10