Notes
Workshop
AGENDA
BIG DATA !
BIG DATA: HYPE OR REALITY?
DEEP DIVE INTO THE INFRASTRUCTURE
BIG DATA SCENARIO A 2 Z
DATA ANALYTICS
DATA VISUALIZATION
EMOTION INTELLIGENCE
WORD EMBEDDING IN NLP
DEEP LEARNING IN AUTONOMOUS CAR
PREDICTION MODELS IN OIL AND GAS
MICROSOFT AZURE
5G: IMT 2020
This document contains lecture and workshop notes from the BIG
DATA SCIENCE workshop held at NTI on 6-7 December 2017
https://www.linkedin.com/in/mrastro
BIGDATA !
Not About How BIG it is!
“Big data is a term that describes the large volume of data – both structured
and unstructured – that inundates a business on a day-to-day basis. But it’s not
the amount of data that’s important. It’s what organizations do with the data
that matters. Big data can be analyzed for insights that lead to better decisions
and strategic business moves.” [2]
Definition
“Big data is about looking ahead, beyond
what everybody else sees.” [1]
Peter Sondergaard, senior vice president and global head of research at Gartner
Although there’s no fixed number marking the beginning of “big”, we’re talking much bigger
than conventional tools like spreadsheets and relational databases can handle easily. Many
case studies of big data involve datasets of many petabytes—or even exabytes—
made possible only by using high-performance cloud-based computing.
Many big-data applications, such as cancer research, use historical data, but much attention
is being paid to how to leverage real-time data—not just collected in real time, but processed
and accessed in real time too. In many scenarios, users must be able to ask questions
iteratively and get answers in minutes, not days.
Big data covers not just “structured” data neatly normalized into a fixed schema
and exported from ERP or CRM systems. It also includes semi-structured data
(which, although it has no fixed configuration, is categorized using tags or other
metadata) and unstructured data, such as email messages and videos.
MOST DEFINITIONS OF BIG DATA AGREE THAT IT INVOLVES THE “THREE VS”: VOLUME, VELOCITY AND VARIETY [4]
Any technology is only useful if it solves a problem (or problems).
As we all know, there is data, lots of it: historical data, sure, but also new
data generated from social media apps, click stream data from web
applications, IoT sensor data, and on and on. The amount of data is larger
than ever, coming in at ever-increasing rates, and in many different formats. [3]
The Problem
Earlier this year (2017), Gartner published its Hype Cycle for Emerging Technologies [5].
It notes that many of the emerging technologies, including virtual personal
assistants, machine learning, the IoT, and M2M, use data to track performance and
generate big data to define success.
Looking closely at the peak of the cycle, we can see IoT and machine/deep learning, with
about 2-5 years to reach mainstream adoption (expected between 2020 and 2022), which
creates a world of connectivity.
HINT: The connected world amplifies big data and its existence everywhere.
A DEEP DIVE INTO INFRASTRUCTURE
Traditional Data Management Systems [6]
SHARED I/O
SHARED PROCESSING
LIMITED SCALABILITY
SERVICE BOTTLENECKS
HIGH COST FACTOR
Abstraction of BIG DATA Platform [6]
PARALLEL PROCESSING
LINEAR SCALABILITY
DISTRIBUTED SERVICE
LOW COST FACTOR
Notes: The main advantage of distributed systems is that they are software defined:
the cluster is optimized for software execution (e.g. Hadoop). Files or datasets can be
split into segments and distributed across different nodes (worker nodes)
within the network to be processed in parallel, which in turn improves performance.
They are also reliable and upgradable: more resources can be added
easily, which also reduces the cost factor.
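As a rough illustration of the note above, the following sketch splits a small dataset into segments, processes each segment independently, and merges the partial results. It is a toy under stated assumptions, not how Hadoop is implemented: threads in one process stand in for worker nodes, and the names `split_into_segments`, `count_words` and `parallel_word_count` are made up for this example.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_segments(lines, num_segments):
    """Split a list of lines into at most num_segments contiguous segments."""
    size = max(1, -(-len(lines) // num_segments))  # ceiling division
    return [lines[i:i + size] for i in range(0, len(lines), size)]


def count_words(segment):
    """Work done independently on one segment: count the words it contains."""
    counts = Counter()
    for line in segment:
        counts.update(line.lower().split())
    return counts


def parallel_word_count(lines, num_workers=3):
    """Process the segments concurrently, then merge the partial counts."""
    segments = split_into_segments(lines, num_workers)
    # Threads are only a stand-in for worker nodes (assumption of this sketch).
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partial_counts = list(pool.map(count_words, segments))
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total


sample = [
    "big data is not about how big it is",
    "it is about what you do with the data",
    "data data everywhere",
]
result = parallel_word_count(sample, num_workers=3)
print(result["data"])  # 4
```

The merged result does not depend on the number of workers, which is what allows more nodes to be added without changing the logic.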
SHARED NOTHING
Notes: For any big data file, slice the file into blocks; those blocks are then spread across
the available worker nodes. Hint: the n nodes do not have to be physical nodes;
we can deploy n physical nodes with m VMs (virtual nodes/machines) that together act as a single
cluster. Hint: each node takes one or more blocks (depending on the size).
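The slicing described in the note can be sketched as follows. This is a minimal illustration under stated assumptions: the block size, the node names and the round-robin placement are chosen for the example only. A real platform would typically also replicate blocks and use its own placement policy.

```python
from typing import Dict, List


def slice_into_blocks(data: bytes, block_size: int) -> List[bytes]:
    """Cut the file content into fixed-size blocks; the last one may be shorter."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def assign_blocks(num_blocks: int, nodes: List[str]) -> Dict[str, List[int]]:
    """Round-robin placement (an assumption): block i goes to node i % len(nodes)."""
    placement: Dict[str, List[int]] = {node: [] for node in nodes}
    for block_id in range(num_blocks):
        placement[nodes[block_id % len(nodes)]].append(block_id)
    return placement


data = bytes(range(10))                 # 10-byte stand-in for a large file
blocks = slice_into_blocks(data, block_size=4)
nodes = ["worker-1", "worker-2"]        # hypothetical node names, physical or virtual
placement = assign_blocks(len(blocks), nodes)
print([len(b) for b in blocks])         # [4, 4, 2]
print(placement)                        # {'worker-1': [0, 2], 'worker-2': [1]}
```

With three blocks and two nodes, one node holds two blocks, which matches the hint that a node takes one or more blocks depending on the size.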
SCENARIO
Selecting a Modeling Technique [6]
DEVELOP YOUR USE CASE [6]
“Formulate a Data-Driven Use Case
High-level description and objectives of the use case
Challenges addressed by the use case
Pain points and impact of each challenge
Goals, success criteria, constraints and assumptions
Available data, data sources and required resources
Modeling approach for each challenge
Overall model structure & workflow
Application of the use case into operational solution”
STRUCTURED DATA [6]
“Commonly refers to database tables with a well-defined column structure, including
data types and specifications. It might also include other non-database-managed
formats such as OLAP cubes, CSV files and fixed-column files, as long as they are
consistently generated, e.g. exported from a database or generated by an ATM.”
UNSTRUCTURED DATA [6]
“Data NOT following a well-defined structure, either because of the nature of data
generation or the nature of the data format. Most of the data generated around the
globe is unstructured data, to different degrees:
Semi-structured: XML log files, HTML content
Quasi-structured: query strings in websites URLs, log events/alerts
Unstructured: text, pdf, word, social feeds, web content, images, video”
Img src: http://bigdata.black/infrastructure/storage/unstructured-data
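To make the categories above concrete, the short sketch below reads one tiny sample of each kind using only the Python standard library. All sample values (the account rows, the log events, the URL and the sentence) are invented for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET
from urllib.parse import parse_qs, urlparse

# Structured: a CSV export with fixed, well-defined columns.
csv_text = "account_id,amount\n1001,250.00\n1002,75.50\n"
rows = list(csv.DictReader(io.StringIO(csv_text)))
total_amount = sum(float(row["amount"]) for row in rows)

# Semi-structured: an XML log, where tags and attributes describe the content.
xml_text = (
    '<log><event level="ERROR">disk full</event>'
    '<event level="INFO">job started</event></log>'
)
root = ET.fromstring(xml_text)
errors = [e.text for e in root.findall("event") if e.get("level") == "ERROR"]

# Quasi-structured: a query string embedded in a website URL.
url = "https://example.com/search?q=big+data&page=2"
params = parse_qs(urlparse(url).query)

# Unstructured: free text has no schema; here it is only split into tokens.
text = "Service was slow but the staff were helpful"
tokens = text.lower().split()

print(total_amount)  # 325.5
print(errors)        # ['disk full']
print(params)        # {'q': ['big data'], 'page': ['2']}
print(len(tokens))   # 8
```

The less structure the data has, the more interpretation is left to the code: the CSV columns can be summed directly, while the free text needs further processing before it yields anything measurable.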
“Unfortunately, it’s often very difficult to analyze unstructured data. To help with the
problem, organizations have turned to a number of different software solutions designed
to search unstructured data and extract important information. The primary benefit of
these tools is the ability to glean actionable information that can help a business succeed
in a competitive environment. Because the volume of unstructured data is growing so
rapidly, many enterprises also turn to technological solutions to help them better manage
and store their unstructured data. These can include hardware or software solutions that
enable them to make the most efficient use of their available storage space.” [7]
DESCRIPTIVE ANALYTICS - PRACTICAL TOUR
AUTOMOTIVE INDUSTRY
DEEP LEARNING IN AUTONOMOUS CAR
Self-Driving Cars to the 2020 Tokyo Olympics
Telecom: Case Study
Leveraging data to better understand and satisfy customers'
needs; churn prevention
Monitor and visualize all kinds of site and service alarms,
solve KPI problems, and predict insights in near real time
Predictive Maintenance
Thanks to
Hisham Arafat
Digital Transformation Lead Consultant, Solutions
Architect, Technology Strategist & Researcher
References
[1] Gartner: Gartner Says Big Data Creates Big Jobs
[2] SAS: Big Data! What it is and why it matters
[3] IBM: What is big data? More than volume, velocity and variety
[4] Verizon: BIG DATA: HYPE OR REALITY?
[5] Top Trends in the Gartner Hype Cycle for Emerging Technologies, 2017
[6] Digital Transformation Industry Perspective, Eng. Hisham
[7] Unstructured Data: BIGDATA
Big Data Science Workshop Documentation V1.0
