SlideShare ist ein Scribd-Unternehmen logo
1 von 23
INTRODUCTION TO
GOOGLE CLOUD PLATFORM
FOR BIG DATA
”
FIRST THINGS FIRST...
● Who Am I?
● What I'm Going to Talk About?
2
3
● Brazilian Data Analyst
● Databases Management Student
● Google fan
● Mom of 1 / Pet Mom of 8
● Plant Based Geek
● Crazy about Nature
4
WHAT I'M GOING TO TALK ABOUT?
■ Big Data Beyond the Hype
[ What Is | The 5 Vs ]
■ What is the Google Cloud Platform?
[ What Is | The Ecosystem ]
■ GCP Products for Big Data
[ Example of Big Data Lifecycle | Ingesting | Storing | Processing | Analysing ]
■ GCP Big Data Solutions to IMWT's Portfolio
[ Challenges | Example | Steps to Success ]
5
Big Data
Beyond the Hype
6
High-volume, high-velocity
and high-variety information
assets that demand cost-
effective, innovative forms of
information processing for
enhanced insight and
decision making.
WHAT IS BIG DATA?
Source: Gartner IT Glossary
7
BIG
DATA
Source: Adapted from Michael Walker (2012)
THE 5 Vs
Terabytes to Exabytes
of existing data
to process
Milliseconds to Seconds
to process
VOLUME
Data at Rest
VALUE
Data Into Money
VERACITY
Data In Doubt
VARIETY
Data In Many Forms
VELOCITY
Data In Motion
Structured, unstructured,
text, multimedia...Uncertainty due to data
inconsistency, incompleteness,
Ambiguities, model approximations...
Business models can be
associated to the data
8
What Is
Google Cloud Platform?
9
A suite of cloud
computing services that
runs on the same
infrastructure that
Google uses internally
for its end-user products.
WHAT IS GOOGLE CLOUD PLATFORM?
Source: GCP Website (2018)
10
GCP
ECOSYSTEM
Source: Google Cloud Platform (2018)
11
GCP
ECOSYSTEM
12
GCP Products to
Big Data
13
EXAMPLE OF BIG DATA LIFECYCLE
Source: GCP Website(2018)
14
INGESTION
Source: GCP Website(2018)
Serverless, fully managed, scalable and pay-
for-use platform for apps and beckends.
Save money while focus on code
rather than infrastructure
Integrated, open and global real-time event
stream ingestion, delivery and analysis
platform.
Fast reporting, targeting and
optimization in advertising and media
15
PROCESSING
Source: GCP Website(2018)
Simple, automated
and reliable stream
and batch data
processing platform.
Fast, easy-to-use and
fully managed cloud
service for running
Apache Spark and
Hadoop cluster.
Minimize latency and
maximize utilization.
Low costs. Focus on the
data, not on the cluster.
16
STORAGE
Source: GCP Website(2018)
In memory, relational,
non-relational, object
and warehouse cloud
storage solutions.
Secure, cost-effective and easily
access storage for every need.
17
EXPLORATION
Source: GCP Website(2018)
Easy-to-use and interactive
tool for data exploration,
analysis, visualization and
machine learning.
Fast, scalable, cost-effective
and fully managed cloud
data warehouse for
analytics.
Set of integrated data-and-
marketing analysis products.
Free. May incur compute, storage
and other cloud services.
Serverless and built-in Machine
Learning.
18
ANALYTICS
Source: GCP Website(2018)
Fast, large scale and easy-to-
use
AI products and services.
Easy-to-use deep learning
models to speech-to-text /
image-to-JSON conversion
and dynamic translation.
Pre trained models.
No advanced ML
skill required.
Better training performance
compared to other
deep learning systems.
19
GCP Big Data Solutions to
IMWT's Portfolio
20
Source: Adapted from Nasser T, Tariq RS (2015) Big Data Challenges. J Comput Eng Inf Technol 4:3
CHALLENGES
STORAGE
21
EXAMPLE
INGESTION PROCESSING EXPLORATION ANALYSIS
Web Crawler Solution
Simplified Architecture
APP ENGINE DATAFLOW
DATAPROC
SQL DATAPREP
DATALAB MACHINE LEARNING
DATA STUDIO
22
Source: Adapted from IBM (2014)
STEPS TO SUCCESS
Identify high-value opportunities
Establish the right architecture and funding model
Prove value to business through pilot programs
Scale by expanding to additional use cases
Transform to a data-driven culture
”
Thank You!

Weitere ähnliche Inhalte

Was ist angesagt?

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdfRetrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Po-Chuan Chen
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Mihai Criveti
 

Was ist angesagt? (20)

Introduction to Google Cloud Platform
Introduction to Google Cloud PlatformIntroduction to Google Cloud Platform
Introduction to Google Cloud Platform
 
Customizing LLMs
Customizing LLMsCustomizing LLMs
Customizing LLMs
 
Data Ingestion in Big Data and IoT platforms
Data Ingestion in Big Data and IoT platformsData Ingestion in Big Data and IoT platforms
Data Ingestion in Big Data and IoT platforms
 
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdfRetrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
 
An Introduction to Generative AI
An Introduction  to Generative AIAn Introduction  to Generative AI
An Introduction to Generative AI
 
FLiP Into Trino
FLiP Into TrinoFLiP Into Trino
FLiP Into Trino
 
Build Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks StreamingBuild Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks Streaming
 
Vector database
Vector databaseVector database
Vector database
 
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
 
RAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceRAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data Science
 
ML Infrastracture @ Dropbox
ML Infrastracture @ Dropbox ML Infrastracture @ Dropbox
ML Infrastracture @ Dropbox
 
Parquet performance tuning: the missing guide
Parquet performance tuning: the missing guideParquet performance tuning: the missing guide
Parquet performance tuning: the missing guide
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
Observability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineageObservability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineage
 
Deep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDeep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache Spark
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
 
Amazon Simpledb
Amazon Simpledb Amazon Simpledb
Amazon Simpledb
 
Introduction of Knowledge Graphs
Introduction of Knowledge GraphsIntroduction of Knowledge Graphs
Introduction of Knowledge Graphs
 
Zipline: Airbnb’s Machine Learning Data Management Platform with Nikhil Simha...
Zipline: Airbnb’s Machine Learning Data Management Platform with Nikhil Simha...Zipline: Airbnb’s Machine Learning Data Management Platform with Nikhil Simha...
Zipline: Airbnb’s Machine Learning Data Management Platform with Nikhil Simha...
 
Slim Baltagi – Flink vs. Spark
Slim Baltagi – Flink vs. SparkSlim Baltagi – Flink vs. Spark
Slim Baltagi – Flink vs. Spark
 

Ähnlich wie Introduction to Google Cloud Platform for Big Data - Trusted Conf

Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache FlinkSuneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
Flink Forward
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud Platform
ConnectaDigital
 
Big Data for Smart City
Big Data for Smart CityBig Data for Smart City
Big Data for Smart City
Koltiva
 

Ähnlich wie Introduction to Google Cloud Platform for Big Data - Trusted Conf (20)

Eric Andersen Keynote
Eric Andersen KeynoteEric Andersen Keynote
Eric Andersen Keynote
 
Modern Thinking área digital MSKM 21/09/2017
Modern Thinking área digital MSKM 21/09/2017Modern Thinking área digital MSKM 21/09/2017
Modern Thinking área digital MSKM 21/09/2017
 
How to build and run a big data platform in the 21st century
How to build and run a big data platform in the 21st centuryHow to build and run a big data platform in the 21st century
How to build and run a big data platform in the 21st century
 
Lambda architecture for real time big data
Lambda architecture for real time big dataLambda architecture for real time big data
Lambda architecture for real time big data
 
Big data
Big dataBig data
Big data
 
Keynote at the MTSR conference
Keynote at the MTSR conferenceKeynote at the MTSR conference
Keynote at the MTSR conference
 
Google Cloud Machine Learning
 Google Cloud Machine Learning  Google Cloud Machine Learning
Google Cloud Machine Learning
 
Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache FlinkSuneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
 
Case Study - Gordon Foods Delivers Fresh Data to the Cloud
Case Study - Gordon Foods Delivers Fresh Data to the CloudCase Study - Gordon Foods Delivers Fresh Data to the Cloud
Case Study - Gordon Foods Delivers Fresh Data to the Cloud
 
What is the future of data strategy?
What is the future of data strategy?What is the future of data strategy?
What is the future of data strategy?
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud Platform
 
BigDataFinal.pptx
BigDataFinal.pptxBigDataFinal.pptx
BigDataFinal.pptx
 
Lambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataLambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big data
 
Big Data for Smart City
Big Data for Smart CityBig Data for Smart City
Big Data for Smart City
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
Google Cloud Data Platform - Why Google for Data Analysis?
Google Cloud Data Platform - Why Google for Data Analysis?Google Cloud Data Platform - Why Google for Data Analysis?
Google Cloud Data Platform - Why Google for Data Analysis?
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life Revolution
 
Advanced data science algorithms applied to scalable stream processing by Dav...
Advanced data science algorithms applied to scalable stream processing by Dav...Advanced data science algorithms applied to scalable stream processing by Dav...
Advanced data science algorithms applied to scalable stream processing by Dav...
 

Mehr von In Marketing We Trust

Mehr von In Marketing We Trust (20)

Surviving the Analytics Apocalypse_ The Death of Universal Analytics and the...
Surviving the Analytics Apocalypse_  The Death of Universal Analytics and the...Surviving the Analytics Apocalypse_  The Death of Universal Analytics and the...
Surviving the Analytics Apocalypse_ The Death of Universal Analytics and the...
 
Data Driven Internal Linking With Botify
Data Driven Internal Linking With BotifyData Driven Internal Linking With Botify
Data Driven Internal Linking With Botify
 
AI-Powered SEO with Botify: Automation in Prevention, Execution, and Implemen...
AI-Powered SEO with Botify: Automation in Prevention, Execution, and Implemen...AI-Powered SEO with Botify: Automation in Prevention, Execution, and Implemen...
AI-Powered SEO with Botify: Automation in Prevention, Execution, and Implemen...
 
IMPACT: Improving My Performance And Corroborating Them
IMPACT: Improving My Performance And Corroborating ThemIMPACT: Improving My Performance And Corroborating Them
IMPACT: Improving My Performance And Corroborating Them
 
COVID-19 Consumer Trends and Post-Pandemic SEO
COVID-19 Consumer Trends and Post-Pandemic SEOCOVID-19 Consumer Trends and Post-Pandemic SEO
COVID-19 Consumer Trends and Post-Pandemic SEO
 
How has the Talent Market Changed After the Pandemic?
How has the Talent Market Changed After the Pandemic?How has the Talent Market Changed After the Pandemic?
How has the Talent Market Changed After the Pandemic?
 
How to Effectively Communicate with Clients and Teammates
How to Effectively Communicate with Clients and TeammatesHow to Effectively Communicate with Clients and Teammates
How to Effectively Communicate with Clients and Teammates
 
The Explosion of Online Shopping During the Pandemic
The Explosion of Online Shopping During the PandemicThe Explosion of Online Shopping During the Pandemic
The Explosion of Online Shopping During the Pandemic
 
Work with Google, Play with Google! Google Search Operators
Work with Google, Play with Google! Google Search OperatorsWork with Google, Play with Google! Google Search Operators
Work with Google, Play with Google! Google Search Operators
 
Manipulated or Influenced? The Power of Persuasion
Manipulated or Influenced? The Power of PersuasionManipulated or Influenced? The Power of Persuasion
Manipulated or Influenced? The Power of Persuasion
 
Influencer Marketing: Why it Works Despite the Pandemic
Influencer Marketing: Why it Works Despite the PandemicInfluencer Marketing: Why it Works Despite the Pandemic
Influencer Marketing: Why it Works Despite the Pandemic
 
First-Party World Problems: Future-Proof Your Business with First-Party Data
First-Party World Problems: Future-Proof Your Business with First-Party DataFirst-Party World Problems: Future-Proof Your Business with First-Party Data
First-Party World Problems: Future-Proof Your Business with First-Party Data
 
Getting Started with Google Analytics 4
Getting Started with Google Analytics 4Getting Started with Google Analytics 4
Getting Started with Google Analytics 4
 
Building an Integrated Digital Powerhouse
Building an Integrated Digital PowerhouseBuilding an Integrated Digital Powerhouse
Building an Integrated Digital Powerhouse
 
What Does Google See When It Crawls My Site?
What Does Google See When It Crawls My Site?What Does Google See When It Crawls My Site?
What Does Google See When It Crawls My Site?
 
Unleash the Power of Google Without Keywords
Unleash the Power of Google Without KeywordsUnleash the Power of Google Without Keywords
Unleash the Power of Google Without Keywords
 
The Great Divide: Insight to Action
The Great Divide: Insight to ActionThe Great Divide: Insight to Action
The Great Divide: Insight to Action
 
The Importance of a Data-Driven Dynamic Creative Strategy
The Importance of a Data-Driven Dynamic Creative StrategyThe Importance of a Data-Driven Dynamic Creative Strategy
The Importance of a Data-Driven Dynamic Creative Strategy
 
Data-Driven Internal Linking Optimisation
Data-Driven Internal Linking OptimisationData-Driven Internal Linking Optimisation
Data-Driven Internal Linking Optimisation
 
Building a Marketing Data Warehouse in Google BigQuery with Supermetrics
Building a Marketing Data Warehouse in Google BigQuery with SupermetricsBuilding a Marketing Data Warehouse in Google BigQuery with Supermetrics
Building a Marketing Data Warehouse in Google BigQuery with Supermetrics
 

Kürzlich hochgeladen

Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 

Kürzlich hochgeladen (20)

Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 

Introduction to Google Cloud Platform for Big Data - Trusted Conf

  • 1. INTRODUCTION TO GOOGLE CLOUD PLATFORM FOR BIG DATA
  • 2. ” FIRST THINGS FIRST... ● Who Am I? ● What I'm Going to Talk About? 2
  • 3. 3 ● Brazilian Data Analyst ● Databases Management Student ● Google fan ● Mom of 1 / Pet Mom of 8 ● Plant Based Geek ● Crazy about Nature
  • 4. 4 WHAT I'M GOING TO TALK ABOUT? ■ Big Data Beyond the Hype [ What Is | The 5 Vs ] ■ What is the Google Cloud Platform? [ What Is | The Ecosystem ] ■ GCP Products for Big Data [ Example of Big Data Lifecycle | Ingesting | Storing | Processing | Analysing ] ■ GCP Big Data Solutions to IMWT's Portfolio [ Challenges | Example | Steps to Success ]
  • 6. 6 High-volume, high-velocity and high-variety information assets that demand cost- effective, innovative forms of information processing for enhanced insight and decision making. WHAT IS BIG DATA? Source: Gartner IT Glossary
  • 7. 7 BIG DATA Source: Adapted from Michael Walker (2012) THE 5 Vs Terabytes to Exabytes of existing data to process Milliseconds to Seconds to process VOLUME Data at Rest VALUE Data Into Money VERACITY Data In Doubt VARIETY Data In Many Forms VELOCITY Data In Motion Structured, unstructured, text, multimedia...Uncertainty due to data inconsistency, incompleteness, Ambiguities, model approximations... Business models can be associated to the data
  • 9. 9 A suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products. WHAT IS GOOGLE CLOUD PLATFORM? Source: GCP Website (2018)
  • 13. 13 EXAMPLE OF BIG DATA LIFECYCLE Source: GCP Website(2018)
  • 14. 14 INGESTION Source: GCP Website(2018) Serverless, fully managed, scalable and pay- for-use platform for apps and beckends. Save money while focus on code rather than infrastructure Integrated, open and global real-time event stream ingestion, delivery and analysis platform. Fast reporting, targeting and optimization in advertising and media
  • 15. 15 PROCESSING Source: GCP Website(2018) Simple, automated and reliable stream and batch data processing platform. Fast, easy-to-use and fully managed cloud service for running Apache Spark and Hadoop cluster. Minimize latency and maximize utilization. Low costs. Focus on the data, not on the cluster.
  • 16. 16 STORAGE Source: GCP Website(2018) In memory, relational, non-relational, object and warehouse cloud storage solutions. Secure, cost-effective and easily access storage for every need.
  • 17. 17 EXPLORATION Source: GCP Website(2018) Easy-to-use and interactive tool for data exploration, analysis, visualization and machine learning. Fast, scalable, cost-effective and fully managed cloud data warehouse for analytics. Set of integrated data-and- marketing analysis products. Free. May incur compute, storage and other cloud services. Serverless and built-in Machine Learning.
  • 18. 18 ANALYTICS Source: GCP Website(2018) Fast, large scale and easy-to- use AI products and services. Easy-to-use deep learning models to speech-to-text / image-to-JSON conversion and dynamic translation. Pre trained models. No advanced ML skill required. Better training performance compared to other deep learning systems.
  • 19. 19 GCP Big Data Solutions to IMWT's Portfolio
  • 20. 20 Source: Adapted from Nasser T, Tariq RS (2015) Big Data Challenges. J Comput Eng Inf Technol 4:3 CHALLENGES
  • 21. STORAGE 21 EXAMPLE INGESTION PROCESSING EXPLORATION ANALYSIS Web Crawler Solution Simplified Architecture APP ENGINE DATAFLOW DATAPROC SQL DATAPREP DATALAB MACHINE LEARNING DATA STUDIO
  • 22. 22 Source: Adapted from IBM (2014) STEPS TO SUCCESS Identify high-value opportunities Establish the right architecture and funding model Prove value to business through pilot programs Scale by expanding to additional use cases Transform to a data-driven culture