SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Big Data…Are you ready for
the next wave?
MAHMOUD SABRY
Agenda
 The Next Wave of Computing
 Information, Data & Knowledge
 Data warehouse vs. Database
 Row-oriented vs. Column-oriented DB
 Big Data Era
 HP-Vertica
 HP-Vertica Bulk Upload Example
The next wave of computing
Cloud + Big Data Analytics, Mobile & Social leads to what IBM calls “The Era of Smart.”
Data, Information & Knowledge (1/3)
 Data are raw facts, and have no meaning on their own
 Example
 Yes, Yes, No, Yes, No, Yes, No, Yes
Data, Information & Knowledge (2/3)
 Information is data that have meaning in a context
 Example
Yes, Yes, No, Yes, No, Yes, No,
Yes, No, Yes, YesRaw Data
Context
Responses to the market
research question – “Would
you buy brand x at price y?”
Information ???
Processing
Data, Information & Knowledge (3/3)
 Knowledge
 understanding of someone or something
 acquired through experience or learning
 Example
Based on last collected information, A Marketing Manager could use this information to
decide whether or not to raise or lower price.
Data Warehouse vs. Database (1/2)
Feature DWH DB
Data Stored It usually stores the Historical data
whose accuracy is maintained over
time.
It mainly stores the Current data
which always guaranteed to be up-
to-date.
Characteristic It is based on Informational
Processing.
It is based on Operational
Processing.
Function It is used for long-term informational
requirements and decision support.
It is used for day-to-day operations.
Focus The focus is on “Information OUT” The focus is on “Data IN”
Number of
records
accessed
A bunch of millions of records. A few tens of records.
Data Warehouse vs. Database (2/2)
Feature DWH DB
Access It mostly use the read access for the
stored data.
The most frequent type of access
type is read/write.
Orientation Based on Analysis. Based on Transaction.
Common users Analysts DBAs
KPI Query throughput Transaction throughput
Unit of work Complex queries Short and simple transactions
Row-based vs. Column-based DB (1/2)
Feature Row-oriented DB Column-oriented DB
Mechanism stores table records in sequence or
rows.
stores table records in sequence or
columns.
Retrieval of a few
columns
Slower Faster
Retrieval of a single
record
Faster Slower
Row-based vs. Column-based DB (2/2)
Feature Row-oriented DB Column-oriented DB
Insertion/Updating of single new
record
Faster Slower
Aggregation of Single Column, e.g.
sum(price)
Slower Faster
Compression Higher. As stores similar data
together
It depends!
Examples Oracle OLTP
MS SQL Server up to 2008
HP-Vertica
Sybase IQ
MS SQL Server 2012
Big Data Era – Famous quote
From the dawn of civilization until
2003, humankind generated five
exabytes of data. Now we produce
five exabytes every two days…and
the pace is accelerating.
Eric Schmidt,
Executive Chairman, Google
Big Data – Definition
 The basic idea behind the phrase 'Big Data' is that everything we do is
increasingly leaving a digital trace (or data), which we (and others) can
use and analyze.
 Big Data therefore refers to our ability to make use of the ever-increasing
volumes of data.
Big Data – Characteristics (6 V’s)
 Volume  Terabytes, Distributed, Tables & Files
 Velocity  Real-time processing/Streams
 Variety  Structured, Semi-structured & Un-structured data
 Value  Correlations & Statistical analysis
 Veracity  Accountability, Trust, Origin & Reputation
 Variability  Changing Data/model
Big Data – What is Hadoop ?
 Apache™ Hadoop® is an open source platform that enables distributed
processing of large data sets across clusters of servers.
 It is designed to scale up from a single server to thousands of machines,
with very high degree of fault tolerance.
Big Data – HL Archit. Of Hadoop
A programming model for large scale
data processing.
A distributed file-system that stores data
on Clustered machines.
Schedules map or reduce jobs to task
trackers with an awareness of the data
location.
keeps the directory tree of all files in the file
system, and tracks where across the cluster
the file data is kept.
Big Data – MapReduce, How it works ?
Map: Filter & sort of data Reduce: Summarize & aggregate of data
HP-Vertica
 An analytic database management platform. Founded in 2005.
 Helps you monetize all of your data in real-time and at massive scale.
 Queries run 50-1,000x faster.
 Store 10-30x more data per server.
 Openness and simplicity (use any BI/ETL tools, Hadoop, etc.)
 Suited for “Structured” data only.
 HP Vertica Flex Zone: built on the HP Vertica core, enables load and
analyze structured and semi-structured data, such as social media, sensor,
log files, and machine data.
HP-Vertica Features (1/6)
 Column Orientation
 Vertica organizes data for each column
 Each column is stored separately on disk
 Only reads the columns needed to answer
the query
 Significant reduction of disk I/O
HP-Vertica Features (2/6)
 Advanced Compression
 Vertica replaces slower disk I/O with faster CPU cycles
to encode data elements into a more compact form
and query them.
 Vertica’s innovative query engine operates directly
on compressed data, meaning that it can actually
require fewer CPU operations to process
the compressed version of a table.
HP-Vertica Features (3/6)
 High Availability
 RAID-like functionality within database
 If a node fails, a copy is available on one of the surviving nodes
 Always-on Queries and Loads
 System continues to load and query when nodes are down
 Automatically recovers missing data by querying other nodes
HP-Vertica Features (4/6)
 Automatic Database Designer (DBD)
 Recommends a physical DB design that provides the best performance for the user's workload
 Analyzes your logical schema, sample data, and sample queries
 Minimizes DBA tuning
 Run anytime for additional optimization, without stopping the database
HP-Vertica Features (5/6)
 Massively Parallel Processing (MPP)
 Parallel design leverages data projections to
enable distributed storage and workload
 Active redundancy
 Automatic replication, failover and recovery
HP-Vertica Features (6/6)
 Native SQL and Application Integration
 Standard SQL Interface
 Simple integration with Hadoop and existing BI and ETL tools
 Supports SQL, ODBC, JDBC and majority ETL and BI reporting products
HP-Vertica Bulk Upload Example (1/3)
 HP Vertica 7.0, Single instance on VMware workstation v.11
 Installed on Windows 8 machine HP-EliteBook 8440p, 8GB RAM
 HP Vertica virtual machine has below specs:
 Memory: 4GB
 Disk: 16 GB
 Processor: Single Processor Intel(R) Core(TM) i7 CPU M 620 @ 2.67GHz
HP-Vertica Bulk Upload Example (2/3)
 Scenario Details
 Upload a table with “|” delimited columns from a local text file
 Table file size is: 1.30099 GB
 No. of rows is: 10252864
 No. of columns Is: 21
 Table is partitioned into 10 partitions by using “customer_age“ column
 Only one projection (the default “<table_name>_Super” )
HP-Vertica Bulk Upload Example (3/3)
 Scenario Actions and Results
 Upload by using “DIRECT” option to load directly to disk (ROS: Read Optimized
Storage)
 COPY public.myTable from '/opt/vertica/examples/VMart_Schema/myTable.tbl'
DIRECT;
 Time taken is more than 4 minutes
 Upload without using “DIRECT” option to load to memory(WOS: Write Optimized
Storage)
 COPY public.myTable from '/opt/vertica/examples/VMart_Schema/myTable.tbl';
 Time taken is about 2.3 minutes
Thanks!

Weitere ähnliche Inhalte

Was ist angesagt?

Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
Big Data Analytics 2014
Big Data Analytics 2014Big Data Analytics 2014
Big Data Analytics 2014Stratebi
 
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 MillionHow One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 MillionDataWorks Summit
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a surveyredpel dot com
 
Data ware house design
Data ware house designData ware house design
Data ware house designSayed Ahmed
 
From Traditional Data Warehouse To Real Time Data Warehouse
From Traditional Data Warehouse To Real Time Data WarehouseFrom Traditional Data Warehouse To Real Time Data Warehouse
From Traditional Data Warehouse To Real Time Data WarehouseOsama Hussein
 
The thinking persons guide to data warehouse design
The thinking persons guide to data warehouse designThe thinking persons guide to data warehouse design
The thinking persons guide to data warehouse designCalpont
 
Data Warehouse Logical Design using Mysql
Data Warehouse Logical Design using MysqlData Warehouse Logical Design using Mysql
Data Warehouse Logical Design using MysqlHAFIZ Islam
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundationshktripathy
 
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.George Joseph
 
NTT Data - Shinichi Yamada - Hadoop World 2010
NTT Data - Shinichi Yamada - Hadoop World 2010NTT Data - Shinichi Yamada - Hadoop World 2010
NTT Data - Shinichi Yamada - Hadoop World 2010Cloudera, Inc.
 
Making MySQL Great For Business Intelligence
Making MySQL Great For Business IntelligenceMaking MySQL Great For Business Intelligence
Making MySQL Great For Business IntelligenceCalpont
 
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland Bouman
 
Building High Performance MySQL Query Systems and Analytic Applications
Building High Performance MySQL Query Systems and Analytic ApplicationsBuilding High Performance MySQL Query Systems and Analytic Applications
Building High Performance MySQL Query Systems and Analytic ApplicationsCalpont
 
02. Data Warehouse and OLAP
02. Data Warehouse and OLAP02. Data Warehouse and OLAP
02. Data Warehouse and OLAPAchmad Solichin
 
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsBest Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsCloudera, Inc.
 

Was ist angesagt? (20)

Informatica doc
Informatica docInformatica doc
Informatica doc
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Big data
Big dataBig data
Big data
 
Big Data Analytics 2014
Big Data Analytics 2014Big Data Analytics 2014
Big Data Analytics 2014
 
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 MillionHow One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
How One Company Offloaded Data Warehouse ETL To Hadoop and Saved $30 Million
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a survey
 
Big Data Concepts
Big Data ConceptsBig Data Concepts
Big Data Concepts
 
Data ware house design
Data ware house designData ware house design
Data ware house design
 
OLAP
OLAPOLAP
OLAP
 
From Traditional Data Warehouse To Real Time Data Warehouse
From Traditional Data Warehouse To Real Time Data WarehouseFrom Traditional Data Warehouse To Real Time Data Warehouse
From Traditional Data Warehouse To Real Time Data Warehouse
 
The thinking persons guide to data warehouse design
The thinking persons guide to data warehouse designThe thinking persons guide to data warehouse design
The thinking persons guide to data warehouse design
 
Data Warehouse Logical Design using Mysql
Data Warehouse Logical Design using MysqlData Warehouse Logical Design using Mysql
Data Warehouse Logical Design using Mysql
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
IN-MEMORY DATABASE SYSTEMS FOR BIG DATA MANAGEMENT.SAP HANA DATABASE.
 
NTT Data - Shinichi Yamada - Hadoop World 2010
NTT Data - Shinichi Yamada - Hadoop World 2010NTT Data - Shinichi Yamada - Hadoop World 2010
NTT Data - Shinichi Yamada - Hadoop World 2010
 
Making MySQL Great For Business Intelligence
Making MySQL Great For Business IntelligenceMaking MySQL Great For Business Intelligence
Making MySQL Great For Business Intelligence
 
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
 
Building High Performance MySQL Query Systems and Analytic Applications
Building High Performance MySQL Query Systems and Analytic ApplicationsBuilding High Performance MySQL Query Systems and Analytic Applications
Building High Performance MySQL Query Systems and Analytic Applications
 
02. Data Warehouse and OLAP
02. Data Warehouse and OLAP02. Data Warehouse and OLAP
02. Data Warehouse and OLAP
 
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsBest Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
 

Ähnlich wie Big Data .. Are you ready for the next wave?

Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010nzhang
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiativeMansi Mehra
 
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisBig Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisYuanyuan Tian
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiativeMansi Mehra
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry PerspectiveCloudera, Inc.
 
Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Martin Bém
 
HPE Hadoop Solutions - From use cases to proposal
HPE Hadoop Solutions - From use cases to proposalHPE Hadoop Solutions - From use cases to proposal
HPE Hadoop Solutions - From use cases to proposalDataWorks Summit
 
AWS Summit 2011: Big Data Analytics in the AWS cloud
AWS Summit 2011: Big Data Analytics in the AWS cloudAWS Summit 2011: Big Data Analytics in the AWS cloud
AWS Summit 2011: Big Data Analytics in the AWS cloudAmazon Web Services
 
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap IT Strategy Group
 
Hadoop Developer
Hadoop DeveloperHadoop Developer
Hadoop DeveloperEdureka!
 
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and FacebookHow Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and FacebookAmr Awadallah
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?James Serra
 
Google Data Engineering.pdf
Google Data Engineering.pdfGoogle Data Engineering.pdf
Google Data Engineering.pdfavenkatram
 
Data Engineering on GCP
Data Engineering on GCPData Engineering on GCP
Data Engineering on GCPBlibBlobb
 
Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & HadoopBlackvard
 

Ähnlich wie Big Data .. Are you ready for the next wave? (20)

Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiative
 
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisBig Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiative
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry Perspective
 
Final deck
Final deckFinal deck
Final deck
 
Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
 
HPE Hadoop Solutions - From use cases to proposal
HPE Hadoop Solutions - From use cases to proposalHPE Hadoop Solutions - From use cases to proposal
HPE Hadoop Solutions - From use cases to proposal
 
Big Data , Big Problem?
Big Data , Big Problem?Big Data , Big Problem?
Big Data , Big Problem?
 
Big data analysis concepts and references
Big data analysis concepts and referencesBig data analysis concepts and references
Big data analysis concepts and references
 
AWS Summit 2011: Big Data Analytics in the AWS cloud
AWS Summit 2011: Big Data Analytics in the AWS cloudAWS Summit 2011: Big Data Analytics in the AWS cloud
AWS Summit 2011: Big Data Analytics in the AWS cloud
 
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
 
Hadoop Developer
Hadoop DeveloperHadoop Developer
Hadoop Developer
 
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and FacebookHow Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
 
Google Data Engineering.pdf
Google Data Engineering.pdfGoogle Data Engineering.pdf
Google Data Engineering.pdf
 
Data Engineering on GCP
Data Engineering on GCPData Engineering on GCP
Data Engineering on GCP
 
The future of Big Data tooling
The future of Big Data toolingThe future of Big Data tooling
The future of Big Data tooling
 
Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & Hadoop
 

Kürzlich hochgeladen

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 

Kürzlich hochgeladen (20)

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 

Big Data .. Are you ready for the next wave?

  • 1. Big Data…Are you ready for the next wave? MAHMOUD SABRY
  • 2. Agenda  The Next Wave of Computing  Information, Data & Knowledge  Data warehouse vs. Database  Row-oriented vs. Column-oriented DB  Big Data Era  HP-Vertica  HP-Vertica Bulk Upload Example
  • 3. The next wave of computing Cloud + Big Data Analytics, Mobile & Social leads to what IBM calls “The Era of Smart.”
  • 4. Data, Information & Knowledge (1/3)  Data are raw facts, and have no meaning on their own  Example  Yes, Yes, No, Yes, No, Yes, No, Yes
  • 5. Data, Information & Knowledge (2/3)  Information is data that have meaning in a context  Example Yes, Yes, No, Yes, No, Yes, No, Yes, No, Yes, YesRaw Data Context Responses to the market research question – “Would you buy brand x at price y?” Information ??? Processing
  • 6. Data, Information & Knowledge (3/3)  Knowledge  understanding of someone or something  acquired through experience or learning  Example Based on last collected information, A Marketing Manager could use this information to decide whether or not to raise or lower price.
  • 7. Data Warehouse vs. Database (1/2) Feature DWH DB Data Stored It usually stores the Historical data whose accuracy is maintained over time. It mainly stores the Current data which always guaranteed to be up- to-date. Characteristic It is based on Informational Processing. It is based on Operational Processing. Function It is used for long-term informational requirements and decision support. It is used for day-to-day operations. Focus The focus is on “Information OUT” The focus is on “Data IN” Number of records accessed A bunch of millions of records. A few tens of records.
  • 8. Data Warehouse vs. Database (2/2) Feature DWH DB Access It mostly use the read access for the stored data. The most frequent type of access type is read/write. Orientation Based on Analysis. Based on Transaction. Common users Analysts DBAs KPI Query throughput Transaction throughput Unit of work Complex queries Short and simple transactions
  • 9. Row-based vs. Column-based DB (1/2) Feature Row-oriented DB Column-oriented DB Mechanism stores table records in sequence or rows. stores table records in sequence or columns. Retrieval of a few columns Slower Faster Retrieval of a single record Faster Slower
  • 10. Row-based vs. Column-based DB (2/2) Feature Row-oriented DB Column-oriented DB Insertion/Updating of single new record Faster Slower Aggregation of Single Column, e.g. sum(price) Slower Faster Compression Higher. As stores similar data together It depends! Examples Oracle OLTP MS SQL Server up to 2008 HP-Vertica Sybase IQ MS SQL Server 2012
  • 11. Big Data Era – Famous quote From the dawn of civilization until 2003, humankind generated five exabytes of data. Now we produce five exabytes every two days…and the pace is accelerating. Eric Schmidt, Executive Chairman, Google
  • 12. Big Data – Definition  The basic idea behind the phrase 'Big Data' is that everything we do is increasingly leaving a digital trace (or data), which we (and others) can use and analyze.  Big Data therefore refers to our ability to make use of the ever-increasing volumes of data.
  • 13. Big Data – Characteristics (6 V’s)  Volume  Terabytes, Distributed, Tables & Files  Velocity  Real-time processing/Streams  Variety  Structured, Semi-structured & Un-structured data  Value  Correlations & Statistical analysis  Veracity  Accountability, Trust, Origin & Reputation  Variability  Changing Data/model
  • 14. Big Data – What is Hadoop ?  Apache™ Hadoop® is an open source platform that enables distributed processing of large data sets across clusters of servers.  It is designed to scale up from a single server to thousands of machines, with very high degree of fault tolerance.
  • 15. Big Data – HL Archit. Of Hadoop A programming model for large scale data processing. A distributed file-system that stores data on Clustered machines. Schedules map or reduce jobs to task trackers with an awareness of the data location. keeps the directory tree of all files in the file system, and tracks where across the cluster the file data is kept.
  • 16. Big Data – MapReduce, How it works ? Map: Filter & sort of data Reduce: Summarize & aggregate of data
  • 17. HP-Vertica  An analytic database management platform. Founded in 2005.  Helps you monetize all of your data in real-time and at massive scale.  Queries run 50-1,000x faster.  Store 10-30x more data per server.  Openness and simplicity (use any BI/ETL tools, Hadoop, etc.)  Suited for “Structured” data only.  HP Vertica Flex Zone: built on the HP Vertica core, enables load and analyze structured and semi-structured data, such as social media, sensor, log files, and machine data.
  • 18. HP-Vertica Features (1/6)  Column Orientation  Vertica organizes data for each column  Each column is stored separately on disk  Only reads the columns needed to answer the query  Significant reduction of disk I/O
  • 19. HP-Vertica Features (2/6)  Advanced Compression  Vertica replaces slower disk I/O with faster CPU cycles to encode data elements into a more compact form and query them.  Vertica’s innovative query engine operates directly on compressed data, meaning that it can actually require fewer CPU operations to process the compressed version of a table.
  • 20. HP-Vertica Features (3/6)  High Availability  RAID-like functionality within database  If a node fails, a copy is available on one of the surviving nodes  Always-on Queries and Loads  System continues to load and query when nodes are down  Automatically recovers missing data by querying other nodes
  • 21. HP-Vertica Features (4/6)  Automatic Database Designer (DBD)  Recommends a physical DB design that provides the best performance for the user's workload  Analyzes your logical schema, sample data, and sample queries  Minimizes DBA tuning  Run anytime for additional optimization, without stopping the database
  • 22. HP-Vertica Features (5/6)  Massively Parallel Processing (MPP)  Parallel design leverages data projections to enable distributed storage and workload  Active redundancy  Automatic replication, failover and recovery
  • 23. HP-Vertica Features (6/6)  Native SQL and Application Integration  Standard SQL Interface  Simple integration with Hadoop and existing BI and ETL tools  Supports SQL, ODBC, JDBC and majority ETL and BI reporting products
  • 24. HP-Vertica Bulk Upload Example (1/3)  HP Vertica 7.0, Single instance on VMware workstation v.11  Installed on Windows 8 machine HP-EliteBook 8440p, 8GB RAM  HP Vertica virtual machine has below specs:  Memory: 4GB  Disk: 16 GB  Processor: Single Processor Intel(R) Core(TM) i7 CPU M 620 @ 2.67GHz
  • 25. HP-Vertica Bulk Upload Example (2/3)  Scenario Details  Upload a table with “|” delimited columns from a local text file  Table file size is: 1.30099 GB  No. of rows is: 10252864  No. of columns Is: 21  Table is partitioned into 10 partitions by using “customer_age“ column  Only one projection (the default “<table_name>_Super” )
  • 26. HP-Vertica Bulk Upload Example (3/3)  Scenario Actions and Results  Upload by using “DIRECT” option to load directly to disk (ROS: Read Optimized Storage)  COPY public.myTable from '/opt/vertica/examples/VMart_Schema/myTable.tbl' DIRECT;  Time taken is more than 4 minutes  Upload without using “DIRECT” option to load to memory(WOS: Write Optimized Storage)  COPY public.myTable from '/opt/vertica/examples/VMart_Schema/myTable.tbl';  Time taken is about 2.3 minutes

Hinweis der Redaktion

  1. Source: http://www.slideshare.net/ibmcanada/ibm-canada-cloud-and-softlayer SmarterBiz 1980 2000 Mainframe 2020 PC Internet Web 2.0 1 2 3 Computing Platform Generations Web 3.0Eras of IT Deployment (Illustrative) Cloud + Analytics, Mobile & Social leads to what IBM calls “The Era of Smart.” Mobile & Wireless “Things” Sensors, Phones, Tablets, etc. Cloud Social Analytics Big Data Data Collected and Stored Mobile The Era Of “Smart” IBM HorizonWatch: Technology Trends To Watch In 2014 Cognitive Computing “Watson”
  2. Source: https://www.google.com.sa/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0CB4QFjAA&url=http%3A%2F%2Fwww.teach-ict.com%2Fas_a2_ict_new%2Focr%2FAS_G061%2F311_data_info_knowledge%2Fdata_info_knowledge%2Ftheory%2Fppt1%2Fkid1.ppt&ei=MCxsVdn8OsSuU9m1gIAD&usg=AFQjCNHPERF3ZkytlFMNEI65wcYjkw9ZxQ&sig2=dJR8akaNXYk2E0yIoSIvZA
  3. Source: http://www.durofy.com/differences-between-database-and-datawarehouse/
  4. Source: http://www.durofy.com/differences-between-database-and-datawarehouse/
  5. Source: http://www.timestored.com/time-series-data/what-is-a-column-oriented-database http://saphanatutorial.com/column-data-storage-and-row-data-storage-sap-hana/
  6. Source: http://www.dbbest.com/blog/column-oriented-database-technologies/
  7. Source: http://www.slideshare.net/BernardMarr/140228-big-data-slide-share?from_action=save
  8. Source: http://www.uazone.org/demch/worksinprogress/sne-2013-02-techreport-bdaf-draft02.pdf
  9. Source: http://www-01.ibm.com/software/data/infosphere/hadoop/
  10. Source: http://opensource.com/life/14/8/intro-apache-hadoop-big-data
  11. Source: http://blog.sqlauthority.com/2013/10/09/big-data-buzz-words-what-is-mapreduce-day-7-of-21/
  12. Source: http://www.vertica.com/about
  13. Source: http://www.vertica.com/wp-content/uploads/2011/01/ImprovingEDWsWithVertica.pdf
  14. Source:
  15. Source:
  16. Source:
  17. Source: