SlideShare ist ein Scribd-Unternehmen logo
1 von 5
Downloaden Sie, um offline zu lesen
BIG DATA ANALYTICS – ARCHITECTURES & OPPORTUNITIES
Gus Verzosa
08/03/2013
OBJECTIVES
The purpose of this paper is to provide an overview of:
1) The various emerging architectures for big data analytics.
2) The market opportunities for big data analytics.
ARCHITECTURES
The emerging big data analytics architectures utilize a combined front-
end, back-end, and cross-architecture set of components. Please see
the diagram below:
The back-end components are (collectively referred to as the analytic
sandbox):
1) The high-performance hardware – this is typically a multi-node
cluster, a large memory unit with back-up disk, or a combination of
both. Parallel-processing hardware typically depends on multi-
node clusters while in-memory hardware utilizes large memory
units.
2) The high-performance database – this is a database tightly
coupled to the high-performance hardware appliance. It normally
accommodates structured (e.g. from relational data tables, from
data warehouses), semi-structured (e.g. text data files, XML),
quasi-structured (e.g. clickstream), and unstructured data (text
documents, PDF, images, video).
The front-end components are:
1) The data exploration tool – for the Discovery phase of the Data
Analytics Lifecycle.
2) The data preparation tool – for the Data Preparation phase of the
Data Analytics Lifecycle.
3) The modelling tool – for the Model Planning and Model Building
phases of the Data Analytics Lifecycle.
4) The workflow tool – for organizing the functions into a structured
process during the Operationalization phase of the Data Analytics
Lifecycle.
The cross-architecture component, which moves between the front and
back is the In-database analytics component – comprising procedures
that run within the database itself that speed up the analysis of large
volumes of data.
Both the in-database analytics and the back-end high performance
database and hardware (utilizing parallel processing, in-memory
processing, or a combination of both) facilitate the capability to analyse
with a high level of complexity large volumes of various types/formats of
data quickly. This capability goes beyond traditional BI-EDW’s and is
required by big data analytics.
The architectural components above have been implemented by key
vendors as follows:
(continued next page)
OPPORTUNITIES
The big data opportunities in the various industries are as follows:
(continued next page)
COPYRIGHT NOTICE:
This work is copyright. Apart from any use permitted under the Australian Copyright
Act 1968, no part may be reproduced by any process, nor may any other exclusive
right be exercised, without the permission of Augusto Verzosa of 42 Lampard Circuit,
Bruce ACT 2617. This work was written by Augusto Verzosa on 8 March 2013.
This work is internationally protected by the Universal Copyright Convention, the
Berne Convention, and the WIPO Copyright Treaty.

Weitere ähnliche Inhalte

Andere mochten auch

Healthy foods activity
Healthy foods activityHealthy foods activity
Healthy foods activityvandrade
 
Constitucionalidad de-los-derechos-fundamentales-e-internet
Constitucionalidad de-los-derechos-fundamentales-e-internetConstitucionalidad de-los-derechos-fundamentales-e-internet
Constitucionalidad de-los-derechos-fundamentales-e-internetGian Carlos Estrada Cervantes
 
Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618
Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618
Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618PT. Siwali Swantika
 
Fichadelproyecto virna
Fichadelproyecto virnaFichadelproyecto virna
Fichadelproyecto virnavandrade
 
Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...
Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...
Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...PT. Siwali Swantika
 
Addiction and substance abuse
Addiction and substance abuseAddiction and substance abuse
Addiction and substance abuseSara Mofti
 
Mr_azer kelas 11 matematika wajib 2016
Mr_azer kelas 11 matematika wajib 2016Mr_azer kelas 11 matematika wajib 2016
Mr_azer kelas 11 matematika wajib 2016Reza Radhitya
 
Taller Convivencia escolar
Taller Convivencia escolarTaller Convivencia escolar
Taller Convivencia escolarmariasilva9104
 
Trabalho Servidor FTP
Trabalho Servidor FTPTrabalho Servidor FTP
Trabalho Servidor FTPJunior Cesar
 
Gam immunologia di fond-oinfondo_maggio 2015_professional
Gam immunologia   di fond-oinfondo_maggio 2015_professionalGam immunologia   di fond-oinfondo_maggio 2015_professional
Gam immunologia di fond-oinfondo_maggio 2015_professionalroberto parizzi
 
Daniel Lee Whelan CV 2015
Daniel Lee Whelan CV 2015Daniel Lee Whelan CV 2015
Daniel Lee Whelan CV 2015Daniel Whelan
 

Andere mochten auch (18)

Healthy foods activity
Healthy foods activityHealthy foods activity
Healthy foods activity
 
Constitucionalidad de-los-derechos-fundamentales-e-internet
Constitucionalidad de-los-derechos-fundamentales-e-internetConstitucionalidad de-los-derechos-fundamentales-e-internet
Constitucionalidad de-los-derechos-fundamentales-e-internet
 
Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618
Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618
Datasheet Fluke 742A. Hubungi PT. Siwali Swantika 021-45850618
 
Ortografía básica
Ortografía básicaOrtografía básica
Ortografía básica
 
Fichadelproyecto virna
Fichadelproyecto virnaFichadelproyecto virna
Fichadelproyecto virna
 
Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...
Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...
Datasheet Fluke Electrical Power Standard. Hubungi PT. Siwali Swantika 021-45...
 
Addiction and substance abuse
Addiction and substance abuseAddiction and substance abuse
Addiction and substance abuse
 
El sapo distraido
El sapo distraidoEl sapo distraido
El sapo distraido
 
Mr_azer kelas 11 matematika wajib 2016
Mr_azer kelas 11 matematika wajib 2016Mr_azer kelas 11 matematika wajib 2016
Mr_azer kelas 11 matematika wajib 2016
 
Taller Convivencia escolar
Taller Convivencia escolarTaller Convivencia escolar
Taller Convivencia escolar
 
Kenzo tange y alvar aalto
Kenzo tange y alvar aaltoKenzo tange y alvar aalto
Kenzo tange y alvar aalto
 
ColorCuboid_2015
ColorCuboid_2015ColorCuboid_2015
ColorCuboid_2015
 
Trabalho Servidor FTP
Trabalho Servidor FTPTrabalho Servidor FTP
Trabalho Servidor FTP
 
Are Your Students Prepared for Distance Learning?
Are Your Students Prepared for Distance Learning? Are Your Students Prepared for Distance Learning?
Are Your Students Prepared for Distance Learning?
 
Gam immunologia di fond-oinfondo_maggio 2015_professional
Gam immunologia   di fond-oinfondo_maggio 2015_professionalGam immunologia   di fond-oinfondo_maggio 2015_professional
Gam immunologia di fond-oinfondo_maggio 2015_professional
 
Google
GoogleGoogle
Google
 
Daniel Lee Whelan CV 2015
Daniel Lee Whelan CV 2015Daniel Lee Whelan CV 2015
Daniel Lee Whelan CV 2015
 
Visit Visa 2016 Muhammad Younas Kashif
Visit Visa 2016 Muhammad Younas KashifVisit Visa 2016 Muhammad Younas Kashif
Visit Visa 2016 Muhammad Younas Kashif
 

Ähnlich wie BIG_DATA_ANALYTICS_PAPER_FINAL

2013 NIST Big Data Subgroups Combined Outputs
2013 NIST Big Data Subgroups Combined Outputs 2013 NIST Big Data Subgroups Combined Outputs
2013 NIST Big Data Subgroups Combined Outputs Bob Marcus
 
NIST Big Data Working Group.pdf
NIST Big Data Working Group.pdfNIST Big Data Working Group.pdf
NIST Big Data Working Group.pdfBob Marcus
 
BD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdfBD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdferamfatima43
 
Acunu Whitepaper v1
Acunu Whitepaper v1Acunu Whitepaper v1
Acunu Whitepaper v1Acunu
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataVipin Batra
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundationshktripathy
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET Journal
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET Journal
 
Approach to composition of mis
Approach to composition of misApproach to composition of mis
Approach to composition of misYatendra Paliwal
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfan
 
Key aspects of big data storage and its architecture
Key aspects of big data storage and its architectureKey aspects of big data storage and its architecture
Key aspects of big data storage and its architectureRahul Chaturvedi
 
History Of Database Technology
History Of Database TechnologyHistory Of Database Technology
History Of Database TechnologyJacqueline Thomas
 
Big Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A ReviewBig Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A ReviewIRJET Journal
 
Design of file system architecture with cluster
Design of file system architecture with clusterDesign of file system architecture with cluster
Design of file system architecture with clustereSAT Publishing House
 
Introduction-to-Databases.pptx
Introduction-to-Databases.pptxIntroduction-to-Databases.pptx
Introduction-to-Databases.pptxIvanDarrylLopez
 
An Overview of Security in Distributed Database Management System
An Overview of Security in Distributed Database Management SystemAn Overview of Security in Distributed Database Management System
An Overview of Security in Distributed Database Management SystemIJSRD
 
Efficient usage of memory management in big data using “anti caching”
Efficient usage of memory management in big data using “anti caching”Efficient usage of memory management in big data using “anti caching”
Efficient usage of memory management in big data using “anti caching”eSAT Journals
 
Data Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural FrameworkData Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural FrameworkDr. Sunil Kr. Pandey
 

Ähnlich wie BIG_DATA_ANALYTICS_PAPER_FINAL (20)

2013 NIST Big Data Subgroups Combined Outputs
2013 NIST Big Data Subgroups Combined Outputs 2013 NIST Big Data Subgroups Combined Outputs
2013 NIST Big Data Subgroups Combined Outputs
 
NIST Big Data Working Group.pdf
NIST Big Data Working Group.pdfNIST Big Data Working Group.pdf
NIST Big Data Working Group.pdf
 
BD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdfBD_Architecture and Charateristics.pptx.pdf
BD_Architecture and Charateristics.pptx.pdf
 
Acunu Whitepaper v1
Acunu Whitepaper v1Acunu Whitepaper v1
Acunu Whitepaper v1
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
 
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
IRJET- Deduplication Detection for Similarity in Document Analysis Via Vector...
 
Approach to composition of mis
Approach to composition of misApproach to composition of mis
Approach to composition of mis
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -
 
Key aspects of big data storage and its architecture
Key aspects of big data storage and its architectureKey aspects of big data storage and its architecture
Key aspects of big data storage and its architecture
 
History Of Database Technology
History Of Database TechnologyHistory Of Database Technology
History Of Database Technology
 
Big Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A ReviewBig Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A Review
 
Design of file system architecture with cluster
Design of file system architecture with clusterDesign of file system architecture with cluster
Design of file system architecture with cluster
 
Introduction-to-Databases.pptx
Introduction-to-Databases.pptxIntroduction-to-Databases.pptx
Introduction-to-Databases.pptx
 
An Overview of Security in Distributed Database Management System
An Overview of Security in Distributed Database Management SystemAn Overview of Security in Distributed Database Management System
An Overview of Security in Distributed Database Management System
 
Efficient usage of memory management in big data using “anti caching”
Efficient usage of memory management in big data using “anti caching”Efficient usage of memory management in big data using “anti caching”
Efficient usage of memory management in big data using “anti caching”
 
Data Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural FrameworkData Warehousing & Basic Architectural Framework
Data Warehousing & Basic Architectural Framework
 
J0212065068
J0212065068J0212065068
J0212065068
 
[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh
[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh
[IJCT-V3I2P32] Authors: Amarbir Singh, Palwinder Singh
 

BIG_DATA_ANALYTICS_PAPER_FINAL

  • 1. BIG DATA ANALYTICS – ARCHITECTURES & OPPORTUNITIES Gus Verzosa 08/03/2013 OBJECTIVES The purpose of this paper is to provide an overview of: 1) The various emerging architectures for big data analytics. 2) The market opportunities for big data analytics. ARCHITECTURES The emerging big data analytics architectures utilize a combined front- end, back-end, and cross-architecture set of components. Please see the diagram below:
  • 2. The back-end components are (collectively referred to as the analytic sandbox): 1) The high-performance hardware – this is typically a multi-node cluster, a large memory unit with back-up disk, or a combination of both. Parallel-processing hardware typically depends on multi- node clusters while in-memory hardware utilizes large memory units. 2) The high-performance database – this is a database tightly coupled to the high-performance hardware appliance. It normally accommodates structured (e.g. from relational data tables, from data warehouses), semi-structured (e.g. text data files, XML), quasi-structured (e.g. clickstream), and unstructured data (text documents, PDF, images, video). The front-end components are: 1) The data exploration tool – for the Discovery phase of the Data Analytics Lifecycle. 2) The data preparation tool – for the Data Preparation phase of the Data Analytics Lifecycle. 3) The modelling tool – for the Model Planning and Model Building phases of the Data Analytics Lifecycle. 4) The workflow tool – for organizing the functions into a structured process during the Operationalization phase of the Data Analytics Lifecycle. The cross-architecture component, which moves between the front and back is the In-database analytics component – comprising procedures that run within the database itself that speed up the analysis of large volumes of data. Both the in-database analytics and the back-end high performance database and hardware (utilizing parallel processing, in-memory processing, or a combination of both) facilitate the capability to analyse with a high level of complexity large volumes of various types/formats of data quickly. This capability goes beyond traditional BI-EDW’s and is required by big data analytics.
  • 3. The architectural components above have been implemented by key vendors as follows: (continued next page)
  • 4. OPPORTUNITIES The big data opportunities in the various industries are as follows: (continued next page)
  • 5. COPYRIGHT NOTICE: This work is copyright. Apart from any use permitted under the Australian Copyright Act 1968, no part may be reproduced by any process, nor may any other exclusive right be exercised, without the permission of Augusto Verzosa of 42 Lampard Circuit, Bruce ACT 2617. This work was written by Augusto Verzosa on 8 March 2013. This work is internationally protected by the Universal Copyright Convention, the Berne Convention, and the WIPO Copyright Treaty.