SlideShare ist ein Scribd-Unternehmen logo
1 von 8
Downloaden Sie, um offline zu lesen
Big Data and Enterprise Data: Bridging Two
Worlds with Oracle Data Integrator 12c
(ODI12c)
O R A C L E W H I T E P A P E R | J A N U A R Y 2 0 1 5
Big Data and Enterprise Data: Bridging Two Worlds with Oracle Data Integrator 12c (ODI12c)
The advantage of Big Data comes into play when you have the ability to correlate Big Data with your existing
enterprise data. There’s an implicit product requirement here in consolidating these various architecture principles
into a single integration solution. The advantages of a single solution allow you to address not only the complexities
of mapping, accessing, and loading Big Data but also combining it with your enterprise data – and this correlation
requires integrating across mixed application environments. The correlation is key to taking full advantage of Big
Data and requires a single unified tool that can straddle both environments.
Introduction
Volume, Velocity, Variety - Big Data is all the vogue. Why? Enterprises know that there is a treasure
trove of information they should be tapping into. This information is predominantly less structured data
consisting of weblogs, social media, email, sensors, and photographs that can be mined for useful
information. Whether it is healthcare, financial, manufacturing, government or retail, Big Data
presents a big opportunity. A recent McKinsey study (see figure 1) illustrates that the US retail industry
could realize a 60% increase in operating margins – just by fully exploiting Big Data. And this is just the
beginning. Big Data turns traditional information architecture on its head, putting into question
commonly accepted notions of where and how data should be aggregated processed, analyzed, and
stored. This is where low cost options of Hadoop and NoSQL come in – new technologies which solve
new problems for managing unstructured data. But one shouldn’t forget what’s already running in
today’s IT. Today’s Business Analytics, Data Warehouses, Business Applications (ERP, CRM, SCM,
HCM), and even many social, mobile, cloud applications still rely almost exclusively on structured data.
This dilemma is what today’s IT leaders are up against: what are the best ways to bridge enterprise
data with Big Data through a common set of unified data integration tools? And what are the best
strategies for dealing with the complexities of these two unique worlds?
FIGURE 1: REAL WORLD USES CASES FOR BIG DATA
New Data What is Possible Why
Health Care
Improve Quality And
Efficiency
Practitioner’s Notes;
Machine Statistics
Best Practices; Reduced Hospitalization Increase industry value by
$300Bn per year
Retail
One Size Fits All
Marketing
Weblogs; Click Streams Micro-Segmentation; Recommendations Increase net margin by
60%
Banking
Fraud Detection; Risk
Analysis
Weblogs, Transaction , Systems,
Fraud Reports
Semantic Discovery; Pattern Detection Billions of dollars lost
annually in bank fraud
Location Based
Services
Based On Home Zip
Personal Location Data Geo- Advertising, Traffic, Local
Search
Increase Revenue For
Providers By $100B+
BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C)
Code
Utilities
Resilient And
Adaptable Grid
Smart Meter Reading, Call Center
Data
Real Time And Predictive
Utilization Analysis
Energy use expected to
increase by 22% by 2030.
Harnessing the Power of Big Data – Beyond the SQL World
Big data is having a tremendous impact on the data integration solution space. This has to do primarily with the fact
that Big Data poses new questions for the best ways to process volumes and varieties of data at higher speeds and
at faster velocities while keeping operating costs to a minimum. But before we look at the impact in more detail, let’s
first look at the current state of data integration. Data integration in a ‘traditional sense’ solves the issues of bulk data
movement, replication, synchronization, virtualization, transformation, data quality, and data services.
These capabilities serve as a key technology component for moving data between Data Warehouses, Business
Analytics, Master Data Management, Enterprise Applications, and Custom Applications. But now there’s more to be
moved. Much, much more. In fact, it’s not just the scale, but it’s the complexity of new technologies. Data Integration
tools have evolved to support integration to and interoperation with technology offerings like Hadoop Distributed File
System (“HDFS”) using innovative techniques like MapReduce and NoSQL. Oozie, Sqoop and new access
methodologies like MapReduce, Pig and HiveQL have all merged the realms of Data Integration into the Big Data
world. And in many cases to effectively leverage the power of this technology effectively it has to run natively in the
technology. Much like E-LT runs natively on relational database systems, so too data integration technology needs
to run its transformation, loading natively in Big Data environments.
Big Data Technologies – What You Need to Know
» Hadoop - is an open source software project that supports data-intensive distributed applications. It enables
applications to work with thousands of computational independent computers and petabytes of data.
» MapReduce - is a software framework that allows developers to write programs that process massive amounts of
unstructured data in parallel across a distributed cluster of processors or stand- alone computers
» Hive – is a query language similar to standard SQL statements. The query language is called Hive Query
Language (HQL). Hive is executed on Hadoop clusters based on MapReduce operations.
» NoSQL – is a term used for a wide variety of database management systems whose one thing in common is that
they don’t use SQL. Oracle NoSQL Database, for example, is a key value database, where data is stored under
and accessed by a unique key, and variable schemas are easily supported.
» Hadoop Distributed File System (HDFS) - is a distributed, scalable, and portable file system written in Java for
the Hadoop framework. In order to scale a Hadoop cluster to hundreds and thousands of nodes, HDFS is critical.
» MongoDB – is a NoSQL open source Database designed for scale, flexible data aggregation and to store files of
any size. It has rich querying, high availability and full indexing support and is fast being adopted by many
businesses.
» HBase – is an Apache open source, distributed, versioned project that provides real - time read/write access to
Big Data atop clusters of commodity hardware. Apache HBase provides these capabilities on top of Hadoop
and HDFS.
» Sqoop – is the tool designed for efficiently transferring bulk data between the Apache distribution of Hadoop and
structured relation Data Bases. (SQL to Hadoop = Sqoop)
» Pig – is a programming platform to write Map Reduce programs to query large scale data. General Pig scripts are
written in the Pig Latin language.
BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C)
» Oozie –is a workflow scheduler system to manage Apache Hadoop jobs. Oozie is integrated with the rest of the
Hadoop stack supporting several types of Hadoop jobs such as Java map-reduce, Streaming map-reduce, Pig,
Hive, and Sqoop.
Challenges Posed By Big Data:
» A shift from familiar Data Integration tools to new technologies to deal with Big Data problems will impact
productivity of the organization and increase costs to invest in new skills and tools.
» Performance constraints of the Data Integration tools to deal with larger and more complex data within ever
shortening time windows and real time analytics.
» Costly network traffic in moving data within and outside the business and storage areas, and lastly
» Redundancy of existing costly IT investment which become obsolete if they are not “Big Data Enabled.”
Oracle’s approach to Big Data solves each of these concerns. Oracle Data Integrator 12c (ODI12c) addresses
productivity and skill requirements without leaving familiar programming and IT environments. Oracle Data
Integrator Application Adapter for Hadoop helps bring the data together while Oracle’s Integrated Platform delivers
optimal Big Data flow ensuring not just data size and complexity, but also data speed and delivers Fast Data.
Best Technology Approaches for Integrating Big Data
Integrated Design Tools: Improve Productivity
The advantage of Big Data comes into play when you have the ability to correlate Big Data with your existing
enterprise data. There’s an implicit product requirement here in consolidating these various architecture principles
into a single integration solution. The advantages of a single solution allow you to address not only the complexities
of mapping, accessing, and loading Big Data but also combining it with your enterprise data – and this correlation
requires integrating across mixed application environments. The correlation is key to taking full advantage of Big
Data and requires a single unified tool that can straddle both environments.
In addition, Big Data sources consist of many different types and in many different forms. How can anyone be sure
of the quality of that data? In the Big Data scenario, Data Quality is important because of the multitude of data
sources. Multiple data sources make it difficult to trust the underlying data. Being able to quickly and easily identify
and resolve any data discrepancies, missing values, etc in an automated fashion is beneficial to the applications and
systems that use this information. Oracle Enterprise Data Quality integrated with Oracle Data Integrator 12c
(ODI12c) enables data analysts to investigate, identify and resolve data quality issues by discovering and analyzing
anomalies.
Oracle Data Integrator 12c (ODI12c): Loading and Transforming Big Data
Oracle Data Integrator 12c (ODI12c) leverages unified tooling for both Big Data and enterprise data which translates
into a faster learning curve as well as seamless usability so that the data scientist or data analyst can focus on
integration versus usability. Flow based declarative designs in Oracle Data Integrator 12c (ODI12c) helps build
complex expressions that are easy to maintain and support. Two representations of the same ELT mappings, the
logical representation and the physical representation, provide customized working environments for business
analysts and data scientists.
BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C)
Fig. 2: Loading and Transforming Big Data using Oracle Data Integrator12c (ODI12c)
Oracle Data Integrator Application Adapters for Hadoop
As a component of Oracle Big Data Connectors, Oracle Data Integrator Application Adapter for Hadoop provides
native Hadoop integration together with Oracle Data Integrator 12c (ODI12c). Knowledge Modules can then be used
to build Hadoop metadata within Oracle Data Integrator 12c (ODI12c), load data into Hadoop, transform data within
Hadoop, and load data easily and directly into Oracle Database utilizing Oracle Loader for Hadoop. Once the data is
processed and organized on the Hadoop cluster, Oracle Data Integrator 12c (ODI12c) loads the data directly into
Oracle Database utilizing the Oracle Loader for Hadoop.
Oracle Data Integrator Application Adapter for Hadoop simplifies data loading and movement between Hadoop and
an Oracle Database through Oracle Data Integrator 12c (ODI12c)’s easy to use prebuilt interfaces. Oracle Data
Integrator Application Adapter for Hadoop simplifies data integration from Hadoop and an Oracle Database through
Oracle Data Integrator 12c (ODI12c)’s easy to use interface. By providing efficient connectivity between Oracle
Database and Hadoop, Oracle Big Data Connectors enables analysis of all data, both structured and unstructured,
in enterprise data warehouses. Additionally, enterprises that are already using a Hadoop solution integrate data from
HDFS using Oracle Big Data Connectors as a standalone software solution as well.
Integrated Platform: Simplify and Optimize
Taking all the miscellaneous technologies around Big Data – which are new to many organizations - and making
them each work with one another is challenging. Making them work together in a production- grade environment is
even more daunting. Integrated systems can help an organization radically simplify their Big Data architectures by
integrating the necessary hardware and software components to provide fast and cost-efficient access, and
mapping, to NoSQL and HDFS.
BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C)
Combined hardware and software systems can be optimized for redundancy with mirrored disks, optimized for high
availability with hot-swappable power, and optimized for scale by adding new racks with more memory and
processing power. Take it one step further and you can use these same systems to build out more elastic capacity
to meet the flexibility requirements Big Data demands.
Figure 3: Oracle’s Big Data Story
Oracle is uniquely qualified to combine everything needed to meet the Big Data challenge –including software and
hardware – into one engineered system. The Oracle Big Data Appliance is an engineered system that combines
optimized hardware with the most comprehensive software stack featuring both the Cloudera Distribution including
Apache Hadoop and specialized solutions developed by Oracle to deliver a complete, easy-to-deploy solution for
acquiring, organizing and loading Big Data into Oracle Database. It is designed to deliver extreme analytics on all
data types, with enterprise-class performance, availability, supportability and security. With Oracle Big Data
Connectors and Oracle Data Integrator 12c (ODI12c), the solution is tightly integrated with Oracle Exadata and
Oracle Database, so that data can be analyzed data with extreme performance.
Real-time Business Analytics: Extends the Value of Big Data
One of the key expectations for Big Data is to yield real-time analytics that improve business insights. But how timely
is the data is a question that will still need to be answered for both Big Data or traditional enterprise data.
While Big Data on its own has no means of applying ‘traditional’ change data capture [since there are no log files in
NoSQL or HDFS], it’s still an important requirement to implement real-time solutions in conjunction with big data.
Otherwise the speed advantage to indexing realms of Big Data will be undone by sluggish ETL processing that it’s
dependent on. Big Data can be processed at high volume with high velocity. In fact, this is the entire strategy behind
the invention of MapReduce – providing search responses of the highest quality in close to the blink of a human eye.
Combine this power with real-time solutions in replication, change data capture, synchronization, and the integration
BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C)
to business analytics tooling, and you have what amounts to the compelling advantages of real-time business
analytics.
The combination of Oracle Data Integrator and Oracle GoldenGate pack a powerful punch when it comes to
achieving true real-time business analytics. Oracle GoldenGate is an integral part of the real- time business
analytics use case to accomplish real- time data replication and capture ensuring applications have the data they
need immediately.
Conclusion
Big data continues to be the center of attention - take away the hype and there's a key takeaway. For years,
companies have been running their critical business infrastructure and building business insights based on
transactional data stored in relational databases. Beyond that critical data, however, is a potential treasure trove of
less structured data: web and call logs, social media, email, sensors, and photographs [and more] that can be mined
for useful information. Companies that are seeking ways to capitalize on the hidden potential of Big Data need to
consider data integration technologies to help bridge the gap and correlate that data across the enterprise.
Bridging the two worlds of Big Data and enterprise data means considering solutions that are complete, based on
traditional as well as emerging Hadoop technologies, and are poised for success through integrated design tools,
integrated platforms, and real-time analytics. Leveraging these best practices ensures improved productivity,
lowered TCO, IT optimization, and better business insights. Only Oracle provides the most complete, integrated, and
real-time solution data integration that helps bridge the two worlds of Big Data and enterprise data to accelerate
adoption and yield greater returns for these emerging technologies.
For more information on Oracle Data Integrator go to our website:
www.oracle.com/goto/dataintegration.
Oracle Corporation, World Headquarters
500 Oracle Parkway
Redwood Shores, CA 94065, USA
Worldwide Inquiries
Phone: +1.650.506.7000
Fax: +1.650.506.7200
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. This document is provided for information purposes only, and the
contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other
warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or
fitness for a particular purpose. We specifically disclaim any liability with respect to this document, and no contractual obligations are
formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any
means, electronic or mechanical, for any purpose, without our prior written permission.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and
are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are
trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group.0115
White Paper Title
January 2015
Author: Oracle
C O N N E C T W I T H U S
Oracle Data Integration Blog
Oracle DI Facebook Page
Oracle Data Integration
Oracle DI Home Page

Weitere ähnliche Inhalte

Was ist angesagt?

Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Denodo
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeCaserta
 
Applying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcareApplying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcarePaul Boal
 
The principles of the business data lake
The principles of the business data lakeThe principles of the business data lake
The principles of the business data lakeCapgemini
 
Data architecture for modern enterprise
Data architecture for modern enterpriseData architecture for modern enterprise
Data architecture for modern enterprisekayalvizhi kandasamy
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopDavid Yahalom
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureCaserta
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the CloudCaserta
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefitsRicky Barron
 
Hadoop Big Data Lakes Keynote
Hadoop Big Data Lakes KeynoteHadoop Big Data Lakes Keynote
Hadoop Big Data Lakes KeynoteMark van Rijmenam
 
Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data LakeCaserta
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for EveryoneCaserta
 
Big Data and Data Virtualization
Big Data and Data VirtualizationBig Data and Data Virtualization
Big Data and Data VirtualizationKenneth Peeples
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceInformation Security Awareness Group
 

Was ist angesagt? (20)

Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)
 
Combining hadoop with big data analytics
Combining hadoop with big data analyticsCombining hadoop with big data analytics
Combining hadoop with big data analytics
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
 
Applying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcareApplying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to Healthcare
 
The principles of the business data lake
The principles of the business data lakeThe principles of the business data lake
The principles of the business data lake
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
MongoDB
MongoDBMongoDB
MongoDB
 
Data architecture for modern enterprise
Data architecture for modern enterpriseData architecture for modern enterprise
Data architecture for modern enterprise
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera Hadoop
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic Architecture
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the Cloud
 
The new EDW
The new EDWThe new EDW
The new EDW
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 
Hadoop Big Data Lakes Keynote
Hadoop Big Data Lakes KeynoteHadoop Big Data Lakes Keynote
Hadoop Big Data Lakes Keynote
 
Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data Lake
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Big Data and Data Virtualization
Big Data and Data VirtualizationBig Data and Data Virtualization
Big Data and Data Virtualization
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security Alliance
 

Andere mochten auch

Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.
Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.
Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.Milomir Vojvodic
 
Sun microsystems bi infrastructure
Sun microsystems bi infrastructureSun microsystems bi infrastructure
Sun microsystems bi infrastructuresomnath8410
 
Rick Bicc Foundation Services
Rick   Bicc Foundation ServicesRick   Bicc Foundation Services
Rick Bicc Foundation Servicesdfwcug
 
Increase Marketing Effectiveness and ROI with Big Data
Increase Marketing Effectiveness and ROI with Big DataIncrease Marketing Effectiveness and ROI with Big Data
Increase Marketing Effectiveness and ROI with Big DataDemandbase
 
Oracle EPM BI Overview
Oracle EPM BI OverviewOracle EPM BI Overview
Oracle EPM BI Overviewcglylesu
 
Formulating Power BI Enterprise Strategy
Formulating Power BI Enterprise StrategyFormulating Power BI Enterprise Strategy
Formulating Power BI Enterprise StrategyTeo Lachev
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini
 

Andere mochten auch (8)

Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.
Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.
Milomir Vojvodic - Business Analytics And Big Data Partner Forum Dubai 15.11.
 
Sun microsystems bi infrastructure
Sun microsystems bi infrastructureSun microsystems bi infrastructure
Sun microsystems bi infrastructure
 
Rick Bicc Foundation Services
Rick   Bicc Foundation ServicesRick   Bicc Foundation Services
Rick Bicc Foundation Services
 
Increase Marketing Effectiveness and ROI with Big Data
Increase Marketing Effectiveness and ROI with Big DataIncrease Marketing Effectiveness and ROI with Big Data
Increase Marketing Effectiveness and ROI with Big Data
 
The Path to Enterprise IT Transformation
The Path to Enterprise IT TransformationThe Path to Enterprise IT Transformation
The Path to Enterprise IT Transformation
 
Oracle EPM BI Overview
Oracle EPM BI OverviewOracle EPM BI Overview
Oracle EPM BI Overview
 
Formulating Power BI Enterprise Strategy
Formulating Power BI Enterprise StrategyFormulating Power BI Enterprise Strategy
Formulating Power BI Enterprise Strategy
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
 

Ähnlich wie Big Data and Enterprise Data - Oracle -1663869

bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000Kartik Padmanabhan
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsJane Roberts
 
Big Data using NoSQL Technologies
Big Data using NoSQL TechnologiesBig Data using NoSQL Technologies
Big Data using NoSQL TechnologiesAmit Singh
 
Big data with hadoop
Big data with hadoopBig data with hadoop
Big data with hadoopAnusha sweety
 
Big data and apache hadoop adoption
Big data and apache hadoop adoptionBig data and apache hadoop adoption
Big data and apache hadoop adoptionfaizrashid1995
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overviewNitesh Ghosh
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptalmaraniabwmalk
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsRackspace
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsFredReynolds2
 
The book of elephant tattoo
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattooMohamed Magdy
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache SoftwareBob Marcus
 
SQL vs NoSQL: Big Data Adoption & Success in the Enterprise
SQL vs NoSQL: Big Data Adoption & Success in the EnterpriseSQL vs NoSQL: Big Data Adoption & Success in the Enterprise
SQL vs NoSQL: Big Data Adoption & Success in the EnterpriseAnita Luthra
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond Rajesh Kumar
 
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...Kai Wähner
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessAjay Ohri
 

Ähnlich wie Big Data and Enterprise Data - Oracle -1663869 (20)

bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
 
Hadoop
HadoopHadoop
Hadoop
 
Big Data
Big DataBig Data
Big Data
 
Big Data using NoSQL Technologies
Big Data using NoSQL TechnologiesBig Data using NoSQL Technologies
Big Data using NoSQL Technologies
 
Big Data
Big DataBig Data
Big Data
 
Big data with hadoop
Big data with hadoopBig data with hadoop
Big data with hadoop
 
Big data and apache hadoop adoption
Big data and apache hadoop adoptionBig data and apache hadoop adoption
Big data and apache hadoop adoption
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High Expectations
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
The book of elephant tattoo
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattoo
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache Software
 
Big Data & Hadoop
Big Data & HadoopBig Data & Hadoop
Big Data & Hadoop
 
SQL vs NoSQL: Big Data Adoption & Success in the Enterprise
SQL vs NoSQL: Big Data Adoption & Success in the EnterpriseSQL vs NoSQL: Big Data Adoption & Success in the Enterprise
SQL vs NoSQL: Big Data Adoption & Success in the Enterprise
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
 
Hadoop Overview
Hadoop OverviewHadoop Overview
Hadoop Overview
 
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 

Mehr von Edgar Alejandro Villegas

What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016Edgar Alejandro Villegas
 
The Four Pillars of Analytics Technology Whitepaper
The Four Pillars of Analytics Technology WhitepaperThe Four Pillars of Analytics Technology Whitepaper
The Four Pillars of Analytics Technology WhitepaperEdgar Alejandro Villegas
 
SQL in Hadoop To Boldly Go Where no Data Warehouse Has Gone Before
SQL in Hadoop  To Boldly Go Where no Data Warehouse Has Gone BeforeSQL in Hadoop  To Boldly Go Where no Data Warehouse Has Gone Before
SQL in Hadoop To Boldly Go Where no Data Warehouse Has Gone BeforeEdgar Alejandro Villegas
 
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343Edgar Alejandro Villegas
 
Best Practices for Oracle Exadata and the Oracle Optimizer
Best Practices for Oracle Exadata and the Oracle OptimizerBest Practices for Oracle Exadata and the Oracle Optimizer
Best Practices for Oracle Exadata and the Oracle OptimizerEdgar Alejandro Villegas
 
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...Edgar Alejandro Villegas
 
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slides
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slidesFast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slides
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slidesEdgar Alejandro Villegas
 
BITGLASS - DATA BREACH DISCOVERY DATASHEET
BITGLASS - DATA BREACH DISCOVERY DATASHEETBITGLASS - DATA BREACH DISCOVERY DATASHEET
BITGLASS - DATA BREACH DISCOVERY DATASHEETEdgar Alejandro Villegas
 
Four Pillars of Business Analytics - e-book - Actuate
Four Pillars of Business Analytics - e-book - ActuateFour Pillars of Business Analytics - e-book - Actuate
Four Pillars of Business Analytics - e-book - ActuateEdgar Alejandro Villegas
 

Mehr von Edgar Alejandro Villegas (20)

What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016What's New in Predictive Analytics IBM SPSS - Apr 2016
What's New in Predictive Analytics IBM SPSS - Apr 2016
 
Oracle big data discovery 994294
Oracle big data discovery   994294Oracle big data discovery   994294
Oracle big data discovery 994294
 
Actian Ingres10.2 Datasheet
Actian Ingres10.2 DatasheetActian Ingres10.2 Datasheet
Actian Ingres10.2 Datasheet
 
Actian Matrix Datasheet
Actian Matrix DatasheetActian Matrix Datasheet
Actian Matrix Datasheet
 
Actian Matrix Whitepaper
 Actian Matrix Whitepaper Actian Matrix Whitepaper
Actian Matrix Whitepaper
 
Actian Vector Whitepaper
 Actian Vector Whitepaper Actian Vector Whitepaper
Actian Vector Whitepaper
 
Actian DataFlow Whitepaper
Actian DataFlow WhitepaperActian DataFlow Whitepaper
Actian DataFlow Whitepaper
 
The Four Pillars of Analytics Technology Whitepaper
The Four Pillars of Analytics Technology WhitepaperThe Four Pillars of Analytics Technology Whitepaper
The Four Pillars of Analytics Technology Whitepaper
 
SQL in Hadoop To Boldly Go Where no Data Warehouse Has Gone Before
SQL in Hadoop  To Boldly Go Where no Data Warehouse Has Gone BeforeSQL in Hadoop  To Boldly Go Where no Data Warehouse Has Gone Before
SQL in Hadoop To Boldly Go Where no Data Warehouse Has Gone Before
 
Realtime analytics with_hadoop
Realtime analytics with_hadoopRealtime analytics with_hadoop
Realtime analytics with_hadoop
 
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
Big Data SurVey - IOUG - 2013 - 594292
Big Data SurVey - IOUG - 2013 - 594292Big Data SurVey - IOUG - 2013 - 594292
Big Data SurVey - IOUG - 2013 - 594292
 
Best Practices for Oracle Exadata and the Oracle Optimizer
Best Practices for Oracle Exadata and the Oracle OptimizerBest Practices for Oracle Exadata and the Oracle Optimizer
Best Practices for Oracle Exadata and the Oracle Optimizer
 
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...Best Practices –  Extreme Performance with Data Warehousing  on Oracle Databa...
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
 
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slides
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slidesFast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slides
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slides
 
BITGLASS - DATA BREACH DISCOVERY DATASHEET
BITGLASS - DATA BREACH DISCOVERY DATASHEETBITGLASS - DATA BREACH DISCOVERY DATASHEET
BITGLASS - DATA BREACH DISCOVERY DATASHEET
 
Four Pillars of Business Analytics - e-book - Actuate
Four Pillars of Business Analytics - e-book - ActuateFour Pillars of Business Analytics - e-book - Actuate
Four Pillars of Business Analytics - e-book - Actuate
 
Sas hpa-va-bda-exadata-2389280
Sas hpa-va-bda-exadata-2389280Sas hpa-va-bda-exadata-2389280
Sas hpa-va-bda-exadata-2389280
 
Splice machine-bloor-webinar-data-lakes
Splice machine-bloor-webinar-data-lakesSplice machine-bloor-webinar-data-lakes
Splice machine-bloor-webinar-data-lakes
 

Kürzlich hochgeladen

100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 

Kürzlich hochgeladen (20)

100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 

Big Data and Enterprise Data - Oracle -1663869

  • 1. Big Data and Enterprise Data: Bridging Two Worlds with Oracle Data Integrator 12c (ODI12c) O R A C L E W H I T E P A P E R | J A N U A R Y 2 0 1 5
  • 2. Big Data and Enterprise Data: Bridging Two Worlds with Oracle Data Integrator 12c (ODI12c) The advantage of Big Data comes into play when you have the ability to correlate Big Data with your existing enterprise data. There’s an implicit product requirement here in consolidating these various architecture principles into a single integration solution. The advantages of a single solution allow you to address not only the complexities of mapping, accessing, and loading Big Data but also combining it with your enterprise data – and this correlation requires integrating across mixed application environments. The correlation is key to taking full advantage of Big Data and requires a single unified tool that can straddle both environments. Introduction Volume, Velocity, Variety - Big Data is all the vogue. Why? Enterprises know that there is a treasure trove of information they should be tapping into. This information is predominantly less structured data consisting of weblogs, social media, email, sensors, and photographs that can be mined for useful information. Whether it is healthcare, financial, manufacturing, government or retail, Big Data presents a big opportunity. A recent McKinsey study (see figure 1) illustrates that the US retail industry could realize a 60% increase in operating margins – just by fully exploiting Big Data. And this is just the beginning. Big Data turns traditional information architecture on its head, putting into question commonly accepted notions of where and how data should be aggregated processed, analyzed, and stored. This is where low cost options of Hadoop and NoSQL come in – new technologies which solve new problems for managing unstructured data. But one shouldn’t forget what’s already running in today’s IT. Today’s Business Analytics, Data Warehouses, Business Applications (ERP, CRM, SCM, HCM), and even many social, mobile, cloud applications still rely almost exclusively on structured data. This dilemma is what today’s IT leaders are up against: what are the best ways to bridge enterprise data with Big Data through a common set of unified data integration tools? And what are the best strategies for dealing with the complexities of these two unique worlds? FIGURE 1: REAL WORLD USES CASES FOR BIG DATA New Data What is Possible Why Health Care Improve Quality And Efficiency Practitioner’s Notes; Machine Statistics Best Practices; Reduced Hospitalization Increase industry value by $300Bn per year Retail One Size Fits All Marketing Weblogs; Click Streams Micro-Segmentation; Recommendations Increase net margin by 60% Banking Fraud Detection; Risk Analysis Weblogs, Transaction , Systems, Fraud Reports Semantic Discovery; Pattern Detection Billions of dollars lost annually in bank fraud Location Based Services Based On Home Zip Personal Location Data Geo- Advertising, Traffic, Local Search Increase Revenue For Providers By $100B+
  • 3. BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C) Code Utilities Resilient And Adaptable Grid Smart Meter Reading, Call Center Data Real Time And Predictive Utilization Analysis Energy use expected to increase by 22% by 2030. Harnessing the Power of Big Data – Beyond the SQL World Big data is having a tremendous impact on the data integration solution space. This has to do primarily with the fact that Big Data poses new questions for the best ways to process volumes and varieties of data at higher speeds and at faster velocities while keeping operating costs to a minimum. But before we look at the impact in more detail, let’s first look at the current state of data integration. Data integration in a ‘traditional sense’ solves the issues of bulk data movement, replication, synchronization, virtualization, transformation, data quality, and data services. These capabilities serve as a key technology component for moving data between Data Warehouses, Business Analytics, Master Data Management, Enterprise Applications, and Custom Applications. But now there’s more to be moved. Much, much more. In fact, it’s not just the scale, but it’s the complexity of new technologies. Data Integration tools have evolved to support integration to and interoperation with technology offerings like Hadoop Distributed File System (“HDFS”) using innovative techniques like MapReduce and NoSQL. Oozie, Sqoop and new access methodologies like MapReduce, Pig and HiveQL have all merged the realms of Data Integration into the Big Data world. And in many cases to effectively leverage the power of this technology effectively it has to run natively in the technology. Much like E-LT runs natively on relational database systems, so too data integration technology needs to run its transformation, loading natively in Big Data environments. Big Data Technologies – What You Need to Know » Hadoop - is an open source software project that supports data-intensive distributed applications. It enables applications to work with thousands of computational independent computers and petabytes of data. » MapReduce - is a software framework that allows developers to write programs that process massive amounts of unstructured data in parallel across a distributed cluster of processors or stand- alone computers » Hive – is a query language similar to standard SQL statements. The query language is called Hive Query Language (HQL). Hive is executed on Hadoop clusters based on MapReduce operations. » NoSQL – is a term used for a wide variety of database management systems whose one thing in common is that they don’t use SQL. Oracle NoSQL Database, for example, is a key value database, where data is stored under and accessed by a unique key, and variable schemas are easily supported. » Hadoop Distributed File System (HDFS) - is a distributed, scalable, and portable file system written in Java for the Hadoop framework. In order to scale a Hadoop cluster to hundreds and thousands of nodes, HDFS is critical. » MongoDB – is a NoSQL open source Database designed for scale, flexible data aggregation and to store files of any size. It has rich querying, high availability and full indexing support and is fast being adopted by many businesses. » HBase – is an Apache open source, distributed, versioned project that provides real - time read/write access to Big Data atop clusters of commodity hardware. Apache HBase provides these capabilities on top of Hadoop and HDFS. » Sqoop – is the tool designed for efficiently transferring bulk data between the Apache distribution of Hadoop and structured relation Data Bases. (SQL to Hadoop = Sqoop) » Pig – is a programming platform to write Map Reduce programs to query large scale data. General Pig scripts are written in the Pig Latin language.
  • 4. BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C) » Oozie –is a workflow scheduler system to manage Apache Hadoop jobs. Oozie is integrated with the rest of the Hadoop stack supporting several types of Hadoop jobs such as Java map-reduce, Streaming map-reduce, Pig, Hive, and Sqoop. Challenges Posed By Big Data: » A shift from familiar Data Integration tools to new technologies to deal with Big Data problems will impact productivity of the organization and increase costs to invest in new skills and tools. » Performance constraints of the Data Integration tools to deal with larger and more complex data within ever shortening time windows and real time analytics. » Costly network traffic in moving data within and outside the business and storage areas, and lastly » Redundancy of existing costly IT investment which become obsolete if they are not “Big Data Enabled.” Oracle’s approach to Big Data solves each of these concerns. Oracle Data Integrator 12c (ODI12c) addresses productivity and skill requirements without leaving familiar programming and IT environments. Oracle Data Integrator Application Adapter for Hadoop helps bring the data together while Oracle’s Integrated Platform delivers optimal Big Data flow ensuring not just data size and complexity, but also data speed and delivers Fast Data. Best Technology Approaches for Integrating Big Data Integrated Design Tools: Improve Productivity The advantage of Big Data comes into play when you have the ability to correlate Big Data with your existing enterprise data. There’s an implicit product requirement here in consolidating these various architecture principles into a single integration solution. The advantages of a single solution allow you to address not only the complexities of mapping, accessing, and loading Big Data but also combining it with your enterprise data – and this correlation requires integrating across mixed application environments. The correlation is key to taking full advantage of Big Data and requires a single unified tool that can straddle both environments. In addition, Big Data sources consist of many different types and in many different forms. How can anyone be sure of the quality of that data? In the Big Data scenario, Data Quality is important because of the multitude of data sources. Multiple data sources make it difficult to trust the underlying data. Being able to quickly and easily identify and resolve any data discrepancies, missing values, etc in an automated fashion is beneficial to the applications and systems that use this information. Oracle Enterprise Data Quality integrated with Oracle Data Integrator 12c (ODI12c) enables data analysts to investigate, identify and resolve data quality issues by discovering and analyzing anomalies. Oracle Data Integrator 12c (ODI12c): Loading and Transforming Big Data Oracle Data Integrator 12c (ODI12c) leverages unified tooling for both Big Data and enterprise data which translates into a faster learning curve as well as seamless usability so that the data scientist or data analyst can focus on integration versus usability. Flow based declarative designs in Oracle Data Integrator 12c (ODI12c) helps build complex expressions that are easy to maintain and support. Two representations of the same ELT mappings, the logical representation and the physical representation, provide customized working environments for business analysts and data scientists.
  • 5. BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C) Fig. 2: Loading and Transforming Big Data using Oracle Data Integrator12c (ODI12c) Oracle Data Integrator Application Adapters for Hadoop As a component of Oracle Big Data Connectors, Oracle Data Integrator Application Adapter for Hadoop provides native Hadoop integration together with Oracle Data Integrator 12c (ODI12c). Knowledge Modules can then be used to build Hadoop metadata within Oracle Data Integrator 12c (ODI12c), load data into Hadoop, transform data within Hadoop, and load data easily and directly into Oracle Database utilizing Oracle Loader for Hadoop. Once the data is processed and organized on the Hadoop cluster, Oracle Data Integrator 12c (ODI12c) loads the data directly into Oracle Database utilizing the Oracle Loader for Hadoop. Oracle Data Integrator Application Adapter for Hadoop simplifies data loading and movement between Hadoop and an Oracle Database through Oracle Data Integrator 12c (ODI12c)’s easy to use prebuilt interfaces. Oracle Data Integrator Application Adapter for Hadoop simplifies data integration from Hadoop and an Oracle Database through Oracle Data Integrator 12c (ODI12c)’s easy to use interface. By providing efficient connectivity between Oracle Database and Hadoop, Oracle Big Data Connectors enables analysis of all data, both structured and unstructured, in enterprise data warehouses. Additionally, enterprises that are already using a Hadoop solution integrate data from HDFS using Oracle Big Data Connectors as a standalone software solution as well. Integrated Platform: Simplify and Optimize Taking all the miscellaneous technologies around Big Data – which are new to many organizations - and making them each work with one another is challenging. Making them work together in a production- grade environment is even more daunting. Integrated systems can help an organization radically simplify their Big Data architectures by integrating the necessary hardware and software components to provide fast and cost-efficient access, and mapping, to NoSQL and HDFS.
  • 6. BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C) Combined hardware and software systems can be optimized for redundancy with mirrored disks, optimized for high availability with hot-swappable power, and optimized for scale by adding new racks with more memory and processing power. Take it one step further and you can use these same systems to build out more elastic capacity to meet the flexibility requirements Big Data demands. Figure 3: Oracle’s Big Data Story Oracle is uniquely qualified to combine everything needed to meet the Big Data challenge –including software and hardware – into one engineered system. The Oracle Big Data Appliance is an engineered system that combines optimized hardware with the most comprehensive software stack featuring both the Cloudera Distribution including Apache Hadoop and specialized solutions developed by Oracle to deliver a complete, easy-to-deploy solution for acquiring, organizing and loading Big Data into Oracle Database. It is designed to deliver extreme analytics on all data types, with enterprise-class performance, availability, supportability and security. With Oracle Big Data Connectors and Oracle Data Integrator 12c (ODI12c), the solution is tightly integrated with Oracle Exadata and Oracle Database, so that data can be analyzed data with extreme performance. Real-time Business Analytics: Extends the Value of Big Data One of the key expectations for Big Data is to yield real-time analytics that improve business insights. But how timely is the data is a question that will still need to be answered for both Big Data or traditional enterprise data. While Big Data on its own has no means of applying ‘traditional’ change data capture [since there are no log files in NoSQL or HDFS], it’s still an important requirement to implement real-time solutions in conjunction with big data. Otherwise the speed advantage to indexing realms of Big Data will be undone by sluggish ETL processing that it’s dependent on. Big Data can be processed at high volume with high velocity. In fact, this is the entire strategy behind the invention of MapReduce – providing search responses of the highest quality in close to the blink of a human eye. Combine this power with real-time solutions in replication, change data capture, synchronization, and the integration
  • 7. BIG DATA AND ENTERPRISE DATA: BRIDGING TWO WORLDS WITH ORACLE DATA INTEGRATOR 12C (ODI12C) to business analytics tooling, and you have what amounts to the compelling advantages of real-time business analytics. The combination of Oracle Data Integrator and Oracle GoldenGate pack a powerful punch when it comes to achieving true real-time business analytics. Oracle GoldenGate is an integral part of the real- time business analytics use case to accomplish real- time data replication and capture ensuring applications have the data they need immediately. Conclusion Big data continues to be the center of attention - take away the hype and there's a key takeaway. For years, companies have been running their critical business infrastructure and building business insights based on transactional data stored in relational databases. Beyond that critical data, however, is a potential treasure trove of less structured data: web and call logs, social media, email, sensors, and photographs [and more] that can be mined for useful information. Companies that are seeking ways to capitalize on the hidden potential of Big Data need to consider data integration technologies to help bridge the gap and correlate that data across the enterprise. Bridging the two worlds of Big Data and enterprise data means considering solutions that are complete, based on traditional as well as emerging Hadoop technologies, and are poised for success through integrated design tools, integrated platforms, and real-time analytics. Leveraging these best practices ensures improved productivity, lowered TCO, IT optimization, and better business insights. Only Oracle provides the most complete, integrated, and real-time solution data integration that helps bridge the two worlds of Big Data and enterprise data to accelerate adoption and yield greater returns for these emerging technologies. For more information on Oracle Data Integrator go to our website: www.oracle.com/goto/dataintegration.
  • 8. Oracle Corporation, World Headquarters 500 Oracle Parkway Redwood Shores, CA 94065, USA Worldwide Inquiries Phone: +1.650.506.7000 Fax: +1.650.506.7200 Copyright © 2015, Oracle and/or its affiliates. All rights reserved. This document is provided for information purposes only, and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document, and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission. Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group.0115 White Paper Title January 2015 Author: Oracle C O N N E C T W I T H U S Oracle Data Integration Blog Oracle DI Facebook Page Oracle Data Integration Oracle DI Home Page