Hadoop is no longer optional. Companies of all sizes are in various phases of their own Big Data journey. Whether you are just starting to explore the platform or already have multiple clusters up and running, you face the same challenge: developing your internal skill set. Hadoop specialists are hard to find, and hand coding is error-prone when it comes to storing, integrating or analyzing your data. However, it doesn’t need to be this difficult.
In this recorded webinar, Talend and Hortonworks help you learn how to unify all your data in Hadoop, with no specialized Big Data skills.
Find the recording here. www.talend.com/resources/webinars/challenges-to-hadoop-adoption-if-you-can-dream-it-you-can-build-it
This webinar covers:
• How Hadoop opens a new world of analytic applications
• How to bridge the skills gap with our Big Data solutions
• A real-world, simple technical demo
2. Welcome
A few logistical points:
• All participants are muted
• You may ask questions using the Q&A panel located at the bottom of the GoToWebinar applet
• Answers will be provided after the presentation
• If time is too short to address all questions, answers will be provided via email
• To receive a replay of our webinar today, please send us an email to webinar@talend.com
• If you are experiencing connection problems, please use the Q&A panel to communicate
4. Your Speakers Today
Jim Walker
Director, Product Marketing
Shawn James
Director, Alliances & Business Development
Mark Balkenende
Sr. Sales Solution Architect
15. Main Challenges in the Data Integration Market
BIG DATA: More data, less structure
PRODUCTIVITY: Can’t keep up with demand
COST: Expensive solutions
SKILLS: Hard to find talent
16. The Big Data Demand
4.4 MILLION JOBS IN BIG DATA BY 2015, but only one third of those jobs will be filled
Source: Gartner
20. Talend Big Data
[Architecture diagram. Select icons made by Freepik, Situ Herrera, www.flaticon.com]
Systems shown: Legacy Systems, ERP, Internet of Things, DBMS / EDW, NoSQL, Web Logs
Analytics shown: Standard Reports, Ad-hoc Query Tools, Data Mining, MDD/OLAP, Analytical Applications
Users and tooling: Develop and Test, Operations Team, Studio
Capabilities: Ingestion, Map, Profile, Parse, Match, Cleanse, Standardize, Change Data Capture, Machine Learning, Share, Schedule, Native Access
Benefits: Future Proof Architecture, Lowest TCO, Increased Productivity
21. Talend Big Data
Easiest and Most Powerful Integration Solution for Big Data
25. Key Takeaways
• See how Talend’s Big Data Platform addresses the Skills Gap
• See how Talend will increase your Big Data Productivity
• Agree that Talend and Hortonworks have the technology and skills to satisfy your business requirements
BIG DATA: More data, less structure
PRODUCTIVITY: Can’t keep up with demand
SKILLS: Hard to find talent
26. Demonstration Use Case
The objective of the use case was to identify data quality issues prior to loading data to the EDW without increasing the actual load window.
• Load 500 TB Compressed Files to HDFS
- 3rd Party Sales/Prescribing files delivered by Vendor
• Compute Monthly Totals
- Prior to loading to EDW, compare the prior month’s totals to the current month’s totals within the new data files
• Display Comparison results in Analytical Tool
- Display total Sales comparison for each Product to quickly show Data Quality issues before loading to EDW Staging
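The monthly-totals check in this use case can be sketched in a few lines. The plain-Python example below is only an illustration of the comparison logic, not the Talend job shown in the webinar: the record layout, the sample values, the month labels and the 25% threshold are all assumptions made for the sketch.

```python
from collections import defaultdict

# Hypothetical sample records: (product, month, sales_amount).
# In the use case the real input is vendor-delivered files loaded to HDFS.
RECORDS = [
    ("ProductA", "2014-05", 1000.0),
    ("ProductA", "2014-05", 1100.0),
    ("ProductA", "2014-06", 2050.0),
    ("ProductB", "2014-05", 500.0),
    ("ProductB", "2014-06", 120.0),
    ("ProductC", "2014-06", 300.0),
]


def monthly_totals(records):
    """Sum sales per (product, month) pair."""
    totals = defaultdict(float)
    for product, month, amount in records:
        totals[(product, month)] += amount
    return dict(totals)


def compare_months(totals, prior, current, threshold=0.25):
    """Return one row per product, flagged as suspect when the relative
    change between the two months exceeds the (assumed) threshold, or
    when there is no prior-month total to compare against."""
    products = sorted({product for product, _ in totals})
    rows = []
    for product in products:
        before = totals.get((product, prior), 0.0)
        after = totals.get((product, current), 0.0)
        if before == 0.0:
            change, suspect = None, True
        else:
            change = (after - before) / before
            suspect = abs(change) > threshold
        rows.append({
            "product": product,
            "prior": before,
            "current": after,
            "change": change,
            "suspect": suspect,
        })
    return rows


for row in compare_months(monthly_totals(RECORDS), "2014-05", "2014-06"):
    print(row)
```

Running the check before the warehouse load means a product whose total swings sharply, or appears with no prior history, is visible before the data reaches EDW staging.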
27. Typical 3rd Party Data Load
Stages: Data Preparation, Warehouse Processing, Final Reports / Quality Check
Bad Big Data quality results in lost time, resources and revenue
28. Data Warehouse Optimization
Stages: Data Preparation, Warehouse Processing, Final Reports / Quality Check
Hadoop Cluster
✓ Upfront Quality Checks
✓ Identify Master records earlier
✓ Load Uncompressed data directly to DWH staging
Optimized Loading
30. Recap on the Demonstration
What stood out most?
• Hortonworks and Talend can help you reduce costs
• Offload costly ETL process
• Enrich the value of your EDW
• Graphical drag and drop visual environment showcasing Talend and Hortonworks
31. Hortonworks/Talend Sandbox
• Graphical drag and drop visual environment showcasing Hortonworks
- Visually see the results of integration process
• Accelerates data loading and transformation with Hadoop
- Build and deploy MapReduce and Pig jobs on YARN
• Pre-built use cases: data warehouse optimization, clickstream data, Twitter sentiment, Apache weblogs
• Demonstrations of several NoSQL databases
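As a rough illustration of the map and reduce phases behind a weblog use case like the one listed above, the single-process Python sketch below counts HTTP status codes in Apache-style access log lines. It is not Talend-generated code and does not use any Hadoop API; the sample lines, the regular expression and the function names are assumptions made for the sketch.

```python
import re
from collections import defaultdict

# Hypothetical sample lines in Apache Common Log Format, plus one bad line.
LOG_LINES = [
    '127.0.0.1 - - [10/Oct/2014:13:55:36 -0700] "GET /index.html HTTP/1.1" 200 2326',
    '127.0.0.1 - - [10/Oct/2014:13:55:40 -0700] "GET /missing.html HTTP/1.1" 404 209',
    '10.0.0.7 - - [10/Oct/2014:13:56:01 -0700] "POST /login HTTP/1.1" 200 512',
    'not a log line',
]

# The status code is the three-digit field right after the quoted request.
STATUS_PATTERN = re.compile(r'" (\d{3}) ')


def map_phase(line):
    """Emit a (status_code, 1) pair for a parseable line, nothing otherwise."""
    match = STATUS_PATTERN.search(line)
    if match:
        yield match.group(1), 1


def shuffle(pairs):
    """Group emitted values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def count_status_codes(lines):
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


STATUS_COUNTS = count_status_codes(LOG_LINES)
print(STATUS_COUNTS)
```

On a cluster the map and reduce steps run in parallel across many nodes and the framework handles the grouping; the sketch only shows the shape of the computation.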
32. From Zero to Big Data in 10 Minutes
Download free: www.talend.com/hortonworks-sandbox
• Get up and running in minutes, not weeks, with a big data Sandbox and demos
• Includes: Sentiment analysis, ETL Offload, Log file analysis
• Start working with Talend & Hortonworks today!