Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Hadoop and SAP BI
1. Toronto, Calgary
Canada
E-mail: info@2iSolutions.com
Website: www.2isolutions.com
Delhi, Bangalore
India
E-mail: info@2iSolutions.com
Website: www.2iSolutions.com
Hadoop and Lumira
Lesson Learned: Moving the big data
647 977 2648 and meeting ID 500-989-286
2. • You can join by Phone or Web
• Screen Sharing will be by Web Only
• You can also dial by phone
647 977 2648 and meeting ID 500-989-286
How to join the voice of this webinar?
3. Agenda
• About Me
• Why Big Data?
• What is Big Data?
• What is Hadoop
• What did we do?
• What problems faced?
• Connectivity with
Lumira
5. • SAP Partner Company
• Cloudera Partner Company
• Vendor for
City of Toronto
Ontario Govt
Federal Govt
Private Sector
• Company Focus: UI5, S4HANA, BI, AI, Predictive
About 2iSolutions
6. • Since 2005
• Trained more than 12000 students in Big Data,
PMP, CBAP, BA, SAP, SCRUM and Software Testing,
ISTQB
• Involve in Training, Resume Prep, Projects
• LMS
• Almost Free Repeat Policy (Some condition apply)
IIBS.CA Introduction: Who made it possible!
• Partner with
PMI
IIBA
ISTQB
Pearson Vue
7. • Disruptive Technologies
• Digital
• So many Tools
• So many roadmaps
• HANA
• HADOOP
• Lumira
• SAP Analytics Cloud
• SAP BI 4.2
• SAP BW4HANA
• SAP S4HANA Analytics
• SAP Digital Boardroom
• SAP Leonardo
• SAP DataHub
What is the Challenges?
9. 72
72% of CEOs believe the next 3 years will be more critical for
their industry than the previous 50 …Forbes 2016
One of Top 3
One of the top three priorities of CEOS over the next 3 years
is implementing disruptive technologies
77
77% are concerned whether their organization is keeping up
with new technologies
Market Research: Forbes 2016
15. Analytics: Where are we heading?
• Descriptive : What happened ?
• Diagnostic : Why did it happen ?
• Predictive : What is it likely to happen ?
• Perspective : What should we do about it ?
18. • Hadoop is a software framework for distributed processing
of large datasets across large clusters of computers
Large datasets Terabytes or petabytes of data
Large clusters hundreds or thousands of nodes
• Hadoop is based on a simple data model, any data will fit
• Technically
Map Reduce
HDFS
Commodity Hardware
YARN
What is Hadoop?
21. HDFS : Hadoop File System
• Runs on top of any operating file system.
• Designed to handle very large files with streaming data
access patterns.
• Hadoop uses blocks to store file or part of a file
Input
Data
Block 1
Block 2
Block 3 …
Block
1
Block 2
Block 2
Block 3
Block
1
Block 3
22. Why Hadoop ?
Scalability (petabytes of data,
thousands of machines)
Database
vs.
Flexibility in accepting all data
formats (no schema)
Commodity inexpensive hardware
Efficient and simple fault-tolerant
mechanism
Performance (tons of indexing,
tuning, data organization tech.)
Features:
- Provenance tracking
- Annotation management
- ….
23. Enter Apache Spark
Flexible, in-memory data processing for Hadoop
Ease of Use
Advanced Analytics &
Machine Learning
Performance
• Rich & flexible APIs
for Scala, Java, and
Python
• Seamlessly
interleave SQL
syntax with code
• Interactive shell
• Unified framework
for batch and
stream processing
• Rich collection of
distributed ML
algorithms
• In-Memory caching
• Optimized
Scheduler
• Query optimizer
24. 1. Creating an Innovation Platform for Disruptive Technology
2. Any Data will fit
3. Commodity Hardware
4. Easier Data Management, ELT Model
5. Business Case e.g. AI, Deep Learning and Machine
Learning, Chatbot
6. Archival Storage
7. Data Lake
8. Scalable
9. Hidden opportunities for Saving and Innovation
10. Open Source
Top 10 Reasons for Big Data and Hadoop
35. • Identify your business case
• Start a PoC (Start Small)
• Confirm benefits and Iterate
Next 3 Steps
36. • Data Science
• Hadoop
• Python
IIBS.CA Training Program
SAP Programs:
SAP BI (BW on HANA, Webi, Lumira,
Design Studio)
SAP HANA
ECC and S4HANA
• PMP
• SCRUM
• Software Testing