SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Addressing Big Data Challenges: The Hadoop Way 
Presented by: Atul Dambalkar
Agenda Big Data Challenges Big Data Analytics Industry Trends Hadoop as a Solution Real Life Solution Studies 
•Case Study I - Retail Industry 
•Case Study II - Online Advertising Industry How Xoriant can help? Q & A
Big Data Challenges 
THE FOUR V’s OF BIG DATA 
Source: IBM
Traditional Approach & Its Limitations 
Data Warehousing Vendors (ETL) 
Costs - High Initial Setup, 
Maintenance, Subscription 
or Licensing Fees 
No support for 
unstructured data 
Multiple copies of data in different formats 
Data latency and 
bottlenecks 
No support for ad-hoc query 
Note: The Logos are proprietary of the individual companies
Big Data and Analytics - Trends 
Enterprise Data Hub or Data Lake (Hadoop with HDFS) 
Commodity Hardware 
No multiple data copies 
Fault-Tolerant storage for Raw data As-Is 
Current Limitations - Write/Append only, No Delete or Update 
Unified Data Access 
Multiple Data Processing Paradigms 
In-memory processing 
In-memory, 
Real-time Stream processing 
Analytics based on Distributed SQL Processing 
Falling hardware prices 
Batch mode processing for data size more than Hundreds of TBs 
In-memory processing for data size less than hundreds of TBs 
ETL Trends 
Data 
Processing 
Trends 
Architecture 
Trends 
Open Source Software
Hadoop Proposition 
Open Source Ecosystem 
No data loss through replicated storage (HDFS) 
Runs on commodity hardware 
•Map-Reduce 
•Script based (Pig Latin) 
•SQL like - HiveQL, Apache Drill, Presto (Facebook) 
•Impala (Cloudera), HAWQ (Pivotal) 
•In-memory processing (Apache Spark) 
Multiple data analysis/processing paradigms
Hadoop Data Flow
Apache - Hadoop Ecosystem
Case Study - 1 
Benefits 
Retail Industry 
Problem Scenario 
Personalize marketing campaigns, coupons, offers, marking down inventories 
Improving customer loyalty – leads to sales and profitability 
Competition from other retailers 
ETL based analysis tasks - taking lot of time – up to 6 weeks 
Software systems (Oracle, Greenplum, SAS, Teradata) 
Mainframe based expensive hardware systems 
Hadoop based Solution 
Data stored into HDFS with replication 
300 Hadoop nodes with 2PB data 
Data processing time down to 1 week and even daily 
Mainframe cost savings 
No software licensing costs 
Limitless data storage with HDFS 
No multiple data copies 
Low cost
Case Study - 2 
Benefits 
Attribution computation time down to 45 minutes 
Capable of processing up to 300GB data for each computation 
Manageable data storage with HDFS 
Low cost 
Online Advertising Industry (Attribution Computation) 
Problem Scenario 
Growing Ads Impression and conversion events 
Longer attribution computation time (6 to 8 hours for each computation run). Advertisers needed quick results 
Unable to process more than 150GB data within each computation 
IBM Netezza based solution along with Oracle 
Expensive hardware and software costs 
Hadoop based Solution 
Data stored into HDFS with replication 
Initially used HiveQL then moved to Cloudera Impala (MPP architecture based Distributed SQL Engine)
Xoriant Big Data Practice - Overview Understands technological needs and organizational challenges faced with respect to Big Data Understands rapidly evolving Big Data technology space Can help bridge the gaps with Big Data capabilities Brings Big Data and NoSQL technology expertise
Xoriant – Big Data Center of Excellence 
Email: bigdata@xoriant.com 
For FREE consultation, please contact us on the above mentioned email address. 
Do you have any Questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityDATAVERSITY
 
Organizational Change Management for Data- and Analytics-Driven Projects
Organizational Change Management for Data- and Analytics-Driven ProjectsOrganizational Change Management for Data- and Analytics-Driven Projects
Organizational Change Management for Data- and Analytics-Driven ProjectsDATAVERSITY
 
Metadata Strategies
Metadata StrategiesMetadata Strategies
Metadata StrategiesDATAVERSITY
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefitsRicky Barron
 
Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...
Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...
Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...Prof. Dr. Diego Kuonen
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesEric Kavanagh
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseData Con LA
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data EngineeringC4Media
 
Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | EdurekaTop 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | EdurekaEdureka!
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best PracticesDATAVERSITY
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouseJames Serra
 
Big data analysis using map/reduce
Big data analysis using map/reduceBig data analysis using map/reduce
Big data analysis using map/reduceRenuSuren
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as ProductDATAVERSITY
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogDATAVERSITY
 

Was ist angesagt? (20)

Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data Quality
 
Organizational Change Management for Data- and Analytics-Driven Projects
Organizational Change Management for Data- and Analytics-Driven ProjectsOrganizational Change Management for Data- and Analytics-Driven Projects
Organizational Change Management for Data- and Analytics-Driven Projects
 
Metadata Strategies
Metadata StrategiesMetadata Strategies
Metadata Strategies
 
Our big data
Our big dataOur big data
Our big data
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 
Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...
Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...
Big Data, Data-Driven Decision Making and Statistics Towards Data-Informed Po...
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data Engineering
 
What is big data?
What is big data?What is big data?
What is big data?
 
Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | EdurekaTop 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
 
Big data
Big dataBig data
Big data
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Big data analysis using map/reduce
Big data analysis using map/reduceBig data analysis using map/reduce
Big data analysis using map/reduce
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
Motivation for big data
Motivation for big dataMotivation for big data
Motivation for big data
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as Product
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 

Ähnlich wie Addressing Big Data Challenges - The Hadoop Way

Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big DataDataWorks Summit
 
Hadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter PointHadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter PointInside Analysis
 
Exploring the Wider World of Big Data
Exploring the Wider World of Big DataExploring the Wider World of Big Data
Exploring the Wider World of Big DataNetApp
 
Hadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseHadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseCloudera, Inc.
 
Exploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis KapsalisExploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis KapsalisNetAppUK
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data productsVikas Sardana
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data AnalyticsAttunity
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKRajesh Jayarman
 
When Databases Meet Big data and Hadoop - Uni of Tromso Online Lecture
When Databases Meet Big data and Hadoop - Uni of Tromso Online LectureWhen Databases Meet Big data and Hadoop - Uni of Tromso Online Lecture
When Databases Meet Big data and Hadoop - Uni of Tromso Online LectureIrfan Elahi
 
Hadoop - An Introduction
Hadoop - An IntroductionHadoop - An Introduction
Hadoop - An IntroductionShankar R
 
Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & HadoopBlackvard
 
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)GeeksLab Odessa
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarioskcmallu
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Vantara
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitecturePerficient, Inc.
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02email2jl
 
Stratebi Big Data
Stratebi Big DataStratebi Big Data
Stratebi Big DataStratebi
 

Ähnlich wie Addressing Big Data Challenges - The Hadoop Way (20)

Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big Data
 
Hadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter PointHadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter Point
 
Exploring the Wider World of Big Data
Exploring the Wider World of Big DataExploring the Wider World of Big Data
Exploring the Wider World of Big Data
 
Retail & CPG
Retail & CPGRetail & CPG
Retail & CPG
 
Hadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseHadoop: Extending your Data Warehouse
Hadoop: Extending your Data Warehouse
 
Exploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis KapsalisExploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis Kapsalis
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data products
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RK
 
When Databases Meet Big data and Hadoop - Uni of Tromso Online Lecture
When Databases Meet Big data and Hadoop - Uni of Tromso Online LectureWhen Databases Meet Big data and Hadoop - Uni of Tromso Online Lecture
When Databases Meet Big data and Hadoop - Uni of Tromso Online Lecture
 
Beyond TCO
Beyond TCOBeyond TCO
Beyond TCO
 
Hadoop - An Introduction
Hadoop - An IntroductionHadoop - An Introduction
Hadoop - An Introduction
 
Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & Hadoop
 
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data Architecture
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
 
Hadoop & Data Warehouse
Hadoop & Data Warehouse Hadoop & Data Warehouse
Hadoop & Data Warehouse
 
Stratebi Big Data
Stratebi Big DataStratebi Big Data
Stratebi Big Data
 

Mehr von Xoriant Corporation

Webinar: Unlocking the potential of io t data
Webinar: Unlocking the potential of io t dataWebinar: Unlocking the potential of io t data
Webinar: Unlocking the potential of io t dataXoriant Corporation
 
Xoriant - Financial services expertise
Xoriant - Financial services expertiseXoriant - Financial services expertise
Xoriant - Financial services expertiseXoriant Corporation
 
Xoriant Smartphone apps accelerator
Xoriant Smartphone apps acceleratorXoriant Smartphone apps accelerator
Xoriant Smartphone apps acceleratorXoriant Corporation
 
Mobile porting and testing - Xoriant
Mobile porting and testing - Xoriant Mobile porting and testing - Xoriant
Mobile porting and testing - Xoriant Xoriant Corporation
 
SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...
SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...
SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...Xoriant Corporation
 
Product Engineering Outsourcing: Looking beyond Cost Savings
Product Engineering Outsourcing: Looking beyond Cost SavingsProduct Engineering Outsourcing: Looking beyond Cost Savings
Product Engineering Outsourcing: Looking beyond Cost SavingsXoriant Corporation
 
Growth by Partnerships for ISVs in the financial software products markets
Growth by Partnerships for ISVs in the financial software products marketsGrowth by Partnerships for ISVs in the financial software products markets
Growth by Partnerships for ISVs in the financial software products marketsXoriant Corporation
 
Product Engineering - Distributed Agile
Product Engineering - Distributed AgileProduct Engineering - Distributed Agile
Product Engineering - Distributed AgileXoriant Corporation
 
The Xoriant Whitepaper: Last Mile Soa Implementation
The Xoriant Whitepaper: Last Mile Soa ImplementationThe Xoriant Whitepaper: Last Mile Soa Implementation
The Xoriant Whitepaper: Last Mile Soa ImplementationXoriant Corporation
 

Mehr von Xoriant Corporation (11)

Webinar: Unlocking the potential of io t data
Webinar: Unlocking the potential of io t dataWebinar: Unlocking the potential of io t data
Webinar: Unlocking the potential of io t data
 
Xoriant - Financial services expertise
Xoriant - Financial services expertiseXoriant - Financial services expertise
Xoriant - Financial services expertise
 
Xoriant Smartphone apps accelerator
Xoriant Smartphone apps acceleratorXoriant Smartphone apps accelerator
Xoriant Smartphone apps accelerator
 
Mobile porting and testing - Xoriant
Mobile porting and testing - Xoriant Mobile porting and testing - Xoriant
Mobile porting and testing - Xoriant
 
SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...
SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...
SEP Webinar –HTML5: The GenX Technology for building scalable and high perfor...
 
Staying the Course
Staying the CourseStaying the Course
Staying the Course
 
Product Engineering Outsourcing: Looking beyond Cost Savings
Product Engineering Outsourcing: Looking beyond Cost SavingsProduct Engineering Outsourcing: Looking beyond Cost Savings
Product Engineering Outsourcing: Looking beyond Cost Savings
 
Growth by Partnerships for ISVs in the financial software products markets
Growth by Partnerships for ISVs in the financial software products marketsGrowth by Partnerships for ISVs in the financial software products markets
Growth by Partnerships for ISVs in the financial software products markets
 
Product Engineering - Distributed Agile
Product Engineering - Distributed AgileProduct Engineering - Distributed Agile
Product Engineering - Distributed Agile
 
The Xoriant Whitepaper: Last Mile Soa Implementation
The Xoriant Whitepaper: Last Mile Soa ImplementationThe Xoriant Whitepaper: Last Mile Soa Implementation
The Xoriant Whitepaper: Last Mile Soa Implementation
 
Offering For Tech Companies
Offering For Tech CompaniesOffering For Tech Companies
Offering For Tech Companies
 

Kürzlich hochgeladen

100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 

Kürzlich hochgeladen (20)

100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 

Addressing Big Data Challenges - The Hadoop Way

  • 1. Addressing Big Data Challenges: The Hadoop Way Presented by: Atul Dambalkar
  • 2. Agenda Big Data Challenges Big Data Analytics Industry Trends Hadoop as a Solution Real Life Solution Studies •Case Study I - Retail Industry •Case Study II - Online Advertising Industry How Xoriant can help? Q & A
  • 3. Big Data Challenges THE FOUR V’s OF BIG DATA Source: IBM
  • 4. Traditional Approach & Its Limitations Data Warehousing Vendors (ETL) Costs - High Initial Setup, Maintenance, Subscription or Licensing Fees No support for unstructured data Multiple copies of data in different formats Data latency and bottlenecks No support for ad-hoc query Note: The Logos are proprietary of the individual companies
  • 5. Big Data and Analytics - Trends Enterprise Data Hub or Data Lake (Hadoop with HDFS) Commodity Hardware No multiple data copies Fault-Tolerant storage for Raw data As-Is Current Limitations - Write/Append only, No Delete or Update Unified Data Access Multiple Data Processing Paradigms In-memory processing In-memory, Real-time Stream processing Analytics based on Distributed SQL Processing Falling hardware prices Batch mode processing for data size more than Hundreds of TBs In-memory processing for data size less than hundreds of TBs ETL Trends Data Processing Trends Architecture Trends Open Source Software
  • 6. Hadoop Proposition Open Source Ecosystem No data loss through replicated storage (HDFS) Runs on commodity hardware •Map-Reduce •Script based (Pig Latin) •SQL like - HiveQL, Apache Drill, Presto (Facebook) •Impala (Cloudera), HAWQ (Pivotal) •In-memory processing (Apache Spark) Multiple data analysis/processing paradigms
  • 8. Apache - Hadoop Ecosystem
  • 9. Case Study - 1 Benefits Retail Industry Problem Scenario Personalize marketing campaigns, coupons, offers, marking down inventories Improving customer loyalty – leads to sales and profitability Competition from other retailers ETL based analysis tasks - taking lot of time – up to 6 weeks Software systems (Oracle, Greenplum, SAS, Teradata) Mainframe based expensive hardware systems Hadoop based Solution Data stored into HDFS with replication 300 Hadoop nodes with 2PB data Data processing time down to 1 week and even daily Mainframe cost savings No software licensing costs Limitless data storage with HDFS No multiple data copies Low cost
  • 10. Case Study - 2 Benefits Attribution computation time down to 45 minutes Capable of processing up to 300GB data for each computation Manageable data storage with HDFS Low cost Online Advertising Industry (Attribution Computation) Problem Scenario Growing Ads Impression and conversion events Longer attribution computation time (6 to 8 hours for each computation run). Advertisers needed quick results Unable to process more than 150GB data within each computation IBM Netezza based solution along with Oracle Expensive hardware and software costs Hadoop based Solution Data stored into HDFS with replication Initially used HiveQL then moved to Cloudera Impala (MPP architecture based Distributed SQL Engine)
  • 11. Xoriant Big Data Practice - Overview Understands technological needs and organizational challenges faced with respect to Big Data Understands rapidly evolving Big Data technology space Can help bridge the gaps with Big Data capabilities Brings Big Data and NoSQL technology expertise
  • 12. Xoriant – Big Data Center of Excellence Email: bigdata@xoriant.com For FREE consultation, please contact us on the above mentioned email address. Do you have any Questions?