SlideShare ist ein Scribd-Unternehmen logo
1 von 29
Downloaden Sie, um offline zu lesen
Mohammad Reza Gerami
gerami@aryatadbir.com
mrgerami@aut.ac.ir
1
2
3
• ‘Big Data’ is similar to ‘small data’, but
bigger
•…but having data bigger it requires different
approaches:
• Techniques, tools and architecture
•…with an aim to solve new problems
• …or old problems in a better way
4
5
Characteristics of Big Data:
1-Scale (Volume)
• DataVolume
Exponential increase in
collected/generated data
6
Big Data in Today’s Business and Technology Environment
 2.7 Zetabytes of data exist in the digital universe today. (Source)
 235 Terabytes of data has been collected by the U.S. Library of Congress in April
2011. (Source)
 The Obama administration is investing $200 million in big data research projects.
(Source)
 IDC Estimates that by 2020,business transactions on the internet- business-to-
business and business-to-consumer – will reach 450 billion per day. (Source)
 Facebook stores, accesses, and analyzes 30+ Petabytes of user generated data.
(Source)
 Akamai analyzes 75 million events per day to better target advertisements.
(Source)
 94% of Hadoop users perform analytics on large volumes of data not possible
before; 88% analyze data in greater detail; while 82% can now retain more of their
data. (Source)
7
 Walmart handles more than 1 million customer transactions
every hour, which is imported into databases estimated to
contain more than 2.5 petabytes of data. (Source)
 More than 5 billion people are calling, texting, tweeting and
browsing on mobile phones worldwide. (Source)
 Decoding the human genome originally took 10 years to
process; now it can be achieved in one week. (Source)
 In 2008, Google was processing 20,000 terabytes of data (20
petabytes) a day. (Source)
The largest AT&T database boasts titles including the largest
volume of data in one unique database (312 terabytes) and the
second largest number of rows in a unique
8
The Rapid Growth of Unstructured Data
YouTube users upload 48 hours of new video every minute
of the day. (Source)
571 new websites are created every minute of the day.
(Source)
Brands and organizations on Facebook receive 34,722
Likes every minute of the day. (Source)
100 terabytes of data uploaded daily to Facebook.
(Source)
According to Twitter’s own research in early 2012, it sees
roughly 175 million tweets every day, and has more than
465 million accounts. (Source)
30 Billion pieces of content shared on Facebook every
month. (Source)
Data production will be 44 times greater in 2020 than it
was in 2009. (Source)
9
The Rapid Growth of Unstructured Data
In late 2011, IDC Digital Universe published a
report indicating that some 1.8 zettabytes of
data will be created that year. (Source)
In other words, the amount of data in the world
today is equal to:
Every person in the US tweeting three tweets
per minute for 26,976 years.
Every person in the world having more than
215m high-resolution MRI scans a day.
More than 200bn HD movies – which would take a
person 47m years to watch.
10
Decimal
Value Metric
1000 kB kilobyte
10002 MB megabyte
10003 GB gigabyte
10004 TB terabyte
10005 PB petabyte
10006 EB exabyte
10007 ZB zettabyte
10008 YB yottabyte
11
Social media and networks
(all of us are generating data)
Scientific instruments
(collecting all sorts of data)
Mobile devices
(tracking all objects all the time
Sensor technology and
networks
(measuring all kinds of data)
12
• No single standard definition…
Big Data
13
14
15
What to do with these data?
16
How much data?
640K ought to
be enough for
anybody.
17
Why Big Data
• Key enablers of appearance and growth of Big Data are
–Increase of storage capacities
–Increase of processing power
–Availability of data
–Every day we create 2.5 quintillion bytes of data;
90% of the data in the world today has been created
in the last two years alone
18
Big Data Analytics
• Examining large amount of data
• Appropriate information
• Identification of hidden patterns, unknown correlations
• Competitive advantage
• Better business decisions: strategic and operational
• Effective marketing, customer satisfaction, increased revenue
19
Applications for Big Data Analytics
Homeland Security
FinanceSmarter Healthcare Multi-channel sales
Telecom
Manufacturing
Traffic Control
Trading Analytics Fraud and Risk
Log Analysis
Search Quality
Retail: Churn, NBO
20
Healthcare
• 80% of medical data is unstructured and is clinically
relevant
• Data resides in multiple places like individual EMRs,
lab and imaging systems, physician notes, medical
correspondence, claims etc
• Leveraging Big Data
• Build sustainable healthcare systems
• Collaborate to improve care and outcomes
• Increase access to healthcare
21
Market Size
Source:WikibonTaming Big Data
By 2015 4.4 million IT jobs in Big Data ; 1.9 million is in US itself
22
PotentialTalent Pool -Big Data
India will require a minimum of 1 lakh data scientists in the next couple of years
in addition to data analysts and data managers to support the Big Data space.
23
24
Future of Big Data
25
Big DataAnalyticsTechnologies
NoSQL : non-relational or at least non-SQL database
solutions such as HBase (also a part of the Hadoop
ecosystem), Cassandra, MongoDB, Riak, CouchDB, and
many others.
Hadoop: It is an ecosystem of software packages,
including MapReduce, HDFS, and a whole host of other
software packages
26
Main Big DataTechnologies
Hadoop NoSQL Databases Analytic Databases
Hadoop
• Low cost, reliable
scale-out architecture
• Distributed computing
Proven success in
Fortune 500
companies
• Exploding interest
NoSQL Databases
• Huge horizontal scaling
and high availability
• Highly optimized for
retrieval and appending
• Types
• Document stores
• Key Value stores
• Graph databases
Analytic RDBMS
• Optimized for bulk-load
and fast aggregate
query workloads
• Types
• Column-oriented
• MPP
• In-memory
27
Thank you 
28
29

Weitere ähnliche Inhalte

Was ist angesagt?

Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)
Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)
Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)Mahmood Khosravi
 
A Short History of Big Data
A Short History of Big DataA Short History of Big Data
A Short History of Big DataGadi Eichhorn
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science suresh sood
 
Big data Presentation
Big data PresentationBig data Presentation
Big data PresentationAswadmehar
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaSkillspeed
 
Social Big Data in Government
Social Big Data in GovernmentSocial Big Data in Government
Social Big Data in GovernmentAdegboyega Ojo
 
Interesting ways Big Data is used today
Interesting ways Big Data is used todayInteresting ways Big Data is used today
Interesting ways Big Data is used todayDaniel Sârbe
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big dataDeependra Jyoti
 
Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceDataMites
 
Big data for official statistics @ Konferensi Big Data Indonesia 2016
Big data for official statistics @ Konferensi Big Data Indonesia 2016 Big data for official statistics @ Konferensi Big Data Indonesia 2016
Big data for official statistics @ Konferensi Big Data Indonesia 2016 Setia Pramana
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A reviewShilpa Soi
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsWay-Yen Lin
 

Was ist angesagt? (20)

Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)
Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)
Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
A Short History of Big Data
A Short History of Big DataA Short History of Big Data
A Short History of Big Data
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Dr Ohad Barzilay
Dr Ohad BarzilayDr Ohad Barzilay
Dr Ohad Barzilay
 
BIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social MediaBIG Data & Hadoop Applications in Social Media
BIG Data & Hadoop Applications in Social Media
 
Social Big Data in Government
Social Big Data in GovernmentSocial Big Data in Government
Social Big Data in Government
 
Interesting ways Big Data is used today
Interesting ways Big Data is used todayInteresting ways Big Data is used today
Interesting ways Big Data is used today
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big data
 
Data mining on big data
Data mining on big dataData mining on big data
Data mining on big data
 
Big data
Big dataBig data
Big data
 
Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data Science
 
Big data-ppt-
Big data-ppt-Big data-ppt-
Big data-ppt-
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data for official statistics @ Konferensi Big Data Indonesia 2016
Big data for official statistics @ Konferensi Big Data Indonesia 2016 Big data for official statistics @ Konferensi Big Data Indonesia 2016
Big data for official statistics @ Konferensi Big Data Indonesia 2016
 
Bigdatappt
BigdatapptBigdatappt
Bigdatappt
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A review
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data Scientists
 

Andere mochten auch

Arduino 與 raspberry pi 硬體差異與應用
Arduino 與  raspberry pi 硬體差異與應用Arduino 與  raspberry pi 硬體差異與應用
Arduino 與 raspberry pi 硬體差異與應用Marcus Pek
 
Greenhouse technology
Greenhouse technologyGreenhouse technology
Greenhouse technologyagrihortico
 
Cassava
CassavaCassava
Cassavaajako
 
GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...
GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...
GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...CGIAR Generation Challenge Programme
 
Cassava.2pdf
Cassava.2pdfCassava.2pdf
Cassava.2pdfkemal1983
 
Cultivated genetic diversity and farmers’ knowledge – keys to cassava breeding
 Cultivated genetic diversity and farmers’ knowledge – keys to cassava breeding Cultivated genetic diversity and farmers’ knowledge – keys to cassava breeding
Cultivated genetic diversity and farmers’ knowledge – keys to cassava breedingSIANI
 
CASSAVA BUSINESS IN ASEAN 29102011 REV-3
CASSAVA BUSINESS IN ASEAN 29102011 REV-3CASSAVA BUSINESS IN ASEAN 29102011 REV-3
CASSAVA BUSINESS IN ASEAN 29102011 REV-3Rhino H Pranapati
 
Embedded green house automation system
Embedded green house automation systemEmbedded green house automation system
Embedded green house automation systemgajula vijaya lakshmi
 
INDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.com
INDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.comINDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.com
INDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.comGustavo Adolfo Garzon Mosquera
 
Cassava for sustainable poverty alleviation
Cassava for sustainable poverty alleviationCassava for sustainable poverty alleviation
Cassava for sustainable poverty alleviationCIAT
 

Andere mochten auch (16)

Session 6.1 Markets of Cassava products: Issues in Marketing/Smallholder Prod...
Session 6.1 Markets of Cassava products: Issues in Marketing/Smallholder Prod...Session 6.1 Markets of Cassava products: Issues in Marketing/Smallholder Prod...
Session 6.1 Markets of Cassava products: Issues in Marketing/Smallholder Prod...
 
Cassava 3
Cassava 3Cassava 3
Cassava 3
 
Arduino 與 raspberry pi 硬體差異與應用
Arduino 與  raspberry pi 硬體差異與應用Arduino 與  raspberry pi 硬體差異與應用
Arduino 與 raspberry pi 硬體差異與應用
 
Greenhouse technology
Greenhouse technologyGreenhouse technology
Greenhouse technology
 
Session 5.1 Potential use of Cassava Wastes to Produce Energy: Outcomes of a ...
Session 5.1 Potential use of Cassava Wastes to Produce Energy: Outcomes of a ...Session 5.1 Potential use of Cassava Wastes to Produce Energy: Outcomes of a ...
Session 5.1 Potential use of Cassava Wastes to Produce Energy: Outcomes of a ...
 
Andy J - Climate change and the outlook for cassava
Andy J - Climate change and the outlook for cassavaAndy J - Climate change and the outlook for cassava
Andy J - Climate change and the outlook for cassava
 
Cassava
CassavaCassava
Cassava
 
GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...
GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...
GRM 2013: Cassava product catalogue and project status -- Projects ongoing, c...
 
Cassava.2pdf
Cassava.2pdfCassava.2pdf
Cassava.2pdf
 
Cultivated genetic diversity and farmers’ knowledge – keys to cassava breeding
 Cultivated genetic diversity and farmers’ knowledge – keys to cassava breeding Cultivated genetic diversity and farmers’ knowledge – keys to cassava breeding
Cultivated genetic diversity and farmers’ knowledge – keys to cassava breeding
 
Cassava Study
Cassava StudyCassava Study
Cassava Study
 
CASSAVA BUSINESS IN ASEAN 29102011 REV-3
CASSAVA BUSINESS IN ASEAN 29102011 REV-3CASSAVA BUSINESS IN ASEAN 29102011 REV-3
CASSAVA BUSINESS IN ASEAN 29102011 REV-3
 
Embedded green house automation system
Embedded green house automation systemEmbedded green house automation system
Embedded green house automation system
 
4.2 Disease Control and Pest Management in Cassava Production by Kumar, IITA
4.2 Disease Control and Pest Management in Cassava Production by Kumar, IITA4.2 Disease Control and Pest Management in Cassava Production by Kumar, IITA
4.2 Disease Control and Pest Management in Cassava Production by Kumar, IITA
 
INDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.com
INDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.comINDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.com
INDUSTRIAL CASSAVA PRODUCTION www.cassabisconsulting.com
 
Cassava for sustainable poverty alleviation
Cassava for sustainable poverty alleviationCassava for sustainable poverty alleviation
Cassava for sustainable poverty alleviation
 

Ähnlich wie Big-Data-AryaTadbirNetworkDesigners

Bigdata the technological renaissance
Bigdata the technological renaissanceBigdata the technological renaissance
Bigdata the technological renaissanceRituBhargava7
 
DataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDATAVERSITY
 
Data-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData Blueprint
 
Data mining with big data implementation
Data mining with big data implementationData mining with big data implementation
Data mining with big data implementationSandip Tipayle Patil
 
If companies are not careful, "Big Data" will become "Big Dilbert"
If companies are not careful, "Big Data" will become "Big Dilbert"If companies are not careful, "Big Data" will become "Big Dilbert"
If companies are not careful, "Big Data" will become "Big Dilbert"JAX Chamber IT Council
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big DataSonovate
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Oomph! Recruitment
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Hritika Raj
 
Puja(801),sanghamitra(819),surabhi(844)
Puja(801),sanghamitra(819),surabhi(844)Puja(801),sanghamitra(819),surabhi(844)
Puja(801),sanghamitra(819),surabhi(844)puja singh
 
bigdatappt-130621045034-phpapp01
bigdatappt-130621045034-phpapp01bigdatappt-130621045034-phpapp01
bigdatappt-130621045034-phpapp01Arun Sai
 

Ähnlich wie Big-Data-AryaTadbirNetworkDesigners (20)

Big Data - Gerami
Big Data - GeramiBig Data - Gerami
Big Data - Gerami
 
Ov big data
Ov big dataOv big data
Ov big data
 
Bigdata the technological renaissance
Bigdata the technological renaissanceBigdata the technological renaissance
Bigdata the technological renaissance
 
DataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big DataDataEd Online: Demystifying Big Data
DataEd Online: Demystifying Big Data
 
Data-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData-Ed: Demystifying Big Data
Data-Ed: Demystifying Big Data
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Data mining with big data implementation
Data mining with big data implementationData mining with big data implementation
Data mining with big data implementation
 
If companies are not careful, "Big Data" will become "Big Dilbert"
If companies are not careful, "Big Data" will become "Big Dilbert"If companies are not careful, "Big Data" will become "Big Dilbert"
If companies are not careful, "Big Data" will become "Big Dilbert"
 
Big data
Big dataBig data
Big data
 
big data
big databig data
big data
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
Big Data World
Big Data WorldBig Data World
Big Data World
 
Big Data
Big DataBig Data
Big Data
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Puja(801),sanghamitra(819),surabhi(844)
Puja(801),sanghamitra(819),surabhi(844)Puja(801),sanghamitra(819),surabhi(844)
Puja(801),sanghamitra(819),surabhi(844)
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
bigdatappt-130621045034-phpapp01
bigdatappt-130621045034-phpapp01bigdatappt-130621045034-phpapp01
bigdatappt-130621045034-phpapp01
 
big data.pptx
big data.pptxbig data.pptx
big data.pptx
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 

Kürzlich hochgeladen

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Kürzlich hochgeladen (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

Big-Data-AryaTadbirNetworkDesigners

  • 2. 2
  • 3. 3
  • 4. • ‘Big Data’ is similar to ‘small data’, but bigger •…but having data bigger it requires different approaches: • Techniques, tools and architecture •…with an aim to solve new problems • …or old problems in a better way 4
  • 5. 5
  • 6. Characteristics of Big Data: 1-Scale (Volume) • DataVolume Exponential increase in collected/generated data 6
  • 7. Big Data in Today’s Business and Technology Environment  2.7 Zetabytes of data exist in the digital universe today. (Source)  235 Terabytes of data has been collected by the U.S. Library of Congress in April 2011. (Source)  The Obama administration is investing $200 million in big data research projects. (Source)  IDC Estimates that by 2020,business transactions on the internet- business-to- business and business-to-consumer – will reach 450 billion per day. (Source)  Facebook stores, accesses, and analyzes 30+ Petabytes of user generated data. (Source)  Akamai analyzes 75 million events per day to better target advertisements. (Source)  94% of Hadoop users perform analytics on large volumes of data not possible before; 88% analyze data in greater detail; while 82% can now retain more of their data. (Source) 7
  • 8.  Walmart handles more than 1 million customer transactions every hour, which is imported into databases estimated to contain more than 2.5 petabytes of data. (Source)  More than 5 billion people are calling, texting, tweeting and browsing on mobile phones worldwide. (Source)  Decoding the human genome originally took 10 years to process; now it can be achieved in one week. (Source)  In 2008, Google was processing 20,000 terabytes of data (20 petabytes) a day. (Source) The largest AT&T database boasts titles including the largest volume of data in one unique database (312 terabytes) and the second largest number of rows in a unique 8
  • 9. The Rapid Growth of Unstructured Data YouTube users upload 48 hours of new video every minute of the day. (Source) 571 new websites are created every minute of the day. (Source) Brands and organizations on Facebook receive 34,722 Likes every minute of the day. (Source) 100 terabytes of data uploaded daily to Facebook. (Source) According to Twitter’s own research in early 2012, it sees roughly 175 million tweets every day, and has more than 465 million accounts. (Source) 30 Billion pieces of content shared on Facebook every month. (Source) Data production will be 44 times greater in 2020 than it was in 2009. (Source) 9
  • 10. The Rapid Growth of Unstructured Data In late 2011, IDC Digital Universe published a report indicating that some 1.8 zettabytes of data will be created that year. (Source) In other words, the amount of data in the world today is equal to: Every person in the US tweeting three tweets per minute for 26,976 years. Every person in the world having more than 215m high-resolution MRI scans a day. More than 200bn HD movies – which would take a person 47m years to watch. 10
  • 11. Decimal Value Metric 1000 kB kilobyte 10002 MB megabyte 10003 GB gigabyte 10004 TB terabyte 10005 PB petabyte 10006 EB exabyte 10007 ZB zettabyte 10008 YB yottabyte 11
  • 12. Social media and networks (all of us are generating data) Scientific instruments (collecting all sorts of data) Mobile devices (tracking all objects all the time Sensor technology and networks (measuring all kinds of data) 12
  • 13. • No single standard definition… Big Data 13
  • 14. 14
  • 15. 15
  • 16. What to do with these data? 16
  • 17. How much data? 640K ought to be enough for anybody. 17
  • 18. Why Big Data • Key enablers of appearance and growth of Big Data are –Increase of storage capacities –Increase of processing power –Availability of data –Every day we create 2.5 quintillion bytes of data; 90% of the data in the world today has been created in the last two years alone 18
  • 19. Big Data Analytics • Examining large amount of data • Appropriate information • Identification of hidden patterns, unknown correlations • Competitive advantage • Better business decisions: strategic and operational • Effective marketing, customer satisfaction, increased revenue 19
  • 20. Applications for Big Data Analytics Homeland Security FinanceSmarter Healthcare Multi-channel sales Telecom Manufacturing Traffic Control Trading Analytics Fraud and Risk Log Analysis Search Quality Retail: Churn, NBO 20
  • 21. Healthcare • 80% of medical data is unstructured and is clinically relevant • Data resides in multiple places like individual EMRs, lab and imaging systems, physician notes, medical correspondence, claims etc • Leveraging Big Data • Build sustainable healthcare systems • Collaborate to improve care and outcomes • Increase access to healthcare 21
  • 22. Market Size Source:WikibonTaming Big Data By 2015 4.4 million IT jobs in Big Data ; 1.9 million is in US itself 22
  • 23. PotentialTalent Pool -Big Data India will require a minimum of 1 lakh data scientists in the next couple of years in addition to data analysts and data managers to support the Big Data space. 23
  • 24. 24
  • 25. Future of Big Data 25
  • 26. Big DataAnalyticsTechnologies NoSQL : non-relational or at least non-SQL database solutions such as HBase (also a part of the Hadoop ecosystem), Cassandra, MongoDB, Riak, CouchDB, and many others. Hadoop: It is an ecosystem of software packages, including MapReduce, HDFS, and a whole host of other software packages 26
  • 27. Main Big DataTechnologies Hadoop NoSQL Databases Analytic Databases Hadoop • Low cost, reliable scale-out architecture • Distributed computing Proven success in Fortune 500 companies • Exploding interest NoSQL Databases • Huge horizontal scaling and high availability • Highly optimized for retrieval and appending • Types • Document stores • Key Value stores • Graph databases Analytic RDBMS • Optimized for bulk-load and fast aggregate query workloads • Types • Column-oriented • MPP • In-memory 27
  • 29. 29